| --- |
| license: mit |
| language: en |
| pipeline_tag: text-generation |
| tags: |
| - video-understanding |
| - narrative-generation |
| - generative-ai |
| - multi-agent |
| - stateful-ai |
| - prompt-engineering |
| - found-protocol |
| - creator-economy |
| - data-sovereignty |
| - web3 |
| base_model: |
| - google/gemini-pro-vision |
| - google/gemini-pro |
| datasets: |
| - FOUND-LABS/found_consciousness_log |
| --- |
| |
| <div align="center"> |
| <img src="https://res.cloudinary.com/dykojggih/image/upload/v1753377308/IMG_4287_imd6zd.png" width="100px" alt="FOUND LABS Logo"> |
| <h1>The FOUND Protocol</h1> |
| <p><b>The Open-Source Engine for the Consciousness Economy</b></p> |
| |
| <div> |
| <a href="https://huggingface.co/FOUND-LABS"><img src="https://img.shields.io/badge/Organization-FOUND%20LABS-purple" alt="Organization"></a> |
| <a href="https://huggingface.co/FOUND-LABS/found_consciousness_log"><img src="https://img.shields.io/badge/Dataset-Consciousness%20Log-blue" alt="Dataset"></a> |
| <a href="https://foundprotocol.xyz"><img src="https://img.shields.io/badge/Platform-Join%20Waitlist-brightgreen" alt="Join Waitlist"></a> |
| </div> |
| </div> |
| |
| --- |
|
|
| ## Abstract |
|
|
| Current video understanding models excel at semantic labeling but fail to capture the pragmatic and thematic progression of visual narratives. We introduce **FOUND (Forensic Observer and Unified Narrative Deducer)**, a stateful architecture that extracts coherent emotional and thematic arcs from a sequence of disparate video inputs. This protocol is the foundational engine for the **[FOUND Platform](https://foundprotocol.xyz)**, a decentralized creator economy in which individuals own, control, and monetize their authentic human experiences as AI training data. 
|
|
| --- |
|
|
| ## From Open-Source Research to a New Economy |
|
|
| The FOUND Protocol is more than an academic exercise; it is the core technology powering a new paradigm for the creator economy. |
|
|
| - **The Problem:** AI companies harvest your data to train their models, reaping all the rewards. You, the creator of the data, get nothing. |
| - **Our Solution:** The FOUND Protocol transforms your raw visual moments into structured, high-value data assets. Our upcoming **FOUND Platform** will allow you to contribute this data, maintain ownership via your own wallet, and earn from its usage by AI companies. |
|
|
| **This open-source model is the proof. The FOUND Platform is the promise.** |
|
|
| --- |
|
|
| ## Model Architecture |
|
|
| The FOUND Protocol is a composite **inference pipeline** designed to simulate a stateful consciousness. It comprises two specialized agents that interact in a continuous feedback loop, plus a state manager that carries context from one video to the next: 
|
|
| - **The Perceptor (`/dev/eye`):** A forensic analysis model (FOUND-1) that translates raw visual data into structured, symbolic JSON output. 
| - **The Interpreter (`/dev/mind`):** A contextual state model (FOUND-2) that operates on the structured output of the Perceptor and the historical system log to resolve "errors" into emotional or thematic concepts. |
| - **The Narrative State Manager:** A stateful object that maintains the "long-term memory" of the system, allowing its interpretations to evolve. |
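The feedback loop described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: `NarrativeState`, `run_pipeline`, and the two stand-in agent functions are hypothetical names, and the real Perceptor and Interpreter would call a vision or language model where the stubs return canned values.

```python
import json
from dataclasses import dataclass, field
from typing import Callable, Iterable


@dataclass
class NarrativeState:
    """Hypothetical long-term memory: an append-only log of past results."""

    log: list = field(default_factory=list)

    def update(self, video: str, perception: dict, interpretation: str) -> None:
        self.log.append(
            {"video": video, "perception": perception, "interpretation": interpretation}
        )

    def history(self) -> str:
        # Serialize the log so it can be handed to the Interpreter as context.
        return json.dumps(self.log, indent=2)


def run_pipeline(
    videos: Iterable[str],
    perceive: Callable[[str], dict],
    interpret: Callable[[dict, str], str],
) -> NarrativeState:
    """Run Perceptor -> Interpreter for each video, feeding history forward."""
    state = NarrativeState()
    for video in videos:
        perception = perceive(video)  # Perceptor: video -> structured JSON
        interpretation = interpret(perception, state.history())  # Interpreter
        state.update(video, perception, interpretation)  # State manager
    return state


# Stand-in agents (assumptions) so the sketch runs without any API access.
def fake_perceptor(video: str) -> dict:
    return {"source": video, "errors": ["UNRESOLVED_SIGNAL"]}


def fake_interpreter(perception: dict, history: str) -> str:
    seen = len(json.loads(history))  # how many videos came before this one
    return f"interpretation #{seen + 1} of {perception['source']}"


state = run_pipeline(["a.mp4", "b.mp4"], fake_perceptor, fake_interpreter)
print(state.history())
```

Passing the agents in as plain callables keeps the loop independent of any particular model backend, which is one way the pipeline could later swap external APIs for its own models.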
|
|
| --- |
|
|
| ## How to Use This Pipeline |
|
|
| ### 1. Setup |
|
|
| Clone this repository and install the required dependencies into a Python virtual environment. |
| ```bash |
| git clone https://huggingface.co/FOUND-LABS/found_protocol |
| cd found_protocol |
| python3 -m venv venv |
| source venv/bin/activate |
| pip install -r requirements.txt |
| ``` |
|
|
| ### 2. Configuration |
| Set your Google Gemini API key as an environment variable (e.g., in a `.env` file): 
| ``` |
| GEMINI_API_KEY="your-api-key-goes-here" |
| ``` |
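As a rough sketch of how a script might read this key, the snippet below looks up `GEMINI_API_KEY` and fails with a readable message when it is missing. The helper name `load_api_key` is hypothetical and not part of this repository. A `.env` file is not read by Python automatically, so this sketch assumes the variable has already been exported or loaded by a tool such as python-dotenv.

```python
import os
from typing import Mapping, Optional


def load_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the Gemini API key from the environment (hypothetical helper)."""
    # Accept an explicit mapping so the function is easy to test in isolation.
    env = os.environ if env is None else env
    key = env.get("GEMINI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "GEMINI_API_KEY is not set; export it or load it from your .env file."
        )
    return key


if __name__ == "__main__":
    try:
        load_api_key()
        print("GEMINI_API_KEY found.")
    except RuntimeError as exc:
        print(f"Configuration error: {exc}")
```

Checking for the key up front surfaces a configuration problem before any video is processed.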
|
|
| ### 3. Usage via CLI |
| Analyze all videos in a directory sequentially: |
| ```bash |
| python main.py path/to/your/video_directory/ |
| ``` |
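Because the pipeline is stateful, the order in which videos are processed affects the resulting narrative. The sketch below shows one plausible way to collect a directory's videos in a deterministic order. The function `list_videos`, the extension set, and the sort-by-filename rule are all assumptions for illustration; `main.py` may select and order files differently.

```python
from pathlib import Path

# Assumed set of video extensions; the real CLI may accept others.
VIDEO_EXTENSIONS = {".mp4", ".mov", ".avi", ".mkv"}


def list_videos(directory: str) -> list:
    """Return video files directly inside `directory`, sorted by path."""
    root = Path(directory)
    if not root.is_dir():
        raise NotADirectoryError(f"{directory} is not a directory")
    return sorted(
        p
        for p in root.iterdir()
        if p.is_file() and p.suffix.lower() in VIDEO_EXTENSIONS
    )
```

If the intended narrative order differs from alphabetical order, prefixing filenames with a sequence number (for example `01_`, `02_`) would make the order explicit under this assumption.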
|
|
| ## Future Development: The Path to the Platform |
| This open-source protocol is the first step in our public roadmap. The data it generates is the key to our future. |
| - **Dataset Growth:** We are using this protocol to build the `found_consciousness_log`, the world's first open dataset for thematic video understanding. 
| - **Model Sovereignty:** This dataset will be used to fine-tune our own open-source models (`found-perceptor-v1` and `found-interpreter-v1`), removing the dependency on external APIs and creating a fully community-owned intelligence layer. 
| - **Platform Launch:** These sovereign models will become the core engine of the FOUND Platform, allowing for decentralized, low-cost data processing at scale. |
|
|
| ➡️ Follow our journey and join the waitlist at [foundprotocol.xyz](https://foundprotocol.xyz) 
|
|
| ## Citing this Work |
| If you use the FOUND Protocol in your research, please use the following BibTeX entry. |
| ```bibtex |
| @misc{found_protocol_2025, |
| author = {FOUND LABS Community}, |
| title = {FOUND Protocol: A Symbiotic Dual-Agent Architecture for the Consciousness Economy}, |
| year = {2025}, |
| publisher = {Hugging Face}, |
| journal = {Hugging Face repository}, |
| howpublished = {\url{https://huggingface.co/FOUND-LABS/found_protocol}} |
| } |
| ``` |