🔬 Deep Research con API Local y DuckDuckGo

Este README explica cómo configurar y usar el sistema Deep Research con tu API local (LM Studio) y DuckDuckGo como motor de búsqueda, sin necesidad de APIs externas costosas.

🚀 Características

✅ API Local: Usa tu modelo local (LM Studio) en lugar de APIs externas
✅ Búsqueda Gratuita: DuckDuckGo como motor de búsqueda sin costos
✅ Sin Autenticación: Configuración simplificada para desarrollo local
✅ Investigación Profunda: Sistema completo de investigación automatizada
✅ Interfaz Web: LangGraph Studio para interacción fácil

📋 Requisitos Previos

1. LM Studio

Instala LM Studio
Carga un modelo compatible (ej: openai/gpt-oss-20b)
Inicia el servidor en localhost:1234

2. Python y Dependencias

# Instalar dependencias
uv sync
# o
pip install -r requirements.txt

⚙️ Configuración

1. Archivo de Configuración

El archivo config_local.env contiene toda la configuración necesaria:

# Configuración del modelo local (localhost:1234)
RESEARCH_MODEL=openai:http://localhost:1234/v1
SUMMARIZATION_MODEL=openai:http://localhost:1234/v1
COMPRESSION_MODEL=openai:http://localhost:1234/v1
FINAL_REPORT_MODEL=openai:http://localhost:1234/v1

# API Key para el modelo local - usar cualquier valor ya que tu API acepta cualquier clave
OPENAI_API_KEY=test-key

# Configuración de búsqueda - usar DuckDuckGo en lugar de Tavily
SEARCH_API=duckduckgo

# Configuración de tokens
RESEARCH_MODEL_MAX_TOKENS=10000
SUMMARIZATION_MODEL_MAX_TOKENS=8192
COMPRESSION_MODEL_MAX_TOKENS=8192
FINAL_REPORT_MODEL_MAX_TOKENS=10000

# Configuración de investigación
MAX_CONCURRENT_RESEARCH_UNITS=3
MAX_RESEARCHER_ITERATIONS=4
MAX_REACT_TOOL_CALLS=8
MAX_STRUCTURED_OUTPUT_RETRIES=3

# Configuración de contenido
MAX_CONTENT_LENGTH=50000

# Configuración de clarificación
ALLOW_CLARIFICATION=true

2. Verificar API Local

Antes de usar el sistema, verifica que tu API local esté funcionando:

curl http://localhost:1234/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/gpt-oss-20b",
    "messages": [
      {"role": "user", "content": "Hello"}
    ],
    "temperature": 0.7,
    "max_tokens": 10,
    "stream": false
}'

🚀 Uso

Opción 1: Script de Inicio Automático

# Hacer ejecutable
chmod +x start_local_dev.sh

# Ejecutar
./start_local_dev.sh

Opción 2: Comando Manual

# Cargar configuración
export $(grep -v '^#' config_local.env | xargs)

# Iniciar servidor
uvx --refresh --from "langgraph-cli[inmem]" --with-editable . --python 3.11 langgraph dev --allow-blocking --auth ./src/security/auth_local.py:auth

Opción 3: Sin Autenticación (Recomendado para desarrollo)

# Usar autenticación local simplificada
uvx --refresh --from "langgraph-cli[inmem]" --with-editable . --python 3.11 langgraph dev --allow-blocking --auth ./src/security/auth_local.py:auth

🌐 Acceso a la Interfaz

Una vez iniciado el servidor:

API: http://127.0.0.1:2024
Studio UI: https://smith.langchain.com/studio/?baseUrl=http://127.0.0.1:2024
API Docs: http://127.0.0.1:2024/docs

Configuración en la Interfaz

Abre el Studio UI
Ve a "Manage Assistants"
Configura:
- Search API: DuckDuckGo
- Research Model: openai:http://localhost:1234/v1
- Summarization Model: openai:http://localhost:1234/v1
- Compression Model: openai:http://localhost:1234/v1
- Final Report Model: openai:http://localhost:1234/v1

🧪 Pruebas

Script de Prueba Automático

python3 test_local_config.py

Este script verifica:

✅ Conexión a API local
✅ Funcionamiento de DuckDuckGo
✅ Investigación completa end-to-end

Prueba Manual

import asyncio
from open_deep_research.deep_researcher import deep_researcher

async def test_research():
    config = {
        "configurable": {
            "research_model": "openai:http://localhost:1234/v1",
            "summarization_model": "openai:http://localhost:1234/v1",
            "compression_model": "openai:http://localhost:1234/v1",
            "final_report_model": "openai:http://localhost:1234/v1",
            "search_api": "duckduckgo",
            "max_concurrent_research_units": 3,
            "max_researcher_iterations": 4,
            "max_react_tool_calls": 8,
            "max_structured_output_retries": 3,
            "max_content_length": 50000,
            "allow_clarification": True
        }
    }
    
    result = await deep_researcher.ainvoke(
        {"messages": [{"role": "user", "content": "¿Cuáles son las últimas noticias sobre IA?"}]},
        config=config
    )
    
    print(result["final_report"])

# Ejecutar
asyncio.run(test_research())

🔧 Personalización

Cambiar Modelo Local

Edita config_local.env:

# Cambiar a otro modelo
RESEARCH_MODEL=openai:http://localhost:1234/v1
# El modelo real se especifica en la URL de tu API local

Ajustar Parámetros de Investigación

# Más unidades de investigación concurrentes (más rápido, más recursos)
MAX_CONCURRENT_RESEARCH_UNITS=5

# Más iteraciones de investigación (más profundo)
MAX_RESEARCHER_ITERATIONS=6

# Más llamadas de herramientas por iteración
MAX_REACT_TOOL_CALLS=10

Cambiar Motor de Búsqueda

# Usar Tavily (requiere API key)
SEARCH_API=tavily

# Usar búsqueda nativa de OpenAI
SEARCH_API=openai

# Usar búsqueda nativa de Anthropic
SEARCH_API=anthropic

# Sin búsqueda (solo modelo local)
SEARCH_API=none

🐛 Solución de Problemas

Error: "Authorization header missing"

Problema: El servidor requiere autenticación.

Solución: Usa el script con autenticación local:

./start_local_dev.sh

Error: "Incorrect API key provided"

Problema: Tu API local rechaza la clave API.

Solución: Verifica que tu API local no requiera autenticación:

curl http://localhost:1234/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "openai/gpt-oss-20b", "messages": [{"role": "user", "content": "test"}]}'

Error: "No tools found to conduct research"

Problema: DuckDuckGo no está configurado correctamente.

Solución: Verifica la configuración:

SEARCH_API=duckduckgo

Error: "Connection refused"

Problema: LM Studio no está ejecutándose.

Solución:

Abre LM Studio
Carga un modelo
Inicia el servidor en puerto 1234

📊 Rendimiento

Recursos del Sistema

RAM: ~12-15 GB (dependiendo del modelo)
GPU: Recomendado para modelos grandes
CPU: Mínimo 8 cores para buen rendimiento

Optimizaciones

# Reducir concurrencia para sistemas con menos recursos
MAX_CONCURRENT_RESEARCH_UNITS=2

# Reducir tokens para ahorrar memoria
RESEARCH_MODEL_MAX_TOKENS=5000
SUMMARIZATION_MODEL_MAX_TOKENS=4000

🔒 Seguridad

Desarrollo Local

✅ No se envían datos a servicios externos
✅ Búsquedas a través de DuckDuckGo (sin tracking)
✅ Modelo ejecutándose localmente

Producción

Para uso en producción, considera:

Configurar autenticación real
Usar HTTPS
Implementar rate limiting
Monitoreo de logs

📚 Ejemplos de Uso

Investigación de Noticias

pregunta = "¿Cuáles son las últimas noticias sobre inteligencia artificial en 2024?"

Análisis de Mercado

pregunta = "¿Cuál es el estado actual del mercado de criptomonedas?"

Investigación Científica

pregunta = "¿Cuáles son los últimos avances en computación cuántica?"

🤝 Contribuir

Fork el repositorio
Crea una rama para tu feature
Haz commit de tus cambios
Push a la rama
Abre un Pull Request

📄 Licencia

Este proyecto está bajo la licencia MIT. Ver LICENSE para más detalles.

🙏 Agradecimientos

LangChain - Framework de IA
LangGraph - Grafo de agentes
LM Studio - Servidor local de modelos
DuckDuckGo - Motor de búsqueda privado

¡Disfruta investigando con tu propio sistema de IA local! 🚀

🔬 Open Deep Research (Documentación Original)

Deep research has broken out as one of the most popular agent applications. This is a simple, configurable, fully open source deep research agent that works across many model providers, search tools, and MCP servers. It's performance is on par with many popular deep research agents (see Deep Research Bench leaderboard).

🔥 Recent Updates

August 14, 2025: See our free course here (and course repo here) on building open deep research.

August 7, 2025: Added GPT-5 and updated the Deep Research Bench evaluation w/ GPT-5 results.

August 2, 2025: Achieved #6 ranking on the Deep Research Bench Leaderboard with an overall score of 0.4344.

July 30, 2025: Read about the evolution from our original implementations to the current version in our blog post.

July 16, 2025: Read more in our blog and watch our video for a quick overview.

🚀 Quickstart

Clone the repository and activate a virtual environment:

git clone https://github.com/langchain-ai/open_deep_research.git
cd open_deep_research
uv venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install dependencies:

uv sync
# or
uv pip install -r pyproject.toml

Set up your .env file to customize the environment variables (for model selection, search tools, and other configuration settings):

cp .env.example .env

Launch agent with the LangGraph server locally:

# Install dependencies and start the LangGraph server
uvx --refresh --from "langgraph-cli[inmem]" --with-editable . --python 3.11 langgraph dev --allow-blocking

This will open the LangGraph Studio UI in your browser.

- 🚀 API: http://127.0.0.1:2024
- 🎨 Studio UI: https://smith.langchain.com/studio/?baseUrl=http://127.0.0.1:2024
- 📚 API Docs: http://127.0.0.1:2024/docs

Ask a question in the messages input field and click Submit. Select different configuration in the "Manage Assistants" tab.

⚙️ Configurations

LLM 🧠

Open Deep Research supports a wide range of LLM providers via the init_chat_model() API. It uses LLMs for a few different tasks. See the below model fields in the configuration.py file for more details. This can be accessed via the LangGraph Studio UI.

Summarization (default: openai:gpt-4.1-mini): Summarizes search API results
Research (default: openai:gpt-4.1): Power the search agent
Compression (default: openai:gpt-4.1): Compresses research findings
Final Report Model (default: openai:gpt-4.1): Write the final report

Note: the selected model will need to support structured outputs and tool calling.

Note: For OpenRouter: Follow this guide and for local models via Ollama see setup instructions.

Search API 🔍

Open Deep Research supports a wide range of search tools. By default it uses the Tavily search API. Has full MCP compatibility and work native web search for Anthropic and OpenAI. See the search_api and mcp_config fields in the configuration.py file for more details. This can be accessed via the LangGraph Studio UI.

Other

See the fields in the configuration.py for various other settings to customize the behavior of Open Deep Research.

📊 Evaluation

Open Deep Research is configured for evaluation with Deep Research Bench. This benchmark has 100 PhD-level research tasks (50 English, 50 Chinese), crafted by domain experts across 22 fields (e.g., Science & Tech, Business & Finance) to mirror real-world deep-research needs. It has 2 evaluation metrics, but the leaderboard is based on the RACE score. This uses LLM-as-a-judge (Gemini) to evaluate research reports against a golden set of reports compiled by experts across a set of metrics.

Usage

Warning: Running across the 100 examples can cost ~$20-$100 depending on the model selection.

The dataset is available on LangSmith via this link. To kick off evaluation, run the following command:

# Run comprehensive evaluation on LangSmith datasets
python tests/run_evaluate.py

This will provide a link to a LangSmith experiment, which will have a name YOUR_EXPERIMENT_NAME. Once this is done, extract the results to a JSONL file that can be submitted to the Deep Research Bench.

python tests/extract_langsmith_data.py --project-name "YOUR_EXPERIMENT_NAME" --model-name "you-model-name" --dataset-name "deep_research_bench"

This creates tests/expt_results/deep_research_bench_model-name.jsonl with the required format. Move the generated JSONL file to a local clone of the Deep Research Bench repository and follow their Quick Start guide for evaluation submission.

Results

Name	Commit	Summarization	Research	Compression	Total Cost	Total Tokens	RACE Score	Experiment
GPT-5	ca3951d	openai:gpt-4.1-mini	openai:gpt-5	openai:gpt-4.1		204,640,896	0.4943	Link
Defaults	6532a41	openai:gpt-4.1-mini	openai:gpt-4.1	openai:gpt-4.1	$45.98	58,015,332	0.4309	Link
Claude Sonnet 4	f877ea9	openai:gpt-4.1-mini	anthropic:claude-sonnet-4-20250514	openai:gpt-4.1	$187.09	138,917,050	0.4401	Link
Deep Research Bench Submission	c0a160b	openai:gpt-4.1-nano	openai:gpt-4.1	openai:gpt-4.1	$87.83	207,005,549	0.4344	Link

🚀 Deployments and Usage

LangGraph Studio

Follow the quickstart to start LangGraph server locally and test the agent out on LangGraph Studio.

Hosted deployment

You can easily deploy to LangGraph Platform.

Open Agent Platform

Open Agent Platform (OAP) is a UI from which non-technical users can build and configure their own agents. OAP is great for allowing users to configure the Deep Researcher with different MCP tools and search APIs that are best suited to their needs and the problems that they want to solve.

We've deployed Open Deep Research to our public demo instance of OAP. All you need to do is add your API Keys, and you can test out the Deep Researcher for yourself! Try it out here

You can also deploy your own instance of OAP, and make your own custom agents (like Deep Researcher) available on it to your users.

Legacy Implementations 🏛️

The src/legacy/ folder contains two earlier implementations that provide alternative approaches to automated research. They are less performant than the current implementation, but provide alternative ideas understanding the different approaches to deep research.

1. Workflow Implementation (`legacy/graph.py`)

Plan-and-Execute: Structured workflow with human-in-the-loop planning
Sequential Processing: Creates sections one by one with reflection
Interactive Control: Allows feedback and approval of report plans
Quality Focused: Emphasizes accuracy through iterative refinement

2. Multi-Agent Implementation (`legacy/multi_agent.py`)

Supervisor-Researcher Architecture: Coordinated multi-agent system
Parallel Processing: Multiple researchers work simultaneously
Speed Optimized: Faster report generation through concurrency
MCP Support: Extensive Model Context Protocol integration

Name		Name	Last commit message	Last commit date
Latest commit History 199 Commits
.github/workflows		.github/workflows
examples		examples
src		src
tests		tests
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
config_local.env		config_local.env
langgraph.json		langgraph.json
pyproject.toml		pyproject.toml
start_local.sh		start_local.sh
start_local_dev.sh		start_local_dev.sh
test_local_config.py		test_local_config.py
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

🔬 Deep Research con API Local y DuckDuckGo

🚀 Características

📋 Requisitos Previos

1. LM Studio

2. Python y Dependencias

⚙️ Configuración

1. Archivo de Configuración

2. Verificar API Local

🚀 Uso

Opción 1: Script de Inicio Automático

Opción 2: Comando Manual

Opción 3: Sin Autenticación (Recomendado para desarrollo)

🌐 Acceso a la Interfaz

Configuración en la Interfaz

🧪 Pruebas

Script de Prueba Automático

Prueba Manual

🔧 Personalización

Cambiar Modelo Local

Ajustar Parámetros de Investigación

Cambiar Motor de Búsqueda

🐛 Solución de Problemas

Error: "Authorization header missing"

Error: "Incorrect API key provided"

Error: "No tools found to conduct research"

Error: "Connection refused"

📊 Rendimiento

Recursos del Sistema

Optimizaciones

🔒 Seguridad

Desarrollo Local

Producción

📚 Ejemplos de Uso

Investigación de Noticias

Análisis de Mercado

Investigación Científica

🤝 Contribuir

📄 Licencia

🙏 Agradecimientos

🔬 Open Deep Research (Documentación Original)

🔥 Recent Updates

🚀 Quickstart

⚙️ Configurations

LLM 🧠

Search API 🔍

Other

📊 Evaluation

Usage

Results

🚀 Deployments and Usage

LangGraph Studio

Hosted deployment

Open Agent Platform

Legacy Implementations 🏛️

1. Workflow Implementation (legacy/graph.py)

2. Multi-Agent Implementation (legacy/multi_agent.py)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. Workflow Implementation (`legacy/graph.py`)

2. Multi-Agent Implementation (`legacy/multi_agent.py`)

Packages