init

2025-08-04 00:20:29 +08:00
commit 03f2c49b7e
1173 changed files with 102921 additions and 0 deletions
--- a/docs/core/concepts.md
+++ b/docs/core/concepts.md
@@ -0,0 +1,61 @@
+---
+title: "Concepts"
+description: "Understand the core terms and features that make Eigent unique."
+icon: "key"
+---
+
+## Workers
+
+Autonomous agents tailored to specific roles that run tasks independently or together. Think of them as individual members of your team, like a "Researcher," a "Programmer," or a "Writer."
+
+Each Worker is designed with specific capabilities and can be customized to handle particular types of tasks efficiently.
+
+![Workers concept illustration](/docs/images/concepts_worker.png)
+
+## Workforce
+
+A coordinated team of Workers that collaborate to complete complex workflows. Think of it as your AI project team.
+
+The Workforce orchestrates multiple Workers, ensuring they work together seamlessly to achieve your goals.
+
+![Workforce collaboration illustration](/docs/images/concepts_workforce.gif)
+
+## Workspace
+
+A live window into a Worker's process where you can watch or take control. For example, a terminal, a browser, or a file viewer.
+
+Workspaces provide real-time visibility into what your Workers are doing, allowing you to monitor progress and intervene when needed.
+
+![Workspace interface illustration](/docs/images/concepts_workspace.gif)
+
+## Tasks & Subtasks
+
+You define a mission (task), the Workforce breaks it into components (subtasks), and assigns them to the appropriate Workers.
+
+This hierarchical approach ensures complex projects are broken down into manageable pieces and executed efficiently.
+
+![Tasks and subtasks breakdown illustration](/docs/images/concepts_tasks_subtasks.gif)
+
+## Chat
+
+Your primary interface for communicating with your Workforce. You use it to define your main Task, sharing files and interacting with agents in real time.
+
+The Chat interface serves as your command center, where you can give instructions, ask questions, and receive updates from your AI team.
+
+![Chat interface illustration](/docs/images/concepts_chat.png)
+
+## MCP
+
+Model Context Protocol that allows Workers to use external tools. It connects your agents to databases, APIs, and documentation sources, empowering them to act across platforms.
+
+MCP extends your Workers' capabilities by providing access to real-world data and tools, making them more powerful and versatile.
+
+![MCP protocol illustration](/docs/images/concepts_mcp.png)
+
+## Models
+
+Different AI "brains" that power your Workers. Eigent allows you to choose from various models (like GPT-4.1 or Gemini 2.5 Pro), each with different strengths in speed, reasoning, and cost.
+
+Choose the right model for each task based on your specific needs for performance, accuracy, or cost efficiency.
+
+![AI models illustration](/docs/images/concepts_models.png)
--- a/docs/core/models.md
+++ b/docs/core/models.md
@@ -0,0 +1,91 @@
+---
+title: Models
+description: Configure and deploy your preferred LLM models with Eigent.
+icon: server
+---
+
+Eigent supports flexible integration and deployment of top LLMs and multimodal models. You can follow the steps below to set up your preferred LLM models.
+
+1. Click Settings
+
+![click_settings](/docs/images/models_settings.png)
+
+2. Close Eigent Cloud Version
+
+![close_eigent](/docs/images/models_close.png)
+
+3. Configure your APIKEY and Model Type
+
+![configure_api](/docs/images/models_configure_models.png)
+
+4. Configure the Google Search toolkit
+
+![configure_searchtools](/docs/images/models_configure_tools.png)
+
+![configure_searchtoolsapi](/docs/images/models_configure_tools_key.png)
+
+You can refer to the following document for detailed information on how to configure **GOOGLE_API_KEY** and **SEARCH_ENGINE_ID :** https://developers.google.com/custom-search/v1/overview
+
+Now，start enjoying Eigent！
+
+## **Self-Host Model**
+
+1. Configure your self-host model
+
+First, you need to set up your local models and expose them as an **OpenAI-Compatible Server.**
+
+```bash
+#Vllm https://docs.vllm.ai/en/latest/getting_started/quickstart.html#openai-compatible-server
+vllm serve Qwen/Qwen2.5-1.5B-Instruct
+```
+
+```python
+#Sglang https://docs.sglang.ai/backend/openai_api_completions.html
+from sglang.test.test_utils import is_in_ci
+
+if is_in_ci():
+    from patch import launch_server_cmd
+else:
+    from sglang.utils import launch_server_cmd
+
+from sglang.utils import wait_for_server, print_highlight, terminate_process
+
+server_process, port = launch_server_cmd(
+    "python3 -m sglang.launch_server --model-path qwen/qwen2.5-0.5b-instruct --host 0.0.0.0 --mem-fraction-static 0.8"
+)
+
+wait_for_server(f"http://localhost:{port}")
+print(f"Server started on http://localhost:{port}")
+```
+
+```bash
+#Ollama https://github.com/ollama/ollama
+ollama pull qwen2.5:7b
+```
+
+2. Setting your model
+
+![set_local_model](/docs/images/models_local_model.png)
+
+3. Configure the Google Search toolkit
+
+![configure_searchtools](/docs/images/models_configure_tools.png)
+
+![configure_searchtoolsapi](/docs/images/models_configure_tools_key.png)
+You can refer to the following document for detailed information on how to configure **GOOGLE_API_KEY** and **SEARCH_ENGINE_ID :** https://developers.google.com/custom-search/v1/overview
+
+## **API KEY Reference**
+
+Gemini: https://ai.google.dev/gemini-api/docs/api-key
+
+OpenAI: https://platform.openai.com/api-keys
+
+Anthropic: https://console.anthropic.com/
+
+Qwen:https://www.alibabacloud.com/help/en/model-studio/get-api-key
+
+Deepseek: https://platform.deepseek.com/api_keys
+
+AWS Bedrock:https://github.com/aws-samples/bedrock-access-gateway/blob/main/README.md
+
+Azure:https://azure.microsoft.com/products/cognitive-services/openai-service/
--- a/docs/core/tools.md
+++ b/docs/core/tools.md
@@ -0,0 +1,29 @@
+---
+title: Tools
+description: Extend Eigent with MCP servers and tools
+icon: plug
+---
+
+**Add your MCP server**
+
+**You can follow the steps below to set up your model**
+
+1. Click Settings
+
+![click_settings](/docs/images/models_settings.png)
+
+2. Click Add MCP Server
+
+![add_mcp](/docs/images/tools_add_mcp.png)
+
+3. Configure Your MCP Server and install
+
+![configure_mcp](/docs/images/tools_configure_mcp.png)
+
+4.Add external servers to your own Agent
+
+- You can check the installed mcp server in the Added external servers column
+
+![check_mcp](/docs/images/tools_check.png)
+
+- After configuring your mcp server, you can add it to a Custom  Agent.
--- a/docs/core/workforce.md
+++ b/docs/core/workforce.md
@@ -0,0 +1,216 @@
+---
+title: Workforce
+description: Understand core concepts and features that power Eigent.
+icon: cubes
+---
+## Concept: What is a Workforce?
+
+Workforce is CAMEL-AI's multi-agent teamwork engine. 
+
+Instead of relying on a single agent, Workforce lets you organize a *team* of specialized agents—each with its own strengths—under a single, coordinated system. You can quickly assemble, configure, and launch collaborative agent "workforces" for any task that needs parallelization, diverse expertise, or complex workflows.
+
+With Workforce, agents plan, solve, and verify work together—like a project team in an organization, but fully automated.
+
+
+<aside>
+**📌 Key advantages of Workforce:**
+
+- Instantly scale from single-agent to multi-agent workflows
+- Built-in coordination, planning, and failure recovery
+- Ideal for hackathons, evaluations, code review, brainstorming, and more
+- Configure roles, toolsets, and communication patterns for *any* scenario
+</aside>
+
+## System Design
+
+### **Architecture: How Workforce Works**
+
+Workforce uses a **hierarchical, modular design** for real-world team problem-solving.
+![Workforce](/docs/images/workforce.jpg)
+
+See how the coordinator and task planner agents orchestrate a multi-agent workflow:
+
+- **Workforce:** The “team” as a whole.
+- **Worker nodes:** Individual contributors—each node can contain one or more agents, each with their own capabilities.
+- **Coordinator agent:** The “project manager”—routes tasks to worker nodes based on their role and skills.
+- **Task planner agent:** The “strategy lead”—breaks down big jobs into smaller, doable subtasks and organizes the workflow.
+
+### **Communication: A Shared Task Channel**
+
+Every Workforce gets a **shared task channel** when it’s created.
+**How it works:**
+- All tasks are posted into this channel.
+- Worker nodes “listen” and accept their assigned tasks.
+- Results are posted back to the channel, where they’re available as dependencies for the next steps.
+
+*This design lets agents build on each other’s work and ensures no knowledge is lost between steps.*
+
+### **Failure Handling: Built-In Robustness**
+
+Workforce is designed to handle failures and recover gracefully.
+
+If a worker fails a task, the coordinator agent will:
+- **Decompose and retry:** Break the task into even smaller pieces and reassign.
+- **Escalate:** If the task keeps failing, create a new worker designed for that problem.
+To prevent infinite loops, if a task has failed or been decomposed more than a set number of times (default: 3), Workforce will automatically halt that workflow.
+
+<aside>
+
+
+**📌 Tip:** Workforce automatically stops stuck workflows—so you don’t waste compute or get caught in agent loops!
+
+</aside>
+
+## Worker Nodes
+
+Eigent comes with a set of pre-configured agents, each designed for a specific domain of expertise. These agents are equipped with a curated set of toolkits to make them effective right out of the box.
+
+### DeveloperAgent
+
+*A skilled coding assistant that can write and execute code, run terminal commands, and verify solutions to complete tasks.*
+
+**Equipped Toolkits:**
+
+- HumanToolkit
+- TerminalToolkit
+- NoteTakingToolkit
+- WebDeployToolkit
+
+### SearchAgent
+
+*Can search the web, extract webpage content, simulate browser actions, and provide relevant information to solve the given task.*
+
+**Equipped Toolkits:**
+
+- SearchToolkit
+- HybridBrowserToolkit
+- HumanToolkit
+- NoteTakingToolkit
+- TerminalToolkit
+
+### DocumentAgent
+
+*A document processing assistant for creating, modifying, and managing various document formats, including presentations.*
+
+**Equipped Toolkits:**
+
+- FileWriteToolkit
+- PPTXToolkit
+- HumanToolkit
+- MarkItDownToolkit
+- ExcelToolkit
+- NoteTakingToolkit
+- TerminalToolkit
+- GoogleDriveMCPToolkit
+- SearchToolkit
+
+### Multi-ModalAgent
+
+*A multi-modal processing assistant for analyzing and generating media content like audio and images.*
+
+**Equipped Toolkits:**
+
+- VideoDownloaderToolkit
+- AudioAnalysisToolkit
+- ImageAnalysisToolkit
+- OpenAIImageToolkit
+- HumanToolkit
+- TerminalToolkit
+- NoteTakingToolkit
+- SearchToolkit
+
+## Toolkit Reference
+
+Toolkits are the collections of functions that give your agents their powers. Here is a reference for the toolkits used by the pre-configured agents.
+
+*(in alphabetical order)*
+
+### [AudioAnalysisToolkit](https://docs.camel-ai.org/reference/camel.toolkits.audio_analysis_toolkit)
+
+*Provides tools for audio processing and analysis.*
+
+This toolkit allows an agent to process audio files. It can take an audio file (from a local path or URL) and transcribe the speech into text. It can also answer specific questions about the content of an audio file, enabling agents to extract information from podcasts, meetings, or voice notes.
+
+### [ExcelToolkit](https://docs.camel-ai.org/reference/camel.toolkits.excel_toolkit)
+
+*Enables agents to create, read, and manipulate Excel spreadsheets.*
+
+This toolkit provides comprehensive functions for interacting with Excel files (`.xlsx/.xls/. csv`). Agents can create new workbooks, add or delete worksheets, read data from specific cells or ranges, write data to the spreadsheet, and convert data into Markdown formatted table.
+
+### [FileWriteToolkit](https://docs.camel-ai.org/reference/camel.toolkits.file_write_toolkit)
+
+*A toolkit for creating, writing, and modifying text in files.*
+
+This toolkit gives an agent the ability to create and write to files on the local file system (macOS, Linux, Windows). It provides support for writing to various file formats (Markdown, DOCX, PDF, and plaintext), replacing text in existing files, automatic filename uniquification to prevent overwrites, custom encoding and enhanced formatting options for specialized formats.
+
+### [GoogleDriveMCPToolkit](https://docs.camel-ai.org/reference/camel.toolkits.google_drive_mcp_toolkit)
+
+*Connects to Google Drive to manage files and folders.*
+
+This toolkit allows agents to interact with a user's Google Drive. It can read files and folders from Google Drive. It acts as a bridge between the agent's local environment and cloud storage.
+
+### [HumanToolkit](https://docs.camel-ai.org/reference/camel.toolkits.human_toolkit)
+
+*Allows an agent to pause its task and ask the user for help.*
+
+This is a critical toolkit for handling situations that require human intervention. When an agent is stuck, needs credentials, or requires a subjective decision, it can use this toolkit to send a prompt to the user and wait for a response before continuing its task.
+
+### [HybridBrowserToolkit](https://docs.camel-ai.org/reference/camel.toolkits.hybrid_browser_toolkit.ws_wrapper)
+
+*Provides a powerful, stateful browser for web navigation and interaction.*
+
+This toolkit gives an agent a fully-featured web browser that it can control programmatically. Unlike simple web scraping, this toolkit maintains a session, allowing the agent to click, type, hover, screenshot, and live *Take Control* from the UI. 
+
+### [ImageAnalysisToolkit](https://docs.camel-ai.org/reference/camel.toolkits.image_analysis_toolkit)
+
+*Provides tools for understanding the content of images.*
+
+This toolkit enables an agent to "see" and interpret images. It can generate a detailed text description of an image or answer specific questions about what an image contains. This is crucial for tasks that involve visual data, such as describing products, analyzing charts, or identifying objects in a photo.
+
+### [MarkItDownToolkit](https://docs.camel-ai.org/reference/camel.toolkits.markitdown_toolkit)
+
+*A specialized toolkit for converting content into clean Markdown.*
+
+This toolkit is designed to scrape content from a list of local files and convert each into a structured Markdown format. The conversion is performed in parallel for efficiency. Supported file formats include: PDF, Office, EPUB, HTML, Images (ORC), Audio, Text, ZIP.
+
+### [NoteTakingToolkit](https://docs.camel-ai.org/reference/camel.toolkits.note_taking_toolkit)
+
+*A toolkit for managing and interacting with Markdown note files.*
+
+This toolkit provides tools for creating, reading, appending to, and listing notes. All notes are stored as **`.md`** files in a dedicated working directory and are tracked in a registry. An agent can use it to write down any important information. Other agents can then read these notes to get context and build upon previous work, facilitating effective collaboration.
+
+### [OpenAIImageToolkit](https://docs.camel-ai.org/reference/camel.toolkits.openai_image_toolkit)
+
+*Generates images from text prompts using OpenAI's DALL-E models.*
+
+This toolkit allows an agent to create new images based on a descriptive text prompt. It leverages models like DALL-E 3 to generate high-quality visuals, which can then be saved locally. This is essential for creative tasks, generating illustrations for documents, or any workflow requiring original image content.
+
+### [PPTXToolkit](https://docs.camel-ai.org/reference/camel.toolkits.pptx_toolkit)
+
+*Enables agents to create and write Microsoft PowerPoint presentations.*
+
+This toolkit provides a suite of functions for building PowerPoint (`.pptx`) files. An agent can create a new presentation, add title and content slides, format text, create lists, and insert tables and images. It allows for the automated creation of professional-looking presentations.
+
+### [SearchToolkit](https://docs.camel-ai.org/reference/camel.toolkits.search_toolkit)
+
+*Provides access to various web search engines.*
+
+This toolkit is the primary tool for web research. It allows an agent to search information on engines like Google, Wikipedia, Bing, and Baidu. The agent can submit a query and receive a list of relevant URLs and snippets, which it can then use as a starting point for deeper investigation with the `HybridBrowserToolkit`.
+
+### [TerminalToolkit](https://docs.camel-ai.org/reference/camel.toolkits.terminal_toolkit)
+
+*A toolkit for terminal operations across multiple operating systems.*
+
+This toolkit gives an agent access to a command-line interface. It supports terminal operations such as searching for files by name or content, executing shell commands, and managing terminal sessions.
+
+### [VideoDownloaderToolkit](https://docs.camel-ai.org/reference/camel.toolkits.video_download_toolkit)
+
+*Allows an agent to download and process videos from popular platforms.*
+
+This toolkit enables an agent to download video content from URLs (e.g., from YouTube) and optionally split them into chunks. The saved video can then be analyzed by other toolkits, such as the `AudioAnalysisToolkit` for transcription, or `ImageAnalysisToolkit` for object detection.
+
+### [WebDeployToolkit](https://docs.camel-ai.org/reference/camel.toolkits.web_deploy_toolkit)
+
+*Provides tools to deploy web content on a local server.*
+
+This toolkit allows the `DeveloperAgent` to instantly host web applications or static files. It can serve a single HTML file or an entire folder (like a built React app) on a local port, making it easy to preview and test web development work.