Agents
Overviewβ
Enterprise h2oGPTe agents enhance the functionality and versatility of Enterprise h2oGPTe to execute a broader range of tasks autonomously. In other words, this setting allows the large language model (LLM) to perform actions such as running code, generating plots, searching the web, conducting research, developing and preparing models, and more.
Enterprise h2oGPTeΒ agents are equipped with a diverse suite of tools and features designed to optimize workflows, enhance productivity, and simplify complex tasks.
To learn how to use agents for dataset analysis and model development and preparation, consider the following tutorials:
Tool requirementsβ
Enterprise h2oGPTe includes multiple agent tools, each with specific requirements. Basic agent functionality is enabled by default, but many tools require additional configuration.
- Check the requirements for each tool you intend to use
- Configure the necessary environment variables in your deployment
- Ensure any required external services or credentials are set up properly
- Verify that hardware requirements (such as GPU for image generation) are met
- Replace all
<your-*-key>
placeholders with actual keys/tokens
This basic agent configuration enables several tools by default:
ALLOW_API: "1"
H2OGPT_OPENAI_LOG_PATH: /workspace/save/h2ogpt_openai/
TRANSFORMERS_CACHE: /workspace/.cache
Instructionsβ
To use agents in Enterprise h2oGPTe (enable an agent in a chat session) and configure the type of tools the agent can use, consider the following steps:
- In Enteprise h2oGPTe, click + New chat.
- Click Use agent.
- Click Customize.
- Click the Configuration tab.
- By default, in the Tools setting (under the Use agent setting), all available tools for an agent are presented.
To learn about each available tool for an agent, see Tools.
Toolsβ
Data analysis capabilitiesβ
-
H2O Driverless AI Data Science: Leverage AI capabilities for automated machine learning, data analysis, and model generation to extract actionable insights from complex datasets.
Requirements:β
- For DAI Cloud:
- For DAI on-premises:
ENABLE_DAI: "1"
DAI_INSTANCE_TYPE: cloud
DAI_CLIENT_ID: hac-platform-public
DAI_ENGINE: <your-dai-engine>
DAI_ENVIRONMENT: <your-dai-environment-url>
DAI_TOKEN: <your-dai-token>
DAI_TOKEN_ENDPOINT_URL: <your-token-endpoint-url>ENABLE_DAI: "1"
DAI_INSTANCE_TYPE: onprem
DAI_ADDRESS: <your-dai-address>
DAI_USERNAME: <your-username>
DAI_PASSWORD: <your-password> -
Python Coding: Execute Python code snippets for custom data computations, machine learning tasks, or to automate data preprocessing.
Requirements:β
- Enabled with basic agent configuration:
ALLOW_API: "1"
- Enabled with basic agent configuration:
-
Advanced Reasoning: Perform context-aware reasoning to enhance decision-making processes.
Requirements:β
-
Requires models that support reasoning capabilities.
-
Configure in Helm chart:
agent:
enabled: true
overrideConfig:
visible_reasoning_models:
- meta-llama/Llama-3.1-8B-Instruct
-
Information retrieval and researchβ
-
Scholar Papers Search: Locate scholarly papers, research articles, and academic content.
Requirements:β
- Requires the Internet Access tool to be enabled.
-
Google search: Access Google's search capabilities to gather timely and accurate information.
Requirements:β
- Requires the Internet Access tool to be enabled.
- Optional:
- Configure the Google API key:
GOOGLE_API_KEY: <your-google-api-key>
- Configure the Google API key:
-
Bing search: Access Bing's search capabilities to gather timely and accurate information.
Requirements:β
- Requires the Internet Access tool to be enabled.
- Optional:
- Configure the Bing API key:
BING_API_KEY: <your-bing-api-key>
- Configure the Bing API key:
-
Internet Access: Utilize general internet access for a wide range of tasks, from data retrieval to live updates.
Requirements:β
- Basic agent configuration:
ALLOW_API: "1"
- Basic agent configuration:
-
Wolfram|Alpha Math Science Search: Solve scientific and mathematical problems using the computational knowledge engine for complex computations.
Requirements:β
- Requires the Internet Access tool to be enabled.
- Optional:
- Configure the Wolfram|Alpha API key:
WOLFRAM_ALPHA_APPID: <your-wolfram-alpha-appid>
- Configure the Wolfram|Alpha API key:
-
Wikipedia Articles Search: Retrieve information from Wikipedia for learning or to support data analysis and research.
Requirements:β
- Requires the Internet Access tool to be enabled.
-
Wayback Machine Search: Access archived versions of websites for research, historical context, or to reference past web content.
Requirements:β
- Requires the Internet Access tool to be enabled.
-
Web Image Search: Find relevant images from the web to support visual analysis, presentations, or content creation.
Requirements:β
- Requires the Internet Access tool to be enabled.
-
Ask Question About Documents: Extract or query information from uploaded documents for detailed insights.
Requirements:β
- Basic agent configuration:
ALLOW_API: "1"
- Basic agent configuration:
Content generation and transformationβ
-
Image Generation: Create custom images based on textual descriptions. Integrate image generation capabilities from different providers, including local models, OpenAI, Azure OpenAI, and Black Forest Labs.
Requirements:β
- Requires GPU in agent pod.
- Shell Scripting tool must be enabled as a prerequisite.
- Configuration in Helm chart:
h2ogpt:
config:
externalLLM:
enabled: true
secret:
ALLOW_API: "1"
# Black Forest Labs Configuration
BFL_API_KEY: <your-black-forest-labs-api-key>
# OpenAI Configuration
IMAGEGEN_OPENAI_BASE_URL: https://api.openai.com/v1
IMAGEGEN_OPENAI_API_KEY: <your-openai-api-key>
# Azure OpenAI Configuration
AZURE_OPENAI_BASE_URL: openai.azure.com
AZURE_OPENAI_API_KEY: <your-azure-openai-api-key>
agent:
enabled: true
additionalConfig:
function_api_key: <your-function-api-key>
enforce_h2ogpt_api_key: false
enforce_h2ogpt_ui_key: false
enable_image: true
visible_image_models:
- "flux.1-schnell" # Local model for image generation
# Optional: Additional image generation models
# Uncomment and replace with actual model names when using external providers
# - "openai-model" # OpenAI model
# - "azure-model" # Azure model
-
Audio-Video Transcription: Convert spoken content from audio or video files into text.
Requirements:β
-
Basic agent configuration:
ALLOW_API: "1"
-
Optional:
- Configure the OpenAI base URL and API key for STT:
STT_OPENAI_BASE_URL: <your-stt-openai-base-url>
STT_OPENAI_API_KEY: <your-stt-openai-api-key>
# Additional Configuration:
asr_model: "distil-whisper/distil-large-v3" # Model for converting audio/video transcription
enable_stt: true # Enable speech-to-text for audio/video transcriptionnoteEnsure you have selected Agents Only when uploading.
- Configure the OpenAI base URL and API key for STT:
-
-
Convert Document to Text: Transform scanned or digital documents into editable text formats.
Requirements:β
- Basic agent configuration
ALLOW_API: "1"
- Basic agent configuration
-
Screenshot Webpage: Capture screenshots of webpages.
Requirements:β
- Requires the Internet Access tool to be enabled.
-
Mermaid Chart-Diagram Renderer: Generate flowcharts and diagrams.
Requirements:β
- Enabled with the default configuration.
Automation and scriptingβ
-
Shell Scripting: Automate system-level tasks to streamline operations.
Requirements:β
- Basic agent configuration:
ALLOW_API: "1"
- Basic agent configuration:
Specialized AI featuresβ
-
RAG Text: Enhance text-based tasks with retrieval-augmented generation.
Requirements:β
- Uses default models in ModelLock.
- Basic agent configuration:
ALLOW_API: "1"
-
RAG Vision: Apply retrieval-augmented generation for vision-based tasks.
Requirements:β
- Uses default vision models in ModelLock.
Multimedia processingβ
-
Download Web Video: Download web videos.
Requirements:β
- Requires the Internet Access tool to be enabled.
- Shell Scripting tool must be enabled as a prerequisite.
-
Ask Question About Image: Query and analyze visual data.
Requirements:β
- Requires a vision model configured in ModelLock.
Software Development Toolsβ
-
Aider Code Generation: Automatically generate code snippets to accelerate development tasks, particularly for data science and machine learning models.
Requirements:β
- Models are fetched from ModelLock.
- No additional configuration required.
-
GitHub: Integrates with the GitHub platform for version control, code repository browsing, and collaboration.
Requirements:β
- Requires configuring
GITHUB_TOKEN
as a secret:GITHUB_TOKEN: <your-github-token>
- Requires configuring
-
Evaluate Answer: Provides automated evaluation of responses generated by agents.
Requirements:β
- Basic agent configuration:
ALLOW_API: "1"
- Basic agent configuration:
-
SWEBench Evaluate Patch: Evaluates software patches by running automated tests, quality checks, and performance benchmarks.
Requirements:β
- Basic agent configuration:
ALLOW_API: "1"
- Basic agent configuration:
To use tools that require internet access, such as GitHub, Wikipedia Articles Search, Web Image Search, etc., ensure that the Internet Access tool is selected.
- Submit and view feedback for this page
- Send feedback about Enterprise h2oGPTe to cloud-feedback@h2o.ai