
Models

Configure and use different LLM models with your agents

ADK TypeScript provides flexible model integration, allowing you to use various Large Language Models (LLMs) with your agents. You can use Google Gemini models directly or integrate external models through wrapper classes.

Model Integration Methods

ADK TypeScript supports two primary integration approaches:

🎯 Direct Integration

Use Google Gemini models with simple string identifiers

🔌 Wrapper Classes

Access external models through LiteLLM and other adapters

Google Gemini Models

The most straightforward way to use Google's flagship models with ADK TypeScript.
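For example, creating an agent with a Gemini model can be as simple as passing the model name as a string. The sketch below assumes the npm package `@iqai/adk` and an `LlmAgent` class mirroring the Python ADK; verify both against your installed version.

```typescript
// Minimal sketch: a Gemini model referenced by a plain string identifier.
// The package name and LlmAgent API are assumptions — check your ADK-TS docs.
import { LlmAgent } from "@iqai/adk";

const agent = new LlmAgent({
  name: "assistant",
  model: "gemini-2.0-flash", // direct string identifier, no wrapper needed
  instruction: "You are a helpful assistant.",
});
```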

Access Methods

Google AI Studio (API Key)

Best for: Rapid prototyping and development

Requirements:

  • Google API key
  • Simple environment variable setup

Features:

  • Easy to get started
  • Quick iteration and testing
  • Direct API access

Vertex AI

Best for: Production applications

Requirements:

  • Google Cloud project
  • Application Default Credentials
  • Enterprise-grade setup

Features:

  • Enterprise security and compliance
  • Advanced monitoring and scaling
  • Integration with Google Cloud services
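Which access method is used is typically selected through environment variables at startup. The variable names below follow the Python ADK's conventions and are assumptions for ADK TypeScript; confirm them in the package documentation.

```typescript
// Sketch: startup check for either auth mode (variable names assumed).
//
// Google AI Studio:  GOOGLE_API_KEY=<your key>
// Vertex AI:         GOOGLE_GENAI_USE_VERTEXAI=true
//                    GOOGLE_CLOUD_PROJECT=<project id>
//                    GOOGLE_CLOUD_LOCATION=us-central1
const usingVertex = process.env.GOOGLE_GENAI_USE_VERTEXAI === "true";

if (!usingVertex && !process.env.GOOGLE_API_KEY) {
  throw new Error(
    "Set GOOGLE_API_KEY, or set GOOGLE_GENAI_USE_VERTEXAI=true with Google Cloud credentials."
  );
}
```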

Available Models

  • Gemini 2.0 Flash: Latest high-speed model for most use cases
  • Gemini 2.5 Pro: Powerful model for complex reasoning tasks
  • Gemini 1.5 Pro: Stable model with large context windows
  • Live API Models: Special models supporting voice/video streaming

Model Selection

Choose Flash models for speed and efficiency, Pro models for complex reasoning, and Live API models for real-time audio/video applications.

External Models via LiteLLM

Access models from OpenAI, Anthropic, Cohere, and 100+ other providers through the LiteLLM integration.

Supported Providers

🤖 OpenAI

GPT-4, GPT-3.5, and latest ChatGPT models

🧠 Anthropic

Claude models for advanced reasoning

⚡ Cohere

Command models for enterprise applications

🌐 Many Others

100+ models from various providers

Setup Requirements

  1. Install LiteLLM: Add the LiteLLM package to your project
  2. Configure API Keys: Set environment variables for your chosen providers
  3. Create Wrapper: Use the LiteLLM wrapper class with your agents (see the sketch below)
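Putting the steps together, a wrapper-based agent might look like the following sketch. The `LiteLlm` class name and constructor shape mirror the Python ADK and are assumptions here; the `provider/model` string format is LiteLLM's own convention.

```typescript
// Sketch: routing an agent to an external model via the LiteLLM wrapper.
// Class name and options are assumptions — confirm against your ADK-TS version.
import { LlmAgent, LiteLlm } from "@iqai/adk";

const claudeAgent = new LlmAgent({
  name: "claude_agent",
  // LiteLLM routes requests using "provider/model" strings
  model: new LiteLlm({ model: "anthropic/claude-3-5-sonnet-20241022" }),
  instruction: "You are a concise technical assistant.",
});
// This provider expects ANTHROPIC_API_KEY in the environment.
```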

Local and Open Source Models

Run models locally for privacy, cost control, or offline operation.

Local Deployment Options

🦙 Ollama

Easy local model deployment and management

🏗️ Self-Hosted

Custom model server deployments

☁️ Private Cloud

Models hosted in your own infrastructure
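As a sketch, a locally served Ollama model can be reached through the same LiteLLM wrapper shown above. The `ollama_chat/` prefix is LiteLLM's convention for Ollama's chat endpoint; the TypeScript class shape remains an assumption, as before.

```typescript
// Sketch: agent backed by a local Ollama server (default: localhost:11434).
import { LlmAgent, LiteLlm } from "@iqai/adk";

const localAgent = new LlmAgent({
  name: "local_agent",
  model: new LiteLlm({ model: "ollama_chat/llama3.1" }),
  instruction: "Answer using only locally available knowledge.",
});
```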

Considerations for Local Models

  • Tool Support: Ensure your chosen model supports function calling
  • Performance: Consider hardware requirements for model size
  • Reliability: Local models may have different reliability characteristics
  • Model Quality: Open source models vary in capability and consistency

Tool Compatibility

When using local models with tools, verify that the model supports function calling. Not all open source models have reliable tool support.

Model Configuration

Generation Parameters

Control how models generate responses (see the sketch after this list):

  • Temperature: Randomness in responses (0.0 = deterministic, 1.0 = creative)
  • Max Tokens: Maximum response length
  • Top-P/Top-K: Sampling controls that limit how many candidate tokens are considered
  • Safety Settings: Content filtering and safety controls
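A sketch of attaching these parameters to an agent follows; the `generateContentConfig` field and its key names follow the Gemini API's generation config and are assumptions for ADK TypeScript.

```typescript
// Sketch: tuning generation behavior (field names assumed).
import { LlmAgent } from "@iqai/adk";

const writer = new LlmAgent({
  name: "writer",
  model: "gemini-2.0-flash",
  instruction: "Write short product descriptions.",
  generateContentConfig: {
    temperature: 0.2,     // low randomness for consistent output
    maxOutputTokens: 512, // cap response length
    topP: 0.95,           // nucleus sampling cutoff
  },
});
```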

Performance Optimization

  • Model Selection: Choose appropriate model size for your use case
  • Caching: Implement response caching for repeated queries (sketched after this list)
  • Batching: Group requests when possible
  • Rate Limiting: Respect provider rate limits
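The caching point can be illustrated with a provider-agnostic sketch; `ask` here is a hypothetical stand-in for whatever call your agent exposes.

```typescript
// Plain-TypeScript response cache; no ADK-specific APIs involved.
const responseCache = new Map<string, string>();

async function cachedAsk(
  prompt: string,
  ask: (p: string) => Promise<string> // hypothetical model-call function
): Promise<string> {
  const hit = responseCache.get(prompt);
  if (hit !== undefined) return hit; // skip a repeated (billable) model call
  const answer = await ask(prompt);
  responseCache.set(prompt, answer);
  return answer;
}
```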

Cost Management

  • Model Tier Selection: Balance cost vs capability
  • Request Optimization: Minimize unnecessary model calls
  • Usage Monitoring: Track costs and usage patterns
  • Efficient Prompting: Design prompts for optimal token usage

Best Practices

Model Selection Strategy

  1. Start Simple: Begin with fast Flash-tier models for development
  2. Test Thoroughly: Validate model performance with your specific use case
  3. Consider Latency: Factor in response time requirements
  4. Evaluate Quality: Test output quality across different scenarios

Environment Management

  • Separate Environments: Use different models for dev/staging/production
  • Configuration Management: Use environment variables for model selection
  • Fallback Strategies: Implement backup models for reliability (see the sketch after this list)
  • Monitoring: Track model performance and availability
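The configuration-management and fallback points combine naturally in one sketch; `ADK_MODEL` and `runWithModel` are hypothetical names used for illustration only.

```typescript
// Sketch: environment-driven model choice with a backup model.
const PRIMARY_MODEL = process.env.ADK_MODEL ?? "gemini-2.0-flash"; // per-environment override
const FALLBACK_MODEL = "gemini-1.5-pro";

async function askWithFallback(
  runWithModel: (model: string, prompt: string) => Promise<string>, // hypothetical runner
  prompt: string
): Promise<string> {
  try {
    return await runWithModel(PRIMARY_MODEL, prompt);
  } catch {
    return await runWithModel(FALLBACK_MODEL, prompt); // degrade gracefully
  }
}
```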

Security Considerations

  • API Key Management: Secure storage and rotation of API keys
  • Network Security: Protect communications with model providers
  • Data Privacy: Consider data residency and privacy requirements
  • Audit Trails: Log model interactions for compliance

Troubleshooting

Common Issues

  • Authentication Errors: Check API keys and credentials
  • Rate Limiting: Implement proper retry logic (see the backoff sketch below)
  • Model Availability: Verify model endpoints are accessible
  • Version Compatibility: Ensure model versions match your requirements
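For the rate-limiting case, a generic retry helper with exponential backoff (plain TypeScript, no ADK-specific APIs) looks like this sketch.

```typescript
// Retry a transient failure up to `attempts` times with exponential backoff.
async function withRetry<T>(fn: () => Promise<T>, attempts = 3): Promise<T> {
  for (let i = 0; ; i++) {
    try {
      return await fn();
    } catch (err) {
      if (i >= attempts - 1) throw err; // out of retries
      const delayMs = 2 ** i * 1000;    // 1s, 2s, 4s, ...
      await new Promise((resolve) => setTimeout(resolve, delayMs));
    }
  }
}
```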

Performance Issues

  • Monitor response times and adjust model selection
  • Check network connectivity and latency
  • Verify proper resource allocation for local models
  • Consider model warming strategies for improved performance