Chat Relay: OpenAI-Compatible Relay for AI Chat Interfaces

Chat Relay is a system that allows Cline/RooCode to communicate with web-based AI chat interfaces (like Gemini, AI Studio, ChatGPT, and Claude) through an OpenAI-compatible API. This enables using models that may not have public APIs through their web interfaces. It also alows you to use models that are fast or good at tools (Claude) in combination with slower smarter models (Google Pro 2.5).

Architecture Overview

The system consists of three main components:

OpenAI-Compatible API Server: Implements an OpenAI-compatible API endpoint that Cline/RooCode can connect to.
Browser Extension: Connects to the API server via WebSocket and interacts with the chat interface.
MCP Server: Provides additional tools and resources for the system.

ASCII Diagram

┌─────────────┐            ┌─────────────┐               ┌─────────────┐         ┌──────────────┐
│             │            │             │               │             │         │              │
│  Cline/     │◄── http ──►│  OpenAI-    │◄─ websocket -►│  Browser    │◄───────►│  Chat        │
│  RooCode    │            │  Compatible │               │  Extension  │         │  Interface   │
│  (App)      │            │  Relay      │               │             │         │ (Gemini,     │
└─────────────┘            └─────────────┘               └─────────────┘         │  AI Studio,  │
                                                                                 │  ChatGPT,    │
                                                                                 │  Claude)     │
                                                                                 └──────────────┘

Mermaid Diagram

graph TD
    A["Client / RooCode"] -->|HTTP POST /chat/completions| B["API Relay Server"]
    B -->|WebSocket| C["Browser Extension"]
    C -->|User Input| D["Chat Interface
     (Gemini / AI Studio / ChatGPT / Claude)"]
    D -->|AI Response| C
    C -->|Captured Response| B
    B -->|OpenAI-format JSON| A

Loading

Data Flow

Cline/RooCode to API Server:
- Sends HTTP POST to /v1/chat/completions
API Server to Browser Extension:
- Sends message + requestId via WebSocket
Extension to Chat Interface:
- Inserts text, clicks send
Chat Interface to Extension:
- Captures response from UI
Extension to API Server:
- Returns response tied to requestId
API Server to Cline/RooCode:
- Formats response in OpenAI structure

Components

1. OpenAI-Compatible API Server

The API server implements an OpenAI-compatible endpoint and manages browser extension connectivity.

Key Features:

OpenAI-style /v1/chat/completions endpoint
WebSocket server for real-time relay
Comprehensive admin interface with real-time monitoring
Live server logs and message history tracking
Configurable settings with persistent storage
Ping/pong health checks
Request timeouts, retry handling
Tracks connection state

2. Browser Extension

Interacts with Gemini, AI Studio, ChatGPT, and Claude UIs. Injects chat, captures responses.

Key Features:

Auto-send with retry/backoff
DOM/debugger-based response capture
Modular provider architecture:

Supported Chat Interfaces:

Gemini (gemini.google.com)
AI Studio (aistudio.google.com)
ChatGPT (chatgpt.com)
Claude (claude.ai)

ChatGPT is a trademark of OpenAI. Gemini and AI Studio are trademarks of Google. Claude is a trademark of Anthropic. This project is not affiliated with, endorsed by, or sponsored by OpenAI, Google, or Anthropic.

3. MCP Server

An optional developer utility server for simulating messages, testing extensions, or viewing traffic.

Important Considerations & Responsible Use

Compliance with Provider Terms: Users of Chat Relay are solely responsible for ensuring that their use of this tool complies with the terms of service, usage policies, and any other relevant agreements of the AI service providers they access through Chat Relay (e.g., OpenAI, Google, Anthropic).

Respect Provider Limits: It is crucial to be mindful of and respect any rate limits, usage quotas, or other restrictions imposed by the AI service providers. Automating interactions can potentially exceed these limits if not managed carefully.

Avoid System Disruption: Do not use Chat Relay in any manner that could disrupt, overburden, or impair the services of the AI providers. Responsible use includes avoiding excessive requests or any activity that could negatively impact the stability or performance of their systems.

User Responsibility: By using Chat Relay, you acknowledge that you understand these considerations and agree to use the tool responsibly and at your own risk concerning compliance with third-party provider terms. The developers of Chat Relay assume no liability for misuse or violations of third-party terms of service.

Installation & Setup

Prerequisites

Node.js (v14+)
npm (v6+)
Chrome browser

API Server Setup

cd api-relay-server
npm install
nodemon start

Browser Extension Setup

Open Chrome → chrome://extensions/
Enable Developer Mode
Click "Load unpacked"
Select extension/ directory

MCP Server (Optional)

cd mcp-server
npm run build
npm install -g .

or package installation and install

cd mcp-server
npm run build
npm pack
npm install -g C:/Users/user/Projects/chat-relay/mcp-server/chat-relay-mcp-0.0.1.tgz

---

Usage

Configuring Cline/RooCode

Open settings → API Provider: OpenAI Compatible
Base URL: http://localhost:3003
API Key: Any value (not validated)
Model ID: gemini-pro, chatgpt, claude-3-sonnet, or any label

Using the System

Start the API Server
Open Gemini, AI Studio, ChatGPT, or Claude in Chrome
Use Cline/RooCode to send messages
Responses are relayed through the extension

Testing

curl -X POST http://localhost:3003/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-pro",
    "messages": [{ "role": "user", "content": "Hello!" }],
    "temperature": 0.7,
    "max_tokens": 100
  }'

Configuration

API Server

Edit api-relay-server/src/server.js:

PORT (default: 3003)
REQUEST_TIMEOUT (default: 180000ms)
PING_INTERVAL (default: 30000ms)
CONNECTION_TIMEOUT (default: 45000ms)

Admin Interface

The API server includes a comprehensive admin interface accessible at http://localhost:3003/admin/admin.html with the following features:

Settings Management:

Server configuration (port, request timeout)
Request handling strategy (queue vs drop)
Auto-kill conflicting process toggle
Real-time settings validation and persistence

Real-Time Monitoring:

Live server logs with color-coded entries (info, warn, error)
Message history tracking with detailed request/response data
Server status monitoring (uptime, connected extensions)
WebSocket connection status

Modern UI Features:

Responsive design with professional styling
Real-time log updates every 2 seconds
Optimized layout with logs panel taking 67% of screen width
Interactive toggle switches and form controls
Chat bubble branding and modern navigation

The admin interface provides both configuration management and real-time monitoring capabilities for effective server administration.

Browser Extension

Edit extension/background.js:

serverHost, serverPort, serverProtocol
reconnectInterval

License

This project is licensed under the GNU Affero General Public License v3.0. See the LICENSE file for details.

Troubleshooting

API Server

Check port availability
If using Cline/Roo, Check chat relay provider config and ensure:
- Base URL is http://localhost:3003/v1 (adjust port if changed form default)
- R1 model parameters are enabled
- Streaming is not enabled
Inspect logs for request mapping or timeout issues

Browser Extension

Confirm connection to ws://localhost:3003
Check content.js for updated selectors if UI changes
Reload browser extension then refresh chat page

WebSocket Issues

Try increasing PING_INTERVAL or CONNECTION_TIMEOUT
Verify firewall or proxy settings

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Files

README.md

README.md

Chat Relay: OpenAI-Compatible Relay for AI Chat Interfaces

Architecture Overview

ASCII Diagram

Mermaid Diagram

Data Flow

Components

1. OpenAI-Compatible API Server

2. Browser Extension

3. MCP Server

Important Considerations & Responsible Use

Installation & Setup

Prerequisites

API Server Setup

Browser Extension Setup

MCP Server (Optional)

Usage

Configuring Cline/RooCode

Using the System

Testing

Configuration

API Server

Admin Interface

Browser Extension

License

Troubleshooting

Collapse file tree

Files

README.md

Latest commit

History

README.md

File metadata and controls

Chat Relay: OpenAI-Compatible Relay for AI Chat Interfaces

Architecture Overview

ASCII Diagram

Mermaid Diagram

Data Flow

Components

1. OpenAI-Compatible API Server

2. Browser Extension

3. MCP Server

Important Considerations & Responsible Use

Installation & Setup

Prerequisites

API Server Setup

Browser Extension Setup

MCP Server (Optional)

Usage

Configuring Cline/RooCode

Using the System

Testing

Configuration

API Server

Admin Interface

Browser Extension

License

Troubleshooting