Any AI Proxy API (Go)

A Go-based proxy server that provides OpenAI-compatible API endpoints for Any AI website using browser automation.

Overview

This project creates a bridge between OpenAI's API format and various AI websites' web interfaces. It uses ChromeDP for browser automation to interact with Any AI website and provides a REST API that mimics OpenAI's chat completions endpoint.

Features

OpenAI-Compatible API: Supports /v1/chat/completions endpoints
Browser Automation: Uses ChromeDP with Fingerprint Chromium browser for web automation
Request Queue: Implements a queue system to handle requests sequentially
Configurable Workflows: YAML-based configuration for different automation workflows
Multi-AI Service Support: Supports ChatGPT, Gemini AI Studio, Grok, and more
Multi-Instance Support: Can manage multiple AI service instances simultaneously
Screenshot API: Built-in screenshot functionality for debugging
Authentication Management: Automatic cookie and session management

Supported AI Services

Currently supports the following AI services:

ChatGPT (https://chatgpt.com/)
Gemini AI Studio (https://aistudio.google.com/)
Grok (https://grok.com/)

Each service has a dedicated adapter to handle its specific response format and interaction patterns.

Architecture

The application consists of several key components:

Core Components

API Server (internal/api/): Gin-based HTTP server providing OpenAI-compatible endpoints
Browser Manager (internal/browser/chrome/): Manages ChromeDP browser instances and contexts
Runner System (internal/runner/): Executes YAML-defined workflows for browser automation
Method Library (internal/method/): Collection of automation methods (click, input, etc.)
Adapter System (internal/adapter/): Handles response format conversion for different AI services
Configuration (internal/config/): Application configuration management
Utils (internal/utils/): Common utility functions

Request Flow

Client sends OpenAI-format request to /v1/chat/completions
Request is queued in the request queue system
Runner executes the appropriate YAML workflow to interact with the AI service
Browser automation performs the necessary actions (input text, click buttons, etc.)
Adapter intercepts and processes the response from the AI service
Response is formatted and returned to the client

Installation

Prerequisites

Go 1.24 or later
Fingerprint Chromium browser

Setup

Clone the repository:

git clone https://github.com/luispater/anyAIProxyAPI.git
cd anyAIProxyAPI

Install dependencies:

go mod download

Configure the application by editing runner/main.yaml

Configuration

The main configuration file is runner/main.yaml:

version: "1"
debug: true
browser:
  fingerprint-chromium-path: "/Applications/Chromium.app/Contents/MacOS/Chromium"
  args:
    - "--fingerprint=1000"
    - "--timezone=America/Los_Angeles"
    - "--remote-debugging-port=9222"
    - "--lang=en-US"
    - "--accept-lang=en-US"
  user-data-dir: "/anyAIProxyAPI/user-data-dir"
  proxy-url: "http://user:pass@192.168.1.1:8080/" # proxy url for browser, if instance-alone is false, this proxy setting will be ignored
api-port: "2048"
headless: false
instance-alone: true # if true, each instance will have its own browser instance
logfile: "any-ai-proxy.log"
tokens: # Global tokens for API validation (optional)
  - "global-token-1"
  - "global-token-2"
instance:
  - name: "gemini-aistudio"
    adapter: "gemini-aistudio"
    proxy-url: "socks5://user:pass@192.168.1.1:1080/" # proxy url for each instance browser, if instance-alone is true, this proxy setting will be used
    url: "https://aistudio.google.com/prompts/new_chat"
    sniff-url:
      - "https://alkalimakersuite-pa.clients6.google.com/$rpc/google.internal.alkali.applications.makersuite.v1.MakerSuiteService/GenerateContent"
    auth:
      file: "auth/gemini-aistudio.json"
      check: "ms-settings-menu"
    runner: # must be init, chat_completions, context_canceled
      init: "init-system" # init runner
      chat_completions: "chat_completions" # chat_completions runner
      context_canceled: "context-canceled" # context canceled(client disconnect) runner
    tokens: # Instance-specific tokens for API validation (optional)
      - "gemini-token-3"
      - "gemini-token-4"
  - name: "chatgpt"
    adapter: "chatgpt"
    proxy-url: "" # proxy url for each instance browser, if this setting is empty, the browser will be directly connected to the internet
    url: "https://chatgpt.com/"
    sniff-url:
      - "https://chatgpt.com/backend-api/conversation"
    auth:
      file: "auth/chatgpt.json"
      check: 'div[id="sidebar-header"]'
    runner:
      init: "init"
      chat_completions: "chat_completions"
      context_canceled: "context-canceled"
  - name: "grok"
    adapter: "grok"
    proxy-url: ""
    url: "https://grok.com/"
    sniff-url:
      - "https://grok.com/rest/app-chat/conversations/new"
    auth:
      file: "auth/grok.json"
      check: 'a[href="/chat#private"]'
    runner:
      init: "init-system"
      chat_completions: "chat_completions"
      context_canceled: "context-canceled"

Configuration Parameters

debug: Enable debug mode for detailed logging
browser: Browser executable settings
- fingerprint-chromium-path: Path to Fingerprint Chromium browser
- args: Browser launch arguments
- user-data-dir: User data directory
- proxy-url: Proxy URL for browser, if instance-alone is false, this proxy setting will be ignored
api-port: Port for the API server
headless: Run browser in headless mode
instance-alone: Run each instance will have its own browser instance
tokens: Global tokens for API validation (optional)
instance: Array of AI service instances to manage. Each instance has its own configuration
- name: Instance name
- adapter: Adapter name (corresponds to different AI services)
- proxy-url: Proxy URL for each instance browser, if instance-alone is false, this proxy setting will be ignored
- url: AI service URL
- sniff-url: URL patterns for intercepting responses
- auth: Authentication configuration
  - file: File to store authentication information
  - check: CSS selector to check login status
- runner: Runner configuration. All runner files must be defined in a directory corresponding to the instance name
- tokens: Instance specific tokens for API validation (optional)

For details on the runner file syntax, please refer to runner.md

Usage

Starting the Server

go run main.go

The server will start on the configured port (default: 2048).

Management Web Interface

Uploading Auth information for AI website

http://localhost:2048/v1/auth/upload

API Endpoints

Chat Completions

curl -X POST http://localhost:2048/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "instance-name/model-name",
    "messages": [
      {
        "role": "user",
        "content": "Hello, how are you?"
      }
    ]
  }'

Headless Screenshot

GET http://localhost:2048/screenshot?instance=instance-name

Auth Information Upload

POST http://localhost:2048/v1/auth/upload \
  -H "Content-Type: application/json" \
  -d '{
  "name": "instance-name",
  "auth": "{\"cookies\":[],\"local_storage\":{\"key\":\"value\"}}"
}'

Server Information

GET http://localhost:2048/

Usage Examples

Interacting with ChatGPT

curl -X POST http://localhost:2048/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "chatgpt/gpt-4",
    "messages": [
      {
        "role": "user",
        "content": "Explain the basic principles of quantum computing"
      }
    ]
  }'

Interacting with Gemini

curl -X POST http://localhost:2048/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini/gemini-pro",
    "messages": [
      {
        "role": "user",
        "content": "Write a Python quicksort algorithm"
      }
    ]
  }'

Interacting with Grok

curl -X POST http://localhost:2048/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "grok/grok3",
    "messages": [
      {
        "role": "user",
        "content": "What are the latest developments in AI?"
      }
    ]
  }'

Workflow System

The application uses a YAML-based workflow system to define browser automation sequences. Workflows are stored in the runner/ directory and define step-by-step instructions for interacting with AI services.

Workflow Structure

Each AI service instance has its own workflow directory:

runner/instance-name/ - Any AI website related workflows

Each directory contains the following core workflow files:

init.yaml or init-system.yaml - Initialization workflow
chat_completions.yaml - Chat completion workflow
context-canceled.yaml - Context cancellation workflow

For detailed information about the runner system, see runner.md.

Development

Project Structure

├── main.go                    # Application entry point
├── go.mod                     # Go module file
├── go.sum                     # Go dependency checksum file
├── LICENSE                    # MIT license
├── README.md                  # Project documentation
├── runner.md                  # Runner system documentation
├── internal/                  # Internal packages
│   ├── adapter/               # AI website adapters
│   │   ├── adapter.go         # Adapter interface
│   │   ├── chatgpt.go         # ChatGPT adapter
│   │   ├── gemini-aistudio.go # Gemini AI Studio adapter
│   │   └── grok.go            # Grok adapter
│   ├── api/                   # HTTP API server
│   │   ├── server.go          # Server main
│   │   ├── handlers.go        # API handlers
│   │   ├── queue.go           # Request queue
│   │   └── processor.go       # Chat processor
│   ├── browser/               # Browser management
│   │   └── chrome/            # ChromeDP manager
│   ├── config/                # Configuration handling
│   ├── html/                  # HTML content
│   ├── method/                # Automation methods
│   ├── proxy/                 # Proxy server
│   ├── runner/                # Workflow execution engine
│   └── utils/                 # Utility functions
├── runner/                    # Workflow configurations
│   ├── main.yaml              # Main configuration file
│   └── instance-name/         # Instance workflows
└── auth/                      # Authentication files

Building

go build -o any-ai-proxy main.go

Running Tests

go test ./...

Technology Stack

Go 1.24+: Main programming language
ChromeDP: Browser automation library
Gin: HTTP web framework
YAML: Configuration file format
Logrus: Structured logging library

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

License

This project is licensed under the MIT License. Refer to the LICENSE file for details.

Acknowledgements

This project was inspired by AIStudioProxyAPI

Disclaimer

This project is for educational and research purposes. Please ensure you comply with Any AI website's terms of service when using this software.

FAQ

Q: How to add support for a new AI service?

A: You need to create a new adapter (in internal/adapter/) and corresponding workflow configurations (in runner/ directory).

Q: What to do if the browser fails to start?

A: Please check if the Fingerprint Chromium path configuration is correct and ensure the browser executable exists.

Q: How to debug workflows?

A: Set debug: true in runner/main.yaml, which will enable detailed debug logging.

Q: Which operating systems are supported?

A: Supports macOS, Linux, and Windows but requires the corresponding platform's Fingerprint Chromium browser.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.github/workflows		.github/workflows
internal		internal
runner @ 7d556b3		runner @ 7d556b3
.gitignore		.gitignore
.gitmodules		.gitmodules
.goreleaser.yaml		.goreleaser.yaml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go
runner.md		runner.md

License

luispater/anyAIProxyAPI

Folders and files

Latest commit

History

Repository files navigation

Any AI Proxy API (Go)

Overview

Features

Supported AI Services

Architecture

Core Components

Request Flow

Installation

Prerequisites

Setup

Configuration

Configuration Parameters

Usage

Starting the Server

Management Web Interface

Uploading Auth information for AI website

API Endpoints

Chat Completions

Headless Screenshot

Auth Information Upload

Server Information

Usage Examples

Interacting with ChatGPT

Interacting with Gemini

Interacting with Grok

Workflow System

Workflow Structure

Development

Project Structure

Building

Running Tests

Technology Stack

Contributing

License

Acknowledgements

Disclaimer

FAQ

Q: How to add support for a new AI service?

Q: What to do if the browser fails to start?

Q: How to debug workflows?

Q: Which operating systems are supported?

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 25

Packages 0

Languages

Packages