A blazingly fast, privacy first & OPEN AI compatible Chat Interface

JavaScript 97.8%
CSS 1.1%
Shell 0.6%
HTML 0.3%
Dockerfile 0.2%

Find a file

Mike Key 41e5b7b06f Some checks failed CI / format-check (push) Has been cancelled Details CI / build-frontend (push) Has been cancelled Details CI / build-server (push) Has been cancelled Details CI / test (push) Has been cancelled Details chore: bump version to 0.2.1 Security fixes from PR #14 plus update-notifier proxy from PR #15. Bumps package.json across root, frontend, and server so the in-app banner can detect the upgrade for users on 0.2.0.		2026-04-23 20:52:34 -06:00
.claude	Minor Cleanup & Agents Documentation	2026-04-23 19:46:14 -06:00
.github	test: add comprehensive backend test suite (172 tests)	2026-02-27 20:12:09 -07:00
frontend	chore: bump version to 0.2.1	2026-04-23 20:52:34 -06:00
packages/shared	fix: resolve 13 data-flow review findings (CRIT→LOW) (#14 )	2026-04-23 20:46:13 -06:00
scripts	Add web search tools, citations persistence, and admin search key management	2026-04-02 18:00:00 -06:00
server	chore: bump version to 0.2.1	2026-04-23 20:52:34 -06:00
.dockerignore	Including Docker	2025-11-20 20:45:05 -07:00
.gitignore	Minor Cleanup & Agents Documentation	2026-04-23 19:46:14 -06:00
.prettierignore	v0.2.0: Complete rewrite - Next.js → Preact + Hono monorepo	2025-11-20 01:29:11 -07:00
.prettierrc	feat: Add llama 3.3 70B & other things	2025-01-18 21:28:01 -07:00
AGENTS.md	Minor Cleanup & Agents Documentation	2026-04-23 19:46:14 -06:00
bun.lock	feat: proxy GitHub release check through server, fix repo URL (#15 )	2026-04-23 20:45:11 -06:00
bunfig.toml	Major revisions	2026-04-02 19:52:19 -06:00
Caddyfile	Including Docker	2025-11-20 20:45:05 -07:00
CLAUDE.md	Minor Cleanup & Agents Documentation	2026-04-23 19:46:14 -06:00
connections.png	Massive UX/DX improvements	2025-11-22 01:07:05 -07:00
CONTRIBUTING.md	v0.2.0: Complete rewrite - Next.js → Preact + Hono monorepo	2025-11-20 01:29:11 -07:00
docker-compose.caddy.yml	Including Docker	2025-11-20 20:45:05 -07:00
docker-compose.yml	Updates to Docker	2025-12-04 21:09:28 -07:00
Dockerfile	Updates to Docker	2025-12-04 21:09:28 -07:00
faster-chat.code-workspace	v0.2.0: Complete rewrite - Next.js → Preact + Hono monorepo	2025-11-20 01:29:11 -07:00
faster-chat.png	Updating the README with new details	2025-12-04 21:25:08 -07:00
fly.toml	Wrote files to enable easy one click deployment	2025-12-03 15:27:33 -07:00
focus-mode.png	Updating the README with new details	2025-12-04 21:25:08 -07:00
LICENSE	Refactor: Major architecture improvements and simplifications	2025-09-16 17:52:14 -06:00
models.png	Massive UX/DX improvements	2025-11-22 01:07:05 -07:00
oneclick-deploy-notes.md	pushing forward	2025-12-03 15:27:33 -07:00
package.json	chore: bump version to 0.2.1	2026-04-23 20:52:34 -06:00
railway.json	Wrote files to enable easy one click deployment	2025-12-03 15:27:33 -07:00
README.md	Updating the README	2026-04-02 20:03:58 -06:00
release.yml	pushing forward	2025-12-03 15:27:33 -07:00
render.yaml	Wrote files to enable easy one click deployment	2025-12-03 15:27:33 -07:00
themes.png	Updating the README with new details	2025-12-04 21:25:08 -07:00
white-label.png	Updating the README with new details	2025-12-04 21:25:08 -07:00

README.md

⚡ Faster Chat

A blazingly fast, privacy-first chat interface for AI that works with any LLM provider—cloud or completely offline.

Connect to OpenAI, Anthropic, Google, Groq, Mistral, xAI, DeepSeek, and more—or run completely offline with Ollama, LM Studio, or llama.cpp. Your conversations stay on your machine. No vendor lock-in, no tracking, full control.

✨ Features

Core

💬 Real-time streaming chat with Vercel AI SDK
⚡ Blazingly fast — 3KB Preact runtime, zero SSR overhead, instant responses
🗄️ Server-side SQLite storage — Conversations persist across devices and browser tabs
🤖 19+ providers: OpenAI, Anthropic, Google, Groq, Mistral, xAI, DeepSeek, Cohere, Fireworks, Cerebras, Amazon Bedrock, Azure, OpenRouter, Replicate, Ollama, LM Studio, and more
🧠 Cross-conversation memory — AI remembers your preferences, projects, and context across chats
🔍 Web search — AI can search the web and cite sources inline (Brave Search)
🖼️ Image support — Upload images for vision analysis, generate images with DALL-E, FLUX, and OpenRouter models
📥 Import conversations from ChatGPT exports (more formats coming soon)
📎 File attachments with preview and download
📝 Markdown rendering with syntax highlighting (Shiki) and LaTeX support
🎨 Themable UI — 15+ color themes, dark/light mode, custom fonts, syntax highlighting themes
🎤 Voice input/output — Speech-to-text and text-to-speech capabilities
⌨️ Keyboard shortcuts for power users (Ctrl+B sidebar, Ctrl+Shift+O new chat, etc.)
📱 Responsive design for desktop, tablet, and mobile

Administration

🔐 Multi-user authentication with role-based access (admin/member/readonly)
🔌 Provider Hub: Auto-discover models with models.dev integration
⬇️ Pull Ollama models directly from Admin Panel with progress streaming (no CLI needed)
🛡️ Admin panel for user management (CRUD, password reset, role changes)
🔑 Encrypted API key storage with server-side encryption
🎭 White labeling — Customize app name and logo icon for your organization

Deployment

🌐 Works completely offline with local models (Ollama, LM Studio, etc.)
🐳 One-command Docker deployment with optional HTTPS via Caddy
🎨 Modern stack: Preact + Hono + TanStack + Tailwind 4.1

🧠 Memory System

Faster Chat can learn about you across conversations — your preferences, projects, tech stack, and communication style. Memories are extracted automatically after each response and injected naturally into future chats.

Three-level control: Global toggle (admin), per-user toggle, per-chat opt-out
Zero latency impact: Extraction happens asynchronously after the response streams
Full transparency: View, delete individual facts, or clear all memories from Settings
Configurable extraction model: Use a cheap/fast model (e.g., Haiku, GPT-4o-mini) to minimize cost
Privacy: Memories are strictly user-scoped. No admin access to user memories.

🔍 Web Search

When enabled, the AI can autonomously search the web and fetch pages to answer questions with up-to-date information.

Brave Search integration with encrypted API key storage
SSRF-protected URL fetching with DNS validation and private IP blocking
Source citations displayed as clickable pills with favicons below the response
Real-time status: "Searching the web..." and "Reading {domain}" indicators during tool execution
5-minute cache for repeated queries, max 5 results per search

🖼️ Image Support

Vision/Multimodal: Upload images alongside messages for analysis by vision-capable models (Claude, GPT-4, Gemini, Grok, Mistral, and more).

Image Generation: Toggle image mode to generate images with configurable aspect ratios (1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3).

Providers: OpenAI (DALL-E 3), Replicate (FLUX 1.1 Pro), OpenRouter image models
Generated images display inline with download controls and metadata

🚀 Quick Start

One-Click Docker Deploy (Recommended)

git clone https://github.com/1337hero/faster-next-chat.git
cd faster-next-chat
docker compose up -d

That's it. Open http://localhost:8787, register your first user (becomes admin), and configure your AI providers.

With HTTPS (for production):

docker compose -f docker-compose.yml -f docker-compose.caddy.yml up -d

Local Development

Prerequisites: Bun (recommended) or Node.js 20+

git clone https://github.com/1337hero/faster-next-chat.git
cd faster-next-chat
bun install
bun run dev

On first run, the server automatically generates encryption keys and initializes the database.

Frontend: http://localhost:3000
API Server: http://localhost:3001

Important: Backup server/.env — contains the encryption key for stored API keys.

First-Time Setup

Register an account at http://localhost:3000/login
- The first account is automatically promoted to admin
Configure AI providers in the Admin Panel (/admin → Providers tab):
- Add OpenAI, Anthropic, or other cloud providers with API keys
- Configure local providers (Ollama, LM Studio) with custom endpoints
- API keys are encrypted and stored securely server-side
Enable models in the Admin Panel (Providers tab → Refresh Models)
- Select which models appear in the chat interface
- Set default model for new chats

Configure providers and API keys in the Admin Panel

Enable and manage models from all your providers

New Focus Mode

New Appearance Options, to change colors and fonts

You can now white label and customize the app title and icon

Using Offline with Ollama

# Install Ollama (macOS/Linux)
curl -fsSL https://ollama.ai/install.sh | sh

# In Faster Chat: Admin Panel → Connections → Search "Ollama" → Add
# Then: Admin Panel → Models → Click "Pull Model" on Ollama row → Enter model name

You can pull models directly from the Admin Panel—no CLI needed! Just click Pull Model next to your Ollama provider, enter a model name (e.g., llama3.2, mistral, codellama), and watch the download progress in real-time.

The Provider Hub auto-discovers 50+ providers including Ollama, LM Studio, OpenAI, Anthropic, Groq, Mistral, OpenRouter, and more. Just search and add.

💻 Development

Commands

Root (recommended)

bun run dev         # Start frontend + backend concurrently
bun run build       # Build all packages for production
bun run start       # Run production builds
bun run clean       # Remove all build artifacts
bun run format      # Format code with Prettier

Frontend

cd frontend
bun run dev         # Vite dev server on :3000
bun run build       # Production build to dist/
bun run preview     # Preview production build

Backend

cd server
bun run dev         # Hono dev server on :3001
bun run build       # Build for production
bun run start       # Run production server on :3001

🐳 Docker Details

The Docker setup uses a hybrid build (Bun for deps, Node.js 22 runtime) with SQLite storage in a persistent volume.

HTTPS with Caddy: For production with automatic Let's Encrypt certificates:

docker compose -f docker-compose.yml -f docker-compose.caddy.yml up -d
# Edit Caddyfile with your domain, point DNS, restart

See docs/caddy-https-setup.md and docs/docker-setup.md for details.

Configuration

Environment Variables (server/.env):

# Required: Encryption key for API keys
API_KEY_ENCRYPTION_KEY=...  # Generate with crypto.randomBytes(32)

# Optional: Configure via Admin Panel instead
APP_PORT=8787              # Internal port (default: 8787)
NODE_ENV=production        # Environment mode
DATABASE_URL=sqlite:///app/server/data/chat.db

# For local Ollama access from Docker
OLLAMA_BASE_URL=http://host.docker.internal:11434

Common Commands:

docker compose up -d                # Start
docker compose logs -f              # View logs
docker compose down                 # Stop
docker compose up -d --build        # Rebuild

# Reset database
docker compose down
docker volume rm faster-chat_chat-data
docker compose up -d

🗺️ Roadmap

Completed ✅

Preact + Hono migration from Next.js
Streaming chat with Vercel AI SDK
Server-side SQLite persistence (chats sync across devices/tabs)
Multi-provider support (19+ providers with auto-discovery)
Admin panel for providers, models, and users
Role-based access control (admin/member/readonly)
File attachments with preview/download
Markdown, code highlighting (Shiki syntax highlighting), LaTeX rendering
One-click Docker deployment with optional HTTPS
Keyboard shortcuts (Ctrl+B sidebar, Ctrl+Shift+O new chat, Ctrl+K search)
Theming system (15+ color themes, light/dark mode)
Font customization and font themes
Voice input/output (speech-to-text, text-to-speech)
Settings UI improvements (tabbed interface for user preferences)
White labeling (custom app name, custom logo icon selection)
ChatGPT conversation import (drag-drop JSON export files)
Ollama model pull UI (download models directly from Admin Panel)
Cross-conversation memory (automatic fact extraction, three-level gating, per-chat opt-out)
Web search (Brave Search integration, source citations, SSRF protection, result caching)
Image support — Vision/multimodal analysis + image generation (DALL-E, FLUX, OpenRouter)
Auto-title generation for chats
Message regeneration
Reasoning display for thinking models (DeepSeek R1, o1)
Chat folders with colors and pinned chats

Planned 📋

Chat UX

Inline message editing
Message rating (thumbs up/down)
Export conversations (JSON, Markdown, CSV)
Import from more sources (Claude, other AI assistants)

Advanced Capabilities

Prompt templates (reusable system prompts with variables)
Model comparison arena (side-by-side evaluation)
Conversation sharing and collaboration
MCP (Model Context Protocol) integration

Administration

Fine-grained permissions and feature toggles per role
API usage monitoring and cost tracking

Infrastructure

PostgreSQL backend option (for larger deployments)
Plugin system for custom extensions
Mobile app (Capacitor)

🎨 Design Philosophy

Faster Chat is built on these principles:

Self-Hosted: Your data stays on your server. No cloud dependencies.
Provider-Agnostic: Never locked into a single AI vendor.
Minimal Runtime: 3KB Preact, no SSR overhead, instant responses.
Offline-Capable: Run completely offline with local models.
Fast Iteration: Bun for speed, no TypeScript ceremony, clear patterns.
Simple Code: Small focused components, derive state in render, delete aggressively.

Why No TypeScript?

We chose speed over ceremony. TypeScript's compile step and constant type churn across fast-moving AI SDKs slowed development more than it helped.

Our guardrails:

Runtime validation at system boundaries
Shared constants and clear contracts
Tests for critical paths
JSDoc for complex functions

Trade-off: Less friction, faster iteration, easier contribution.

See WIKI for detailed coding principles and architecture documentation.

🙏 Credits & Acknowledgments

Faster Chat is built on the shoulders of excellent open source projects:

Core Infrastructure

Vercel AI SDK — Streaming chat completions and multi-provider support
models.dev — Community-maintained AI model database for auto-discovery
Preact — Lightweight 3KB React alternative
Hono — Ultrafast web framework for the backend
TanStack Router & TanStack Query — Modern routing and server state management
bun:sqlite — Fast SQLite driver for server-side persistence

UI & Styling

Tailwind CSS — Utility-first CSS framework
lucide-preact — Beautiful icon library
Catppuccin — Soothing pastel theme

External API Calls

For transparency, this application makes the following external API calls:

models.dev/api.json — Fetches provider and model metadata on server startup (cached for 1 hour)
Brave Search API — Only when web search is enabled and triggered by the AI during a conversation
Your configured AI providers (OpenAI, Anthropic, etc.) — Only when you send chat messages
No tracking, analytics, or telemetry services — Your privacy is paramount

All data (conversations, memories, settings, API keys) is stored in your self-hosted SQLite database. Nothing leaves your server except API calls to your configured AI providers and optional web search.

🤝 Contributing

Contributions welcome! We're looking for:

Bug fixes and error handling
New provider integrations
Documentation improvements
UI/UX enhancements
Tests and quality improvements

Before submitting:

Read Documentation for coding philosophy and patterns
Ensure changes align with our lightweight, offline-first approach
Test locally with bun run dev
Keep PRs focused on a single feature or fix

📄 License

MIT License — see LICENSE for details.

⭐ Star History

If Faster Chat helps you take control of your AI conversations, consider giving us a star!

Built with ❤️ by 1337Hero for developers who value privacy, speed, and control.
_{No tracking. No analytics. Just fast, local-first AI conversations.}