# Quick Start

## CLI
Initialize a store, write a memory, and search it:

```bash
# Initialize a store in the current directory
llmfs init
# Initialised LLMFS at /your/project/.llmfs

# Write your first memory
llmfs write /knowledge/hello "LLMFS stores memories at filesystem paths"

# Search it back
llmfs search "how does memory storage work"

# Check what's in the store
llmfs status
```
## Python API

Five lines to get started:

```python
from llmfs import MemoryFS

mem = MemoryFS()
mem.write("/projects/auth/bug", "JWT expiry misconfigured at auth.py:45", tags=["jwt", "bug"])
results = mem.search("authentication error", k=3)
print(results[0].path, results[0].score)
```
## Store Location

LLMFS looks for a store in this order:

1. `--llmfs-path` flag or `LLMFS_PATH` environment variable
2. `.llmfs/` in the current directory
3. `~/.llmfs` (global fallback)
```bash
# Use a project-local store
llmfs init  # creates .llmfs/ in cwd

# Use a custom path
llmfs --llmfs-path /tmp/my-store write /hello "world"
```
```python
# Default: ~/.llmfs or .llmfs/ in cwd
mem = MemoryFS()

# Custom path
mem = MemoryFS(path="/tmp/myproject-memory")
```
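The precedence order above can be sketched as a small resolver. This is a minimal illustration, not llmfs internals: the `resolve_store` function and its parameters are hypothetical, written only to make the flag > env var > local > global ordering concrete.

```python
import os
from pathlib import Path

def resolve_store(explicit_path=None, env=None, cwd=None):
    """Resolve a store path (illustrative sketch, not the llmfs API):
    --llmfs-path flag > LLMFS_PATH env var > ./.llmfs if present > ~/.llmfs."""
    env = os.environ if env is None else env
    cwd = Path.cwd() if cwd is None else Path(cwd)
    if explicit_path:                     # 1. explicit flag wins
        return Path(explicit_path)
    if env.get("LLMFS_PATH"):             # 2. environment variable
        return Path(env["LLMFS_PATH"])
    local = cwd / ".llmfs"
    if local.exists():                    # 3. project-local store
        return local
    return Path.home() / ".llmfs"         # 4. global fallback
```

Passing `env` and `cwd` explicitly keeps the sketch testable without touching real process state.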
## What Happens Under the Hood

When you write a memory:

- Content is split into chunks (code-aware AST splitting or prose-aware paragraph splitting)
- Each chunk is embedded with `all-MiniLM-L6-v2` (local, CPU, ~22 MB)
- Vectors are stored in ChromaDB (HNSW index for sub-linear search)
- Metadata is stored in SQLite (WAL mode for concurrent reads)
- If `auto_link=True`, similar existing memories are linked with `related_to` edges
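The prose-aware side of the first step can be sketched with a blank-line paragraph splitter. This is a simplified illustration under stated assumptions, not the library's actual chunker: the `split_paragraphs` helper and its `max_chars` budget are hypothetical.

```python
import re

def split_paragraphs(text, max_chars=800):
    """Sketch of prose-aware chunking: split on blank lines,
    then pack paragraphs into chunks of at most max_chars."""
    paras = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]
    chunks, current = [], ""
    for p in paras:
        # Start a new chunk if adding this paragraph would overflow.
        if current and len(current) + len(p) + 2 > max_chars:
            chunks.append(current)
            current = p
        else:
            current = f"{current}\n\n{p}" if current else p
    if current:
        chunks.append(current)
    return chunks
```

Keeping paragraphs intact (rather than cutting at a fixed byte offset) is what makes each chunk a coherent unit to embed.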
When you search:
- Query is embedded and searched against ChromaDB (dense/semantic)
- Query is also searched via SQLite FTS5 (sparse/BM25 keyword)
- Results are fused with Reciprocal Rank Fusion
- Results are de-duplicated and ranked by combined score
- Results are cached for 5 minutes
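The fusion and de-duplication steps above can be illustrated with plain-Python Reciprocal Rank Fusion over two ranked result lists. The `rrf_fuse` helper and the conventional `k=60` smoothing constant are illustrative, not llmfs internals:

```python
def rrf_fuse(dense, sparse, k=60):
    """Fuse two ranked lists of ids with Reciprocal Rank Fusion:
    score(d) = sum over lists of 1 / (k + rank(d)), ranks starting at 1."""
    scores = {}
    for ranking in (dense, sparse):
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # De-duplication falls out naturally: each id keys `scores` once.
    return sorted(scores, key=scores.get, reverse=True)

dense = ["/knowledge/a", "/knowledge/b", "/knowledge/c"]   # semantic hits
sparse = ["/knowledge/b", "/knowledge/d"]                  # BM25 hits
print(rrf_fuse(dense, sparse))
# → ['/knowledge/b', '/knowledge/a', '/knowledge/d', '/knowledge/c']
```

Documents found by both retrievers (here `/knowledge/b`) accumulate score from each list, which is why hybrid hits rank above single-list hits.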
## Next Steps

- Python API Reference — all methods and options
- CLI Reference — all commands
- Memory Layers — `short_term`, `session`, `knowledge`, `events`
- Infinite Context — the ContextMiddleware
- MCP Server — use with Claude, Cursor, Windsurf