Roadmap¶
This roadmap outlines the planned development of Grafeo. Priorities may shift based on community feedback and real-world usage.
0.1.x - Foundation¶
Building a fully functional graph database
Core Database¶
- Labeled Property Graph (LPG) storage model
- MVCC transactions with snapshot isolation
- Multiple index types (hash, B-tree, trie, adjacency)
- Write-ahead logging (WAL) for durability
- In-memory and persistent storage modes
Query Languages¶
- GQL (ISO standard) - full support
- Cypher - experimental
- Gremlin - experimental
- GraphQL - experimental
- SPARQL - experimental
Data Models¶
- LPG - full support
- RDF - experimental
Bindings¶
- Python - full support via PyO3
- NetworkX integration - experimental
- solvOR graph algorithms - experimental
0.2.x - Performance¶
Competitive with the fastest graph databases
Performance Improvements¶
- Factorized query processing for multi-hop traversals (avoid Cartesian products)
- Worst-case optimal joins via Leapfrog TrieJoin for cyclic patterns (O(N^1.5) triangles)
- Lock-free concurrent reads using DashMap-backed hash indexes (4-6x improvement)
- Direct lookup APIs bypassing query planning for O(1) point reads (10-20x faster)
- Query plan caching with LRU cache for repeated queries (5-10x speedup)
- NUMA-aware scheduling with same-node work-stealing preference
New Features¶
- Ring Index for RDF (
ring-indexfeature) - 3x space reduction using wavelet trees - Block-STM parallel execution (
block-stmfeature) - optimistic parallel transactions - Tiered hot/cold storage (
tiered-storagefeature) - compressed epoch archival - Succinct data structures (
succinct-indexesfeature) - rank/select bitvectors, Elias-Fano
Expanded Support¶
- RDF - full support with Ring Index and SPARQL optimization
- All 5 query languages promoted to full support
- NetworkX and solvOR integrations promoted to full support
0.3.x - AI Compatibility¶
Ready for modern AI/ML workloads
Vector Features¶
- Vector Type - First-class
Vectortype with f32 storage (4x compression vs f64) - Distance Functions - Cosine, Euclidean, Dot Product, Manhattan metrics
- HNSW Index - O(log n) approximate nearest neighbor search with batch insert/search
- Hybrid Queries - Combine graph traversal with vector similarity in GQL/Cypher/SPARQL
- Serializable Isolation - SSI for write skew prevention and strong consistency
Vector Quantization¶
- Scalar Quantization - f32 → u8, 4x compression with ~97% recall
- Binary Quantization - f32 → 1 bit, 32x compression with SIMD-accelerated hamming distance
- Product Quantization - Codebook-based 8-32x compression with asymmetric distance computation
- QuantizedHnswIndex - Two-phase search with rescoring support
- SIMD Acceleration - AVX2+FMA, SSE, and NEON support for 4-8x faster distance computations
Vector Storage & Search¶
- Memory-Mapped Vector Storage - Disk-backed storage with LRU cache for large datasets
- VectorScan Operators - HNSW and brute-force search in query execution plans
- VectorJoin Operator - Hybrid graph pattern + vector similarity search
- Vector Zone Maps - Centroid and bounding box pruning for block skipping
- Vector Cost Estimation - HNSW O(ef * log N) and brute-force O(N) cost models
- Python Quantization API - Full quantization support from Python
Execution & Quality¶
- Selective Property Loading - Projection pushdown for O(N*K) vs O(N*C) reads
- Parallel Node Scan - Morsel-driven parallel execution for 3-8x speedup on large scans
- Query Performance Metrics - Execution timing and row counts on query results
- Error Message Suggestions - Fuzzy "Did you mean X?" hints for undefined variables and labels
- Adaptive WAL Flusher - Self-tuning background flush with consistent cadence
Syntax Support¶
-- Vector literals and similarity functions
MATCH (m:Movie)
WHERE cosine_similarity(m.embedding, $query) > 0.8
RETURN m.title
-- Create vector index
CREATE VECTOR INDEX movie_embeddings ON :Movie(embedding)
WITH (dimensions: 384, metric: 'cosine')
0.4.x - Developer Accessibility¶
Reach more developers
New Bindings (Experimental)¶
- Node.js / TypeScript (
@grafeo-db/js) - native bindings with full type definitions - WebAssembly (
@grafeo-db/wasm) - raw WASM binary for edge runtimes - Go - CGO bindings for cloud-native applications
Ecosystem Integration¶
- grafeo-server - standalone HTTP server with web UI, Docker image
- grafeo-web - Grafeo in the browser via IndexedDB, Web Workers, React/Vue/Svelte
Query Languages¶
- SQL/PGQ (SQL:2023) - GRAPH_TABLE function for SQL-native graph queries
Quality¶
- Continued bug fixes
- Stricter linting rules
- Performance tuning based on real-world usage
0.5.x - Beta¶
Preparing for production readiness
Focus Areas¶
- Performance optimizations across all components
- Lots of bug hunting and fixing
- Documentation improvements, user guides and examples
- API stabilization
Goal¶
- Ready for production evaluation
- Clear path to 1.0
Future Considerations¶
These features are under consideration for future releases:
- Additional language bindings (Java/Kotlin, Swift)
- Distributed/clustered deployment
- Cloud-native integrations
Contributing¶
Interested in contributing to a specific feature? Check our GitHub Issues or join the discussion.
Last updated: February 2026