3.5
2026-05-06
Changes since 3.2.5:
Sorry for the rough edges in the last few updates. Version 3.5 is the cleanup release that should have landed sooner.
If dense PDFs, exact-value lookups, starter prompts, or long-running imports felt less reliable than they should have, this is the corrective pass. It rolls up the real fixes shipped after 3.2.5 and makes the app more dependable on hard documents.
- Exact answers are stronger across Standard, Deep Think, and Maximum for source-backed questions over tables, specs, measurements, counts, dates, prices, and similar exact values.
- Starter questions and follow-ups are more grounded and less likely to surface weak, generic, or misleading prompts.
- Onboarding, empty states, and the bundled sample workspace explain the app more clearly, including best-supported file types, the 4,096-token model limit, and when processing stays on-device versus uses Apple Private Cloud Compute.
- PDFs and images now share one adaptive visual-ingestion path, searchable figures and structured tables survive more often, and clean scientific PDFs are less likely to produce fake tables, broken headings, or reference-section noise.
- Table-heavy pages are less likely to collapse back into scrambled paragraph text during ingestion, improving retrieval quality after re-import.
- Large user-initiated imports are more reliable, with better queue recovery, background cleanup, and long-running import handling.
- Library and settings surfaces now describe per-library isolation and live runtime behavior more accurately.
This release is about making OpenIntelligence more grounded, more predictable, and more trustworthy on real documents.
3.2.5
2026-04-25
Version 3.2.5
What's changed since 3.2:
- Exact fact lookups are much more direct. If the answer is clearly present in a table, spec row, or short source passage, OpenIntelligence now locks onto that source value faster instead of overthinking it.
- Deep Think and Maximum now run a precision lookup before longer reasoning, so simple questions can still get quick cited answers even in the higher-effort modes.
- Standard, Deep Think, and Maximum share stronger table/spec retrieval rescue for measurements, capacities, limits, prices, counts, and other exact values.
- Suggested starter questions are now generated from actual passages instead of loose document labels, so "Ask something real" should be much more relevant to the library you uploaded.
- The answer text for exact measurements is cleaner, including nearby equivalent units when the source provides them.
- This is a fast corrective update for answer-quality regressions in the 3.2 line. Sorry for the quick follow-up, and thank you for bearing with the pace while I tighten the engine.
2.1.1
2026-04-08
Version 2.1.1
SETTINGS & ABOUT
- Added a public product hub with direct links to the roadmap, feedback board, changelog, GitHub repo, and App Store listing
- Cleaned up About and Settings messaging so in-app copy matches the current product more closely
LIFETIME COHORT
- Lifetime plan messaging now focuses on concrete limits and one-time purchase terms
- Added a Lifetime banner in Settings with quick links to the product hub and changelog
CHAT POLISH
- Refreshed the quality mode picker so Standard, Deep Think, and Maximum feel more visually distinct
- Added clearer mode labels like Fastest, Iterative, and Full sweep
BILLING & APP STORE
- Updated App Store metadata and in-app billing copy to better describe what OpenIntelligence actually does
- Added export compliance declaration for smoother App Store uploads
What’s next?
- Looking into OpenClaw/Claude integration as well as many many many many others.
Question for users:
What would you like to see?
2.1
2026-03-10
Version 2.1
WHAT YOU'LL NOTICE
Smarter Answers
- The AI now tells you when your documents don't cover a question instead of making something up
- Topical mismatch detection: when your question barely overlaps with retrieved content, Evidence-First mode activates — the AI states what evidence exists instead of hallucinating
- 3 new verification gates (7 total, A-G): Semantic Grounding checks the answer matches sources, Quote Faithfulness checks quoted text actually appears, Generation Quality catches repetitive output
- Critical gates can trigger full abstention — the app refuses to answer rather than answer wrong
Deep Think & Maximum Mode
- Fixed app freeze when sending a second question while one is processing (two reasoning chains were fighting over Apple FM)
- Cancel-and-replace: old query cancels cleanly, new one takes over
- Cancellation checks at all 17 LLM call sites — no wasted processing
Suggested Questions
- Fixed prompt contamination — hardcoded Kia Sportage examples were biasing every library toward car questions
- Questions now reference specific details from your actual documents
- Switching libraries instantly clears stale suggestions — no more flash of wrong questions
- Natural conversational tone instead of robotic templates
Cross-Library Isolation
- Answers no longer leak between libraries when switching mid-conversation (session ID + 8 guard checks)
- Entity index, full-text search, and suggestions fully scoped per library
- Deleting a library properly cleans all cross-references
Onboarding Rebuilt
- 2-page flow: welcome cards → live pipeline theater showing your docs being processed in real time
- Metrics dashboard: words, chunks, vectors, processing time
- 3 built-in sample docs (Pricing, RAG Architecture, Apple Intelligence) — bypass free tier limit
- Haptic feedback on all 6 interactions, VoiceOver labels throughout, reduce motion support
- Retry button when sample import fails
AI Hub Results
- Key Facts, Step-by-Step, Plain English, What's Missing, Illustrate now render with full markdown (bold, bullets, headers)
- New Share button alongside Copy and Insert
- Sheet starts half-height, draggable to full
Other Fixes
- Alert dialogs can be dismissed again (was stuck due to .constant() binding)
- "Show All Insights" now shows a proper list (was an empty TODO)
- Image Playground no longer biased toward car themes (removed Ford Mustang few-shot example)
- Settings: dynamic confidence threshold, duplicate Vietnamese removed, gates label corrected to A-G
- About screen shows actual version instead of hardcoded 1.0.0
UNDER THE HOOD
Safety
- 37 force-unwrap crash sites eliminated across 26 files (guard let, if let, default values)
- 10 fatalError() calls → graceful URL.temporaryDirectory fallbacks
- StoreKit stream continuation crash fixed (IUO → AsyncStream.makeStream)
- 3 silent Vision catch blocks now log properly
Dead Code
- 1,278 lines removed: SettingsRootView (701), DeveloperSettingsView (350), OpenAIResponsesAPIService (178), VecturaVectorDatabase (53), TipKit overlays
- 15 test files deleted (2,686 lines) — were never wired into a test target
Search Internals
- BM25Scorer: class → struct with Storage reference type, fresh NLTokenizer per call (thread safety), lemmatization removed, division-by-zero guard
- Font cipher detection prevents 93% content loss on encoded PDFs (Jaccard similarity)
- Raw regex fix (5 patterns), garbled image extraction fix, dynamic image text budget
Concurrency
- Swift 6 strict concurrency: 11 files with nonisolated(unsafe) annotations, zero runtime change
- GPU mode relabeling: Performance→Maximum, Balanced→Performance, new Balanced tier
Developer
- 25+ views gained #Preview macros for Xcode canvas
- Pipeline X-Ray Studio: new 5,540-line web app for visualizing pipeline traces (dev tool, not in app)
Includes all v2.0 features: 29-step RAG pipeline, AI Hub, 3-tier Metal GPU, device-specific OCR, markdown rendering, 102 services.
2.0
2026-03-01
Version 2.0
AI-POWERED DOCUMENT TRANSFORMS
New AI Hub toolbar with 5 document-aware transforms -- each uses your actual retrieved source chunks, not just the AI response text.
- Key Facts: Source-backed bullet points with document/page attribution
- Step-by-Step: Real procedures with specs and part numbers from your docs
- Plain English: Simplifies complex technical content into accessible language
- What's Missing?: Identifies gaps between your question and the retrieved answer
- Illustrate: Image Playground visualization powered by LLM concept extraction
SEARCH & RETRIEVAL UPGRADES
- BM25 length normalization aligned for better recall on uniform-size chunks
- 5 regex patterns pre-compiled as static constants
- NLTagger lemmatization in keyword search
- Anti-hallucination Gate E uses hardware-optimized cosine similarity
- Verification Gate thresholds tuned to reduce false abstentions
IMAGE PLAYGROUND
On-device LLM translates domain jargon into visual scenes. No more try another description on technical content.
FORMATTED RESPONSES
Full markdown rendering: headers, bullets, bold, code blocks, block quotes. 6 regex patterns normalize Apple FM single-line output.
PERFORMANCE (per Apple Silicon chip)
- 2-4x parallel cross-encoder reranking via TaskGroup
- 3x faster MLMultiArray fills with dataPointer bulk copy
- Device-specific OCR concurrency (A19 Pro 8 ops, A18 Pro 6, A17 Pro 4)
- 50-80% OCR skip rate on clean digital PDFs
- 3-tier Metal GPU: threadgroup/SIMD4/scalar auto-selected per query size
- 5 OCR candidates evaluated per recognition (was 3)
RELIABILITY
- Compression capped at 5 chunks with fresh LLM session per chunk
- 12-second compression time budget
- Rate-limit retry with 2s backoff
- Extractive fallback upgraded: 6 chunks x 500 chars with section titles
ZERO-DATA-LOSS INGESTION
- Font cipher detection prevents 93% content loss on font-encoded PDFs
- 5 Unicode regex patterns fixed
- Image batch reduced from 20 to 5 pages for lower memory usage
STABILITY
- MMR crash fixed (GPU matrix edge case)
- StoreKit 5s timeout (was 30-60s hang offline)
- BGTask registration fixed
- Swift 6 concurrency: 11 files updated, zero runtime change
- 102 services, 175+ tests across 15 files