rose-ash

Author	SHA1	Message	Date
giles	cfa68c3db3	search: synonym / query expansion + 9 tests Some checks failed Test, Build, and Deploy / test-build-deploy (push) Failing after 19s Details A synonym map [(Term,[Term])] expands a query term to itself + synonyms (expandTerm); synDocs unions and synRankTfIdf ranks the expanded set. 214/214. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 23:27:03 +00:00
giles	cf4e613e43	search: proximity/NEAR search + 9 tests Some checks failed Test, Build, and Deploy / test-build-deploy (push) Failing after 24s Details nearDocs k t1 t2 returns docs where both terms occur within k positions (unordered); candidates from the posting intersection, filtered on positional postings. 205/205. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 23:01:42 +00:00
giles	911a2f57c0	search: stemming (suffix stripping) + 18 tests Some checks failed Test, Build, and Deploy / test-build-deploy (push) Failing after 16s Details Deterministic English suffix stripping (stem), stemText/stemTokens, indexStemmed. Worked around two haskell-on-sx string gotchas: take/drop over a String yield char codes (rebuild via joinChars . map chr), and isSuffixOf's reverse trips ++ (manual suffix compare). 196/196. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 22:50:19 +00:00
giles	7231cb651f	search: highlight + snippet generation + 12 tests Some checks failed Test, Build, and Deploy / test-build-deploy (push) Failing after 26s Details highlight marks query-matching (normalized) tokens with [..]; snippet extracts a context window around the first match. 178/178. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 22:08:00 +00:00
giles	5945b51cfd	search: fuzzy matching via edit distance + 18 tests Some checks failed Test, Build, and Deploy / test-build-deploy (push) Failing after 41s Details editDist as an O(m*n) row-based Levenshtein DP (naive recursion is exponential and times out under load); fuzzyTerms/fuzzyDocs/fuzzyRankTfIdf expand a term to indexed terms within a max edit distance. 166/166. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 21:47:56 +00:00
giles	3ab8270a58	search: result pagination (offset/limit) + 12 tests Some checks failed Test, Build, and Deploy / test-build-deploy (push) Failing after 26s Details paginate windows a ranked list (take lim . drop off); pageTfIdf/pageBm25 and resultCount. 148/148. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 20:55:25 +00:00
giles	9d3b775b25	search: prefix/wildcard queries + 14 tests Some checks failed Test, Build, and Deploy / test-build-deploy (push) Failing after 31s Details prefixTerms matches indexed terms by prefix (allTerms + isPrefixOf); prefixDocs unions their docs; prefixRankTfIdf ranks via the matched terms. 136/136. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 20:22:23 +00:00
giles	77ab827b91	search: Phase 4 federation merge + ACL post-filter + 21 tests Some checks failed Test, Build, and Deploy / test-build-deploy (push) Failing after 39s Details fedIndex merges per-peer inverted indices (union posting lists per term) after relabelling local DocIds to global gid = peer*1000 + local — dedupe by (peer,doc-id) is automatic and positions survive, so ranking runs once over the merge and interleaves peers by score. ACL is a post-rank filter over an injected permit predicate (searchTfIdfAcl/topNTfIdfAcl/searchBm25Acl). Roadmap complete, 122/122. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 20:08:08 +00:00
giles	a3f9d4f6c9	search: Phase 3 ranking TF-IDF + BM25 + top-N + 23 tests Some checks failed Test, Build, and Deploy / test-build-deploy (push) Failing after 37s Details rankTfIdf and rankBm25 (configurable k1/b) over the candidate set, float scores with deterministic DocId tiebreak; topNTfIdf/topNBm25. df/idf derived from posting-list length. Tests cover tf/idf behavior, a BM25-vs-TF-IDF flip from length-norm + tf-saturation, the b-parameter effect, tiebreak stability. 101/101. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 19:56:50 +00:00
giles	4c84decc01	search: Phase 2 query parser + 32 tests Some checks failed Test, Build, and Deploy / test-build-deploy (push) Failing after 46s Details Query tokenizer + recursive-descent parser: OR<AND<NOT precedence, implicit AND on adjacency, quoted phrases, parens, case-insensitive keywords. parseQuery, searchQuery, showQ. Worked around haskell-on-sx parser limits (ord-based delimiters; multi-clause fns instead of []-pattern case alts). 78/78. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 19:43:10 +00:00
giles	0f0da0319c	search: Phase 2 query AST + boolean/phrase eval + 28 tests Some checks failed Test, Build, and Deploy / test-build-deploy (push) Failing after 53s Details Query ADT (Term\|And\|Or\|Not\|Phrase) and evalQuery over docid-sorted posting lists: boolean ops as linear merges, Not over the allDocs universe, Phrase via positional adjacency. Batched both test suites into one program eval each (search-batch) so they finish under heavy CPU load. 46/46. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 18:47:42 +00:00
giles	b8cf3eb1b8	search: Phase 1 tokenizer + inverted index + 18 tests Some checks failed Test, Build, and Deploy / test-build-deploy (push) Failing after 53s Details Tokenizer (lowercase, strip punctuation, positions) and a sorted assoc-list inverted index [(Term,[(DocId,[Pos])])] with indexDoc/deleteDoc/lookupTerm/ docFreq/allTerms. Search lib is haskell-on-sx source assembled into search/src; tests reuse hk-test counters via a search-eval helper. conformance.sh models lib/haskell. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 18:21:49 +00:00

12 Commits