

mrob 1 day ago | on: No AI* Here – A Response to Mozilla's Next Chapter

Searching the transcript has the problem of missing synonyms. This can be solved by the one undeniably useful type of AI: embedding vector search. Embeddings for each line of the transcript can be calculated in advance and compared with the embedding of the user's search query. These models need only a few hundred million parameters for good results.
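A minimal sketch of the idea above, assuming nothing about which model or library would actually be used. `toy_embed` and its `CONCEPTS` table are hypothetical stand-ins for a real embedding model: they place known synonyms on the same axis by hand, which a trained model would learn instead. The part that carries over is the shape of the pipeline: embed every transcript line once, embed the query at search time, rank by cosine similarity.

```python
import math

# Hypothetical stand-in for a learned embedding model: synonyms share a
# concept dimension. A real system would call a trained model here.
CONCEPTS = {
    "car": 0, "automobile": 0, "vehicle": 0,
    "fast": 1, "quick": 1, "rapid": 1,
    "browser": 2, "firefox": 2,
}
DIM = 3


def toy_embed(text):
    """Return a unit-length concept vector (or all zeros if no known words)."""
    vec = [0.0] * DIM
    for word in text.lower().split():
        word = word.strip(".,!?")
        if word in CONCEPTS:
            vec[CONCEPTS[word]] += 1.0
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def cosine(a, b):
    """Dot product; equals cosine similarity because inputs are unit or zero."""
    return sum(x * y for x, y in zip(a, b))


def build_index(lines):
    """Precompute one embedding per transcript line."""
    return [(line, toy_embed(line)) for line in lines]


def search(index, query, top_k=1):
    """Embed the query and return the top_k most similar lines."""
    q = toy_embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [line for line, _ in ranked[:top_k]]


transcript = [
    "The automobile was rapid.",
    "Firefox is a browser.",
    "Nothing relevant here.",
]
index = build_index(transcript)
result = search(index, "quick car")
```

Here the query "quick car" shares no words with the first transcript line, so a plain substring search would miss it, but the vectors match and that line is ranked first.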

andai 12 hours ago

Yeah, but embedding search fails surprisingly hard at what grep does well: exact string matches. So the best systems use both simultaneously:

https://www.anthropic.com/engineering/contextual-retrieval
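One way "both simultaneously" could look, as a rough sketch rather than the linked article's method: run an exact-match ranking and an embedding ranking separately, then merge them with reciprocal rank fusion, a common technique for combining ranked lists. `keyword_rank` is a hypothetical grep-like scorer, and the `semantic` ranking is hard-coded here as an assumed output of an embedding search that placed the exact identifier last.

```python
from collections import defaultdict


def keyword_rank(lines, query):
    """Rank line indices by count of exact (case-insensitive) term matches."""
    terms = query.lower().split()
    scored = [
        (sum(line.lower().count(t) for t in terms), i)
        for i, line in enumerate(lines)
    ]
    # Highest score first; drop lines with no literal match at all.
    return [i for score, i in sorted(scored, key=lambda s: (-s[0], s[1])) if score > 0]


def reciprocal_rank_fusion(rankings, k=60):
    """Merge ranked lists: each list adds 1 / (k + rank) to a document's score."""
    fused = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            fused[doc_id] += 1.0 / (k + rank)
    return sorted(fused, key=lambda d: (-fused[d], d))


lines = [
    "Error code TS-999 appeared at startup.",
    "The program crashed when it launched.",
    "We had lunch afterwards.",
]
query = "TS-999"

exact = keyword_rank(lines, query)   # literal matches only
semantic = [1, 2, 0]                 # assumed embedding ranking (hypothetical)
fused = reciprocal_rank_fusion([exact, semantic])
```

With these inputs the line containing the literal identifier ends up first in `fused` even though the assumed embedding ranking put it last, which is the failure mode the exact-match side is there to cover.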
