I need to scan very large JSONL files efficiently and am considering a parallel grep-style approach over line-delimited text.

Would love to hear how you would design it.

  • FizzyOrange
    link
    fedilink
    arrow-up
    2
    ·
    20 days ago

    I think mmap is unlikely to be the best option seeing as you’d be doing large sequential reads.