Hello, I am currently using codename goose as an AI client to proofread and help me with coding. I have it setup towards Googles Gemini, however I find myself quickly running out of tokens with large files. I was wondering if there are any easy way to self host an AI with similar capabilites but still have access to read and write files. I’ve tried both ollama and Jan, but neither have access to my files. Any recommendations?

  • nocteb@feddit.org
    link
    fedilink
    English
    arrow-up
    3
    ·
    edit-2
    1 day ago

    Look into setting up the “continue” plugin in vs code. It supports an ollama backend and can even do embeddings if setup correctly. That means it will try to select files itself based on your question which helps with prompt size. Here is a link to get started, you might need to choose smaller models with your card.

    https://ollama.com/blog/continue-code-assistant