code-completion model (Qwen2.5-coder) rewrites already written code instead of just completing it

Smorty [she/her]@lemmy.blahaj.zone · 2 months ago

code-completion model (Qwen2.5-coder) rewrites already written code instead of just completing it

lynx@sh.itjust.works · edit-2 2 months ago

If you want in line completions, you need a model that is trained on “fill in the middle” tasks. On their Huggingface page they even say that this is not supported and needs fine tuning:

We do not recommend using base language models for conversations. Instead, you can apply post-training, e.g., SFT, RLHF, continued pretraining, etc., or fill in the middle tasks on this model.

A model that can do it is:

starcoder2
codegemma
codellama

Another option is to just use the qwen model, but instead of only adding a few lines let it rewrite the entire function each time.

Smorty [she/her]@lemmy.blahaj.zone · 2 months ago

Have a look at the other comments. Sometimes it does fill in the code correctly, even without any prompting! The template specifically has the fill in the middle part in it.

The ollama site has the template with <fim_prefix> and such.