Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Anyone knows how to get this working locally with vscode?



That is not what I consider "local", since that uses cloud inference by default (and last I checked, they provided no useful guidance for changing that).

I don’t consider cloud inference to count as getting it working “locally” as requested by the comment above yours.

Refact worked nicely and worked locally when I tried it a few weeks ago, but the challenge with any new model is making it be supported by the existing software: https://github.com/smallcloudai/refact/


"Choose your model Requests for code generation are made via an HTTP request.

You can use the Hugging Face Inference API or your own HTTP endpoint, provided it adheres to the API specified here[1] or here[2]."

It's fairly easy to use your own model locally with the plugin. You can just use the one of the community developed inference servers, which are listed at the bottom of the page, but here's the links[3] to both[4].

[1]: https://huggingface.co/docs/api-inference/detailed_parameter...

[2]: https://huggingface.github.io/text-generation-inference/#/Te...

[3]: https://github.com/wangcx18/llm-vscode-inference-server

[4]: https://github.com/wangcx18/llm-vscode-inference-server


I have the same question, and more generally: Any generic way of doing this for any of the open source or semi open source models, especially Mistral[0]?

[0] https://news.ycombinator.com/item?id=37675496





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: