Real-Time BitNet LLM Response on a Web Interface with Node.js and Socket.io
A web interface designed for submitting queries and viewing real-time responses through a user-friendly UI. Built with Node.js for the frontend and a Python Socket server for backend processing, the system streams responses line by line in real time.
Enter your query, hit submit, and watch the output appear live as it's generated.
Repository link: BitNet.Js
Can this repo be mentioned in the Readme as Node.Js implementation of the cpp code?
Does it keep context?
I believe this functionality is not currently supported by the official code, but as new features are introduced, I will continue to enhance my repository to maintain compatibility and improve functionality
I've thrown something similar together myself today: https://github.com/grctest/Electron-BitNet
You still need to follow Microsoft's instructions to build the required 1-bit LLM models though, the files are too big to include in either the repo or the releases