Blogposts
Articles written by me/ by my group
- Are LLMs Truly Solving Software Problems — or Are Agents Doing It?
- BloomChat-v2 a long sequence 176B model
- BloomChat a 176B multilingual chat model
- Training long sequence models
- Alibi Interpolation vs Extrapolation
- Achieving GPT3 accuracy usig a 10x smaller model
- Pushing the limits of Neural Network Compression
- TinyML Applications Require New Network Architectures
- Skip-RNN work
