DeepSeek open-sources inference optimizations with 60–85% faster generation
DeepSeek has open-sourced a set of inference optimizations that make generation 60–85% faster, as detailed in a technical paper.