Sengxian
|
a361c5c843
Update quantization results
|
3 rokov pred |
Aohan Zeng
|
0bdb6d2a92
Merge pull request #22 from THUDM/quantization
|
3 rokov pred |
Sengxian
|
96623d7cbc
Merge branch 'quantization' of github.com:THUDM/GLM-130B into quantization
|
3 rokov pred |
Sengxian
|
00f6ea61a3
Merge remote-tracking branch 'origin/main' into quantization
|
3 rokov pred |
Shaw
|
0daf7051fc
Update quantization.md
|
3 rokov pred |
Shaw
|
d405d89c87
Update README.md
|
3 rokov pred |
Sengxian
|
28e449b79f
Update quantization docs and scripts
|
3 rokov pred |
Sengxian
|
6b410ef9d2
Add checkpoint tensor parallel conversion script
|
3 rokov pred |
Shaw
|
a7a1bfb806
Update README.md
|
3 rokov pred |
Shaw
|
4c5be1093d
Create README.md
|
3 rokov pred |
Sengxian
|
113f5f1364
Add language modeling task
|
3 rokov pred |
Sengxian
|
c64d6ea33c
Fix quantization argument bug
|
3 rokov pred |
Sengxian
|
21cadf7677
Add load from quantized checkpoint
|
3 rokov pred |
Sengxian
|
96eac9f33b
Add 4-bit quantization and CUDA kernels
|
3 rokov pred |
Sengxian
|
e10b098020
Add 8-bit quantization
|
3 rokov pred |
Shaw
|
5b1b0bf8ca
Update README_zh.md
|
3 rokov pred |
Aohan Zeng
|
a8a1d5818b
Merge pull request #7 from erjanmx/fix-readme-typo
|
3 rokov pred |
Erjan Kalybek
|
738695c6c9
Fix readme typo
|
3 rokov pred |
Shaw
|
11befcbf80
Fix script name in README
|
3 rokov pred |
xiao9905
|
22d1a03c3e
Merge branch 'main' of github.com:THUDM/GLM-130B into main
|
3 rokov pred |
xiao9905
|
964695129f
update English version of the training logs
|
3 rokov pred |
Sengxian
|
20c0a9eca2
Fix script name in README
|
3 rokov pred |
Shaw
|
0aefc909ba
Fold acknowledgement list
|
3 rokov pred |
Shaw
|
63298822e7
Fold acknowledge list
|
3 rokov pred |
Shaw
|
d4a1738503
Fold acknowledgement list
|
3 rokov pred |
Shaw
|
0bf31162bf
Fold acknowledgement list
|
3 rokov pred |
Shaw
|
99509f09cc
add link to GLM
|
3 rokov pred |
Sengxian
|
737be7c740
Initial commit
|
3 rokov pred |