duzx16
|
6615fdd072
Add GLM-Base model
|
2 éve |
duzx16
|
43b27f8711
Add GLM-Large-Generation model
|
2 éve |
duzx16
|
61a6dfcb0d
Add configs for smaller models
|
2 éve |
duzx16
|
dff6780e6b
Implement unidirectional multichoice
|
2 éve |
duzx16
|
1783d9b6c5
Implement multichoice task for GLM-10B
|
2 éve |
duzx16
|
b030789021
Implement generation task for GLM-10B
|
2 éve |
Sengxian
|
634061a1e7
Add BIG-bench evaluation
|
2 éve |
Sengxian
|
8b3896dd56
Update comments
|
2 éve |
Sengxian
|
1f32e7faa3
Fix first beam
|
2 éve |
Sengxian
|
e131c8b557
Fix type bugs
|
3 éve |
Sengxian
|
605b13758a
Merge remote-tracking branch 'origin/batch-generation' into batch-generation
|
3 éve |
Sengxian
|
478eb5c9d0
Fix batch generation bugs
|
3 éve |
duzx16
|
2948e546b1
Remove max_gen_length argument in generate_text
|
3 éve |
Sengxian
|
4f5910ccd2
Merge branch 'batch-generation' of github.com:duzx16/GLM-130B
|
3 éve |
Sengxian
|
a361c5c843
Update quantization results
|
3 éve |
Aohan Zeng
|
0bdb6d2a92
Merge pull request #22 from THUDM/quantization
|
3 éve |
Sengxian
|
96623d7cbc
Merge branch 'quantization' of github.com:THUDM/GLM-130B into quantization
|
3 éve |
Sengxian
|
00f6ea61a3
Merge remote-tracking branch 'origin/main' into quantization
|
3 éve |
Shaw
|
0daf7051fc
Update quantization.md
|
3 éve |
Shaw
|
d405d89c87
Update README.md
|
3 éve |
Sengxian
|
28e449b79f
Update quantization docs and scripts
|
3 éve |
Sengxian
|
6b410ef9d2
Add checkpoint tensor parallel conversion script
|
3 éve |
Zhengxiao Du
|
7be5ba1758
Fix finalize in BeamSearchStrategy
|
3 éve |
Zhengxiao Du
|
3bb0f456d1
Fix top_p argument in generate.py
|
3 éve |
Zhengxiao Du
|
1241f03ec2
Fix sampling in BaseStrategy
|
3 éve |
Shaw
|
a7a1bfb806
Update README.md
|
3 éve |
Shaw
|
4c5be1093d
Create README.md
|
3 éve |
Zhengxiao Du
|
26543554f8
Remove redundant imports
|
3 éve |
Zhengxiao Du
|
10677a8dc2
Fix consider_end
|
3 éve |
Zhengxiao Du
|
e7d58c7d9d
Fix generate.py
|
3 éve |