Ning
|
b20ffea609
Unity.cpp dec sync (#257)
|
1 year ago |
Ning
|
42365dfb74
Cherrypick allocr related changes from public (#247)
|
1 year ago |
Guillaume Wenzek
|
a768cdf55f
Unity inc (#159)
|
1 year ago |
Guillaume Wenzek
|
353a001d64
fix merge
|
1 year ago |
Guillaume Wenzek
|
f2ef995b95
format/isort
|
1 year ago |
Guillaume Wenzek
|
c25c280302
dtype("bool")
|
1 year ago |
Guillaume Wenzek
|
2f71737adc
skip tests that can't run locally
|
1 year ago |
Guillaume Wenzek
|
d5b035f230
fix PositionalEmbedding with cache
|
1 year ago |
Guillaume Wenzek
|
a771adc782
nicer ctypes_utils
|
1 year ago |
Guillaume Wenzek
|
f6d810543d
fix incremental decoding !
|
1 year ago |
Guillaume Wenzek
|
522b97234e
WIP: simple failing test case
|
1 year ago |
Guillaume Wenzek
|
cc23e2b1c7
prepare tests for flash_attn
|
1 year ago |
Guillaume Wenzek
|
eb7810b81f
force little-endian
|
1 year ago |
Guillaume Wenzek
|
6fbb465f2b
generate_sequence return full results
|
1 year ago |
Guillaume Wenzek
|
86993cbd00
fix StandardTransformerEncoder
|
1 year ago |
Guillaume Wenzek
|
28ed039370
fix MultiheadAttention_forward
|
1 year ago |
Guillaume Wenzek
|
b7b31a3978
rm python 3.11 idiom
|
1 year ago |
Guillaume Wenzek
|
81cdf80eb9
WIP: MultiheadAttention_forward
|
1 year ago |
Guillaume Wenzek
|
f2b5007277
start testing with batch size > 1
|
1 year ago |
Guillaume Wenzek
|
f7d2c90ceb
_almost_contiguous
|
1 year ago |
Guillaume Wenzek
|
88b0690a72
split tests files
|
1 year ago |
Guillaume Wenzek
|
bfbafd9603
fix generation with beam_size=1
|
1 year ago |
Guillaume Wenzek
|
45f986055a
add naive tweaking of lprobs
|
1 year ago |
Guillaume Wenzek
|
2238cea072
SinusoidalPositionEncoder + WIP: TransformerEmbeddingFrontend
|
1 year ago |
Guillaume Wenzek
|
f1f33dbec1
has_layer + transformer decoder
|
1 year ago |
Guillaume Wenzek
|
3f1d6992f3
fix to_numpy for transposed tensors
|
1 year ago |
Guillaume Wenzek
|
a80a3b49f3
remove un-needed code
|
1 year ago |
Guillaume Wenzek
|
fa85f05545
test flash_attn
|
1 year ago |
Guillaume Wenzek
|
06d4ed1475
ggml.ne != np.shape
|
1 year ago |
Guillaume Wenzek
|
c2e6384e29
nb
|
1 year ago |