1
0
mirror of https://github.com/hpcaitech/ColossalAI.git synced 2025-05-06 07:28:12 +00:00
ColossalAI/tests/test_infer
flybird11111 2ddf624a86
[shardformer] upgrade transformers to 4.39.3 ()
* [shardformer]upgrade transformers for gpt2/gptj/whisper ()

* [shardformer] fix modeling of gpt2 and gptj

* [shardformer] fix whisper modeling

* [misc] update requirements

---------

Co-authored-by: ver217 <lhx0217@gmail.com>

* [shardformer]upgrade transformers for mistral ()

* upgrade transformers for mistral

* fix

* fix

* [shardformer]upgrade transformers for llama ()

* update transformers

fix

* fix

* fix

* [inference] upgrade transformers ()

* update transformers

fix

* fix

* fix

* fix

* fix

* [gemini] update transformers for gemini ()

---------

Co-authored-by: ver217 <lhx0217@gmail.com>
2024-06-14 10:59:33 +08:00
..
test_async_engine [Inference] Fix bugs and docs for feat/online-server () 2024-05-08 15:20:53 +00:00
test_kernels [shardformer] upgrade transformers to 4.39.3 () 2024-06-14 10:59:33 +08:00
test_models [Inference] Fix flash-attn import and add model test () 2024-06-12 14:13:50 +08:00
__init__.py [Fix] Fix Inference Example, Tests, and Requirements () 2024-05-08 11:30:15 +08:00
_utils.py [Inference] Add the logic of the inference engine () 2024-01-11 13:39:56 +00:00
test_batch_bucket.py [Fix/Inference] Fix format of input prompts and input model in inference engine () 2024-02-23 10:51:35 +08:00
test_config_and_struct.py [Fix] Fix Inference Example, Tests, and Requirements () 2024-05-08 11:30:15 +08:00
test_continuous_batching.py [inference] Fix running time of test_continuous_batching () 2024-05-24 19:34:15 +08:00
test_cuda_graph.py [Fix] Fix Inference Example, Tests, and Requirements () 2024-05-08 11:30:15 +08:00
test_drafter.py [Fix] Fix Inference Example, Tests, and Requirements () 2024-05-08 11:30:15 +08:00
test_inference_engine.py [Inference] Fix bugs and docs for feat/online-server () 2024-05-08 15:20:53 +00:00
test_kvcache_manager.py [Fix] Fix & Update Inference Tests (compatibility w/ main) 2024-05-05 16:28:56 +00:00
test_request_handler.py [Fix] Fix & Update Inference Tests (compatibility w/ main) 2024-05-05 16:28:56 +00:00
test_rpc_engine.py [release] update version () 2024-05-31 19:40:26 +08:00
test_streamingllm.py [Inference]Add Streaming LLM () 2024-06-05 10:51:19 +08:00