LLMs/Generative AI/Aviary
Topic | Replies | Views | Activity | |
---|---|---|---|---|
About the LLMs/Generative AI/Aviary category
|
![]() |
0 | 376 | April 20, 2023 |
OOM when I decoupled ray from GPTj finetune script
|
![]() |
0 | 29 | November 17, 2023 |
Does ray-llm support only CPU?
|
![]() |
0 | 119 | October 25, 2023 |
How to deploy LLM models that can handle high concurrency based on the Ray serve framework
|
![]() |
0 | 192 | September 19, 2023 |
Question about the model yaml config `accelerator_type_a100`
|
![]() |
0 | 164 | September 12, 2023 |
Memory Requirements for distributing LLM
|
![]() ![]() |
3 | 233 | August 31, 2023 |
Turbocharge LangChain: guide to 20x faster embedding
|
![]() ![]() ![]() |
14 | 1923 | July 27, 2023 |
Use deepspeed in aviary to deploy falcon 40B / Llama 30B Fails
|
![]() ![]() ![]() |
3 | 532 | July 23, 2023 |
How to deploy LLaMA 2 7B model with Aviary
|
![]() |
0 | 1089 | July 20, 2023 |
HuggingFacePredictor Multi-GPU
|
![]() ![]() ![]() |
3 | 431 | July 12, 2023 |
How to deploy Aviary Frontend along with Backend in the same Ray cluster?
|
![]() ![]() |
1 | 339 | June 9, 2023 |
Request for comments: Adding LangChain support to Aviary
|
![]() |
0 | 292 | June 2, 2023 |
Announcing Ray Aviary
|
![]() |
1 | 458 | May 31, 2023 |