LLMs/Generative AI/Aviary
Topic | Replies | Views | Activity | |
---|---|---|---|---|
About the LLMs/Generative AI/Aviary category
|
![]() |
0 | 266 | April 20, 2023 |
How to deploy LLM models that can handle high concurrency based on the Ray serve framework
|
![]() |
0 | 31 | September 19, 2023 |
Question about the model yaml config `accelerator_type_a100`
|
![]() |
0 | 31 | September 12, 2023 |
Memory Requirements for distributing LLM
|
![]() ![]() |
3 | 87 | August 31, 2023 |
Turbocharge LangChain: guide to 20x faster embedding
|
![]() ![]() ![]() |
14 | 1368 | July 27, 2023 |
Use deepspeed in aviary to deploy falcon 40B / Llama 30B Fails
|
![]() ![]() ![]() |
3 | 292 | July 23, 2023 |
How to deploy LLaMA 2 7B model with Aviary
|
![]() |
0 | 640 | July 20, 2023 |
HuggingFacePredictor Multi-GPU
|
![]() ![]() ![]() |
3 | 285 | July 12, 2023 |
How to deploy Aviary Frontend along with Backend in the same Ray cluster?
|
![]() ![]() |
1 | 213 | June 9, 2023 |
Request for comments: Adding LangChain support to Aviary
|
![]() |
0 | 174 | June 2, 2023 |
Announcing Ray Aviary
|
![]() |
1 | 319 | May 31, 2023 |