Topic | Replies | Views | Activity
About the LLMs/Generative AI/Aviary category | 0 | 601 | April 20, 2023
LLM model loading | 0 | 30 | July 29, 2024
Serving LLM with multiple gpus | 0 | 104 | July 3, 2024
Error Encountered While Training Generative AI Model in Aviary | 2 | 111 | May 17, 2024
Turbocharge LangChain: guide to 20x faster embedding | 15 | 3808 | May 8, 2024
Tensor parallelism with torch run inside ray | 0 | 93 | April 29, 2024
GPT-J6B Sample Code | 0 | 110 | April 10, 2024
RayTaskError(OutOfMemoryError) when using a LLM | 0 | 98 | April 2, 2024
Installing TensorRT LLM on Ray Docker Image as Custom Docker | 2 | 506 | March 7, 2024
Best practices to run multiple models in multiple GPUs in RayLLM | 0 | 644 | February 8, 2024
Question about the model yaml config `accelerator_type_a100` | 1 | 430 | February 2, 2024
Overriding resources per worker in ray-llm | 7 | 412 | January 30, 2024
How do i apply ray on pdfs for making pdf reading RAG scaled application using open source like Huggingface? | 0 | 30 | January 10, 2024
How to assign actors to specific machines? | 2 | 256 | January 8, 2024
How to deploy LLM models that can handle high concurrency based on the Ray serve framework | 1 | 964 | January 8, 2024
Can RayLLM be deployed on Azure Cloud Services? | 1 | 212 | December 15, 2023
Download an opensource LLM model in Raycluster yaml file? | 2 | 252 | December 14, 2023
OOM when I decoupled ray from GPTj finetune script | 0 | 239 | November 17, 2023
Does ray-llm support only CPU? | 0 | 384 | October 25, 2023
The deployments ['DeployLLM'] are UNHEALTHY | 0 | 51 | October 21, 2023
Memory Requirements for distributing LLM | 3 | 556 | August 31, 2023
Use deepspeed in aviary to deploy falcon 40B / Llama 30B Fails | 3 | 941 | July 23, 2023
How to deploy LLaMA 2 7B model with Aviary | 0 | 1838 | July 20, 2023
HuggingFacePredictor Multi-GPU | 3 | 707 | July 12, 2023
How to deploy Aviary Frontend along with Backend in the same Ray cluster? | 1 | 667 | June 9, 2023
Request for comments: Adding LangChain support to Aviary | 0 | 503 | June 2, 2023
Announcing Ray Aviary | 1 | 770 | May 31, 2023