About the LLMs/Generative AI/Aviary category
|
|
0
|
605
|
April 20, 2023
|
LLM Deployment retries
|
|
2
|
4
|
January 29, 2025
|
Colab session failing due to Ray inference call
|
|
0
|
26
|
January 5, 2025
|
LLM model loading
|
|
0
|
39
|
July 29, 2024
|
Serving LLM with multiple gpus
|
|
0
|
153
|
July 3, 2024
|
Error Encountered While Training Generative AI Model in Aviary
|
|
2
|
116
|
May 17, 2024
|
Turbocharge LangChain: guide to 20x faster embedding
|
|
15
|
3879
|
May 8, 2024
|
Tensor parallelism with torch run inside ray
|
|
0
|
102
|
April 29, 2024
|
GPT-J6B Sample Code
|
|
0
|
112
|
April 10, 2024
|
RayTaskError(OutOfMemoryError) when using a LLM
|
|
0
|
104
|
April 2, 2024
|
Installing TensorRT LLM on Ray Docker Image as Custom Docker
|
|
2
|
527
|
March 7, 2024
|
Best practices to run multiple models in multiple GPUs in RayLLM
|
|
0
|
661
|
February 8, 2024
|
Question about the model yaml config `accelerator_type_a100`
|
|
1
|
432
|
February 2, 2024
|
Overriding resources per worker in ray-llm
|
|
7
|
420
|
January 30, 2024
|
How do i apply ray on pdfs for making pdf reading RAG scaled application using open source like Huggingface?
|
|
0
|
38
|
January 10, 2024
|
How to assign actors to specific machines?
|
|
2
|
270
|
January 8, 2024
|
How to deploy LLM models that can handle high concurrency based on the Ray serve framework
|
|
1
|
1003
|
January 8, 2024
|
Can RayLLM be deployed on Azure Cloud Services?
|
|
1
|
214
|
December 15, 2023
|
Download an opensource LLM model in Raycluster yaml file?
|
|
2
|
252
|
December 14, 2023
|
OOM when I decoupled ray from GPTj finetune script
|
|
0
|
239
|
November 17, 2023
|
Does ray-llm support only CPU?
|
|
0
|
388
|
October 25, 2023
|
The deployments ['DeployLLM'] are UNHEALTHY
|
|
0
|
73
|
October 21, 2023
|
Memory Requirements for distributing LLM
|
|
3
|
566
|
August 31, 2023
|
Use deepspeed in aviary to deploy falcon 40B / Llama 30B Fails
|
|
3
|
950
|
July 23, 2023
|
How to deploy LLaMA 2 7B model with Aviary
|
|
0
|
1850
|
July 20, 2023
|
HuggingFacePredictor Multi-GPU
|
|
3
|
712
|
July 12, 2023
|
How to deploy Aviary Frontend along with Backend in the same Ray cluster?
|
|
1
|
669
|
June 9, 2023
|
Request for comments: Adding LangChain support to Aviary
|
|
0
|
504
|
June 2, 2023
|
Announcing Ray Aviary
|
|
1
|
779
|
May 31, 2023
|