About the LLMs/Generative AI/Aviary category
|
|
0
|
592
|
April 20, 2023
|
LLM model loading
|
|
0
|
17
|
July 29, 2024
|
Serving LLM with multiple gpus
|
|
0
|
69
|
July 3, 2024
|
Error Encountered While Training Generative AI Model in Aviary
|
|
2
|
110
|
May 17, 2024
|
Turbocharge LangChain: guide to 20x faster embedding
|
|
15
|
3728
|
May 8, 2024
|
Tensor parallelism with torch run inside ray
|
|
0
|
83
|
April 29, 2024
|
GPT-J6B Sample Code
|
|
0
|
108
|
April 10, 2024
|
RayTaskError(OutOfMemoryError) when using a LLM
|
|
0
|
95
|
April 2, 2024
|
Installing TensorRT LLM on Ray Docker Image as Custom Docker
|
|
2
|
491
|
March 7, 2024
|
Best practices to run multiple models in multiple GPUs in RayLLM
|
|
0
|
628
|
February 8, 2024
|
Question about the model yaml config `accelerator_type_a100`
|
|
1
|
429
|
February 2, 2024
|
Overriding resources per worker in ray-llm
|
|
7
|
407
|
January 30, 2024
|
How do i apply ray on pdfs for making pdf reading RAG scaled application using open source like Huggingface?
|
|
0
|
26
|
January 10, 2024
|
How to assign actors to specific machines?
|
|
2
|
248
|
January 8, 2024
|
How to deploy LLM models that can handle high concurrency based on the Ray serve framework
|
|
1
|
915
|
January 8, 2024
|
Can RayLLM be deployed on Azure Cloud Services?
|
|
1
|
210
|
December 15, 2023
|
Download an opensource LLM model in Raycluster yaml file?
|
|
2
|
246
|
December 14, 2023
|
OOM when I decoupled ray from GPTj finetune script
|
|
0
|
239
|
November 17, 2023
|
Does ray-llm support only CPU?
|
|
0
|
379
|
October 25, 2023
|
The deployments ['DeployLLM'] are UNHEALTHY
|
|
0
|
32
|
October 21, 2023
|
Memory Requirements for distributing LLM
|
|
3
|
547
|
August 31, 2023
|
Use deepspeed in aviary to deploy falcon 40B / Llama 30B Fails
|
|
3
|
938
|
July 23, 2023
|
How to deploy LLaMA 2 7B model with Aviary
|
|
0
|
1831
|
July 20, 2023
|
HuggingFacePredictor Multi-GPU
|
|
3
|
702
|
July 12, 2023
|
How to deploy Aviary Frontend along with Backend in the same Ray cluster?
|
|
1
|
663
|
June 9, 2023
|
Request for comments: Adding LangChain support to Aviary
|
|
0
|
502
|
June 2, 2023
|
Announcing Ray Aviary
|
|
1
|
764
|
May 31, 2023
|