An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pytorchjob

BaizeAI/kcover

🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.

Language: Go - Size: 58.6 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 30 - Forks: 2

polyaxon/polyaxon-examples

Code for tutorials and examples

Language: Python - Size: 919 KB - Last synced at: about 23 hours ago - Pushed at: over 2 years ago - Stars: 67 - Forks: 25

adindayup/kcover

🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads. kubeflow, kubernetes, kubernetes-controller, llm, llmops, mlops, nvidia-gpu, pytorchjob, tfjob, xid-error

Language: Go - Size: 27.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

0M1J/kubeflow-GNN

GNN training in kubeflow.

Language: Python - Size: 702 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 4