GitHub / project-codeflare / codeflare-sdk
An intuitive, easy-to-use Python interface for batch resource requesting, access, job submission, and observation. It simplifies the developer's life while enabling access to high-performance compute resources, either in the cloud or on-prem.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/project-codeflare%2Fcodeflare-sdk
PURL: pkg:github/project-codeflare/codeflare-sdk
Stars: 32
Forks: 59
Open issues: 83
License: apache-2.0
Language: Python
Size: 8.95 MB
Dependencies parsed at: Pending
Created at: about 3 years ago
Updated at: 4 days ago
Pushed at: 4 days ago
Last synced at: 4 days ago
Commit Stats
Commits: 79
Authors: 5
Mean commits per author: 15.8
Development Distribution Score: 0.443
More commit stats: https://commits.ecosyste.ms/hosts/GitHub/repositories/project-codeflare/codeflare-sdk
v0.4.0 Job Management Update
v0.4.0
The SDK now has the ability to:
- Submit jobs to a Ray cluster (via TorchX)
- Submit jobs directly to MCAD (via TorchX)
- Check submitted job status
- Check submitted job logs
- Cancel a submitted job
Also included are other minor bug fixes and improvements (mainly clarity on starting vs. running Ray clusters).
v0.3.1 Allow for user defined ports in Auth
v0.3.1
What’s Changed
- Remove hard-coded port 6443 in auth by @MichaelClifford in https://github.com/project-codeflare/codeflare-sdk/pull/74
Full Changelog: https://github.com/project-codeflare/codeflare-sdk/compare/v0.3.0...v0.3.1
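As a rough illustration of what removing a hard-coded port can look like, the sketch below keeps a port the user supplies and falls back to 6443 only when none is given. The helper name `build_api_server_url`, its parameter, and the fallback behavior are assumptions made for this example; they are not taken from the SDK or from the pull request.

```python
from urllib.parse import urlsplit

# The port named in the release note as previously hard-coded.
DEFAULT_API_PORT = 6443


def build_api_server_url(server: str) -> str:
    """Return an https URL for `server`, keeping a user-supplied port if present.

    Hypothetical helper for illustration only; not part of codeflare-sdk.
    """
    # urlsplit only recognizes the host when a scheme separator is present.
    parts = urlsplit(server if "://" in server else f"https://{server}")
    port = parts.port if parts.port is not None else DEFAULT_API_PORT
    return f"https://{parts.hostname}:{port}"
```

With this shape, a server string that already carries a port passes through unchanged, so nothing in the auth path needs to know about 6443.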
v0.3.0 Info and Auth Updates
v0.3.0
Updates:
- New all-encompassing cluster.status() method
- New cluster.details() method
- New cluster.wait_ready() method
- New CodeFlareStatus.STARTING status
- Functions get_current_namespace(), list_all_clusters(), and list_all_queued() now all function as intended
- Many bug fixes and added error handling for both regular use and edge cases (not logged in, login expired, no resources created yet, resource failure, cluster setup in progress, etc.)
- A LOT more testing (up to 95%)
- Improved messaging on both expected and unexpected results
Authentication: abstracted from cluster creation/deletion; you can now authenticate and log out independently.
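A wait-until-ready method of the kind listed above is usually a polling loop over a status check. The following is a minimal sketch of that pattern, not the SDK's implementation: the `Status` enum is a stand-in (only a STARTING status is named in these notes; READY and FAILED are assumed), and the `get_status`, `timeout`, and `poll_interval` parameters are hypothetical.

```python
import time
from enum import Enum
from typing import Callable, Optional


class Status(Enum):
    """Stand-in status enum; only STARTING appears in the release notes."""
    STARTING = "starting"
    READY = "ready"      # assumed name
    FAILED = "failed"    # assumed name


def wait_ready(get_status: Callable[[], Status],
               timeout: Optional[float] = None,
               poll_interval: float = 5.0) -> None:
    """Block until get_status() reports READY, polling every poll_interval seconds.

    Raises RuntimeError on FAILED and TimeoutError once `timeout` seconds elapse.
    """
    start = time.monotonic()
    while True:
        status = get_status()
        if status is Status.READY:
            return
        if status is Status.FAILED:
            raise RuntimeError("cluster failed to start")
        if timeout is not None and time.monotonic() - start >= timeout:
            raise TimeoutError("cluster did not become ready in time")
        time.sleep(poll_interval)
```

Passing the status check in as a callable keeps the loop independent of how the status is actually obtained, which also makes it easy to exercise with a fake.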
v0.2.2 Requirement and Dependency Update
v0.2.2
Corrects requirements and package dependencies by switching from ray 2.1.0 to ray[default] 2.1.0, and corrects the allowed Python version range to be in line with rich compatibility.
v0.2.1 Auth Skip TLS Option
v0.2.1
Adds a new option to the TokenAuthentication object to skip TLS verification.
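The notes do not show how this option is implemented. To make clear what skipping TLS verification means in practice, here is a sketch using only Python's standard `ssl` module; the function name `make_ssl_context` and the `skip_tls` flag are hypothetical and are not claimed to match the `TokenAuthentication` signature.

```python
import ssl


def make_ssl_context(skip_tls: bool = False) -> ssl.SSLContext:
    """Build a client-side SSL context.

    `skip_tls` is a hypothetical flag name used for illustration.
    """
    # The default context verifies certificates and hostnames.
    context = ssl.create_default_context()
    if skip_tls:
        # check_hostname must be disabled before verify_mode can be CERT_NONE.
        context.check_hostname = False
        context.verify_mode = ssl.CERT_NONE
    return context
```

Disabling verification removes protection against man-in-the-middle attacks, so an option like this is generally suited to test clusters with self-signed certificates rather than production use.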
v0.2.0 Ray Job Options
v0.2.0
Adds the ability to:
- View a list of submitted Ray jobs
- View status of a given Ray job
- View logs of a given Ray job
all directly through the SDK
v0.1.7 Authentication and Descriptions
v0.1.7
Added new OCP cluster authentication options as well as docstrings and type hints to all modules. Also reformatted and added format-checking rules for developers.
v0.1.6 Configurable Namespaces and New License
v0.1.6
Added Apache v2 license, and introduced a new feature for creating clusters in non-default namespaces.
v0.1.5 Unique Worker Group Names and Pretty Print Improvements
v0.1.5
Improved pretty print formatting and appended cluster names to worker group names to avoid overlap.
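The naming change can be pictured with a short sketch. The function name, the `-` separator, and the ordering of the two parts are assumptions for illustration; the notes only say that the cluster name is appended to the worker group name.

```python
def unique_worker_group_name(base_name: str, cluster_name: str) -> str:
    """Append the cluster name so groups in different clusters do not collide.

    Hypothetical helper; the separator and format are assumed, not the SDK's.
    """
    return f"{base_name}-{cluster_name}"
```

Two clusters that share the same base group name then end up with distinct worker group names.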
v0.1.4 Changing Defaults and Adding Dashboard Route
v0.1.4
Added a dashboard route and changed the head node's default resources (and default env vars).