GitHub / code-kern-ai 51 Repositories
Building data-centric open-source tools for NLP
code-kern-ai/refinery-gateway
Gateway for refinery. Manages incoming requests and holds the workflow logic. To interact with the gateway, the UI or Python SDK can be used.
Language: Python - Size: 8.04 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 3

code-kern-ai/refinery-submodule-model
Data model for refinery. Manages entities and their access for multiple services, e.g. the gateway.
Language: Python - Size: 494 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 1

code-kern-ai/refinery-embedder
Embedder for refinery. Manages the creation of document- and token-level embeddings using the embedders library.
Language: Python - Size: 181 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 1

code-kern-ai/refinery-ui
Language: TypeScript - Size: 2.78 MB - Last synced at: 9 days ago - Pushed at: 13 days ago - Stars: 1 - Forks: 0

code-kern-ai/submodule-react-components
Language: TypeScript - Size: 271 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

code-kern-ai/refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
Language: Python - Size: 3.54 MB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 1,438 - Forks: 71

code-kern-ai/bricks
Open-source natural language enrichments at your fingertips.
Language: Python - Size: 3.3 MB - Last synced at: 27 days ago - Pushed at: 5 months ago - Stars: 459 - Forks: 24

code-kern-ai/cicd-deployment-scripts
Scripts used for Kern AI CI/CD efforts
Language: Shell - Size: 365 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

code-kern-ai/refinery-neural-search
Neural search for refinery. Manages similarity search powered by Qdrant and outlier detection, both based on vector representations of the project records.
Language: Python - Size: 89.8 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 1

code-kern-ai/refinery-tokenizer
Tokenizer for refinery. Manages the creation and storage of spaCy tokens for text-based record attributes and supports multiple language models. It is used by the gateway.
Language: Python - Size: 99.6 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

code-kern-ai/refinery-common-parent-image
Defines parent image for the Docker images of the refinery services which require the integration of the model and the s3 submodule.
Language: Shell - Size: 22.5 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

code-kern-ai/refinery-mini-parent-image
Defines parent image for the Docker images of the refinery services with the smallest set of requirements.
Language: Shell - Size: 23.4 KB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

code-kern-ai/refinery-submodule-parent-images
Submodule which contains the requirements of the different parent images of refinery.
Language: Python - Size: 83 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

code-kern-ai/refinery-exec-env-parent-image
Defines parent image for the Docker images of the refinery services which provide an execution environment.
Language: Shell - Size: 27.3 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

code-kern-ai/refinery-ac-exec-env
Execution environment for attribute calculation in refinery. Containerized function as a service to build custom attributes derived from the original data.
Language: Python - Size: 93.8 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

code-kern-ai/refinery-python-sdk
Official Python SDK for Kern AI refinery.
Language: Python - Size: 171 KB - Last synced at: 2 days ago - Pushed at: 7 months ago - Stars: 19 - Forks: 3

code-kern-ai/refinery-sample-projects
Containing examples of projects you can use to test refinery. Please select the use case from the branches.
Size: 57.1 MB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 25 - Forks: 5

code-kern-ai/submodule-javascript-functions
Language: TypeScript - Size: 30.3 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

code-kern-ai/refinery-torch-cuda-parent-image
Defines parent image for the Docker images of the refinery services that require torch (gpu).
Language: Shell - Size: 24.4 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

code-kern-ai/refinery-torch-cpu-parent-image
Defines parent image for the Docker images of the refinery services that require torch (cpu).
Language: Shell - Size: 25.4 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

code-kern-ai/refinery-entry
Language: TypeScript - Size: 207 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

code-kern-ai/refinery-parent-images 📦
Language: Python - Size: 23.4 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

code-kern-ai/refinery-lf-exec-env
Execution environment for labeling functions in refinery. Containerized function as a service to execute user-defined Python scripts.
Language: Python - Size: 88.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

code-kern-ai/refinery-ml-exec-env
Execution environment for the active learning module in refinery. Containerized function as a service to build active learning models using scikit-learn and sequence-learn.
Language: Python - Size: 74.2 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

code-kern-ai/refinery-websocket
Websocket module for refinery. Enables asynchronous notifications inside the application.
Language: Go - Size: 24.4 KB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 1

code-kern-ai/refinery-record-ide-env
Execution environment for the record IDE in refinery. Containerized function as a service to build record-specific "quick-and-dirty" code snippets for exploration and debugging.
Language: Python - Size: 81.1 KB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 1

code-kern-ai/refinery-doc-ock 📦
Usage statistics collection for refinery. If users allow it, this collects product insight data used to optimize the user experience.
Language: Python - Size: 131 KB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

code-kern-ai/refinery-submodule-s3
S3 related AWS and Minio logic.
Language: Python - Size: 32.2 KB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 1

code-kern-ai/cognition-pdf2md Fork of opengovsg/pdf2md
A PDF to Markdown converter
Language: JavaScript - Size: 1.81 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

code-kern-ai/submodule-tailwind-config
Language: JavaScript - Size: 2.93 KB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

code-kern-ai/alfred
Controls the start process of the refinery app. Named after Batman's butler Alfred.
Language: Python - Size: 26.4 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 2

code-kern-ai/refinery-next-parent-image
Language: Dockerfile - Size: 44.9 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

code-kern-ai/refinery-config
Configuration of refinery. Manages amongst others endpoints and available language models for spaCy.
Language: Python - Size: 34.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

code-kern-ai/refinery-zero-shot
Zero-shot module for refinery. Enables the integration of 🤗 Hugging Face zero-shot classifiers as an off-the-shelf no-code heuristic.
Language: Python - Size: 71.3 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

code-kern-ai/refinery-weak-supervisor
Weak supervision for refinery. Manages the integration of heuristics such as labeling functions, active learners or zero-shot classifiers. Uses the weak-nlp library for the actual integration logic and algorithms.
Language: Python - Size: 49.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

code-kern-ai/refinery-gateway-proxy
Gateway proxy for refinery. Manages incoming requests and forwards them to the gateway. Used by the Python SDK.
Language: Python - Size: 29.3 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 2

code-kern-ai/refinery-authorizer
Evaluates whether a user has access to certain resources.
Language: Python - Size: 22.5 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 2

code-kern-ai/refinery-updater
Updater for refinery. Manages migration logic to new versions if required.
Language: Python - Size: 62.5 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

code-kern-ai/refinery-entry-handlebars 📦
Login and registration screen for refinery. Implemented via Ory Kratos.
Language: Handlebars - Size: 1.09 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

code-kern-ai/automl-docker 📦
CLI-based tool to automatically build ML models from training data into a servable Docker container
Language: Python - Size: 3.33 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 55 - Forks: 7

code-kern-ai/embedders
With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification.
Language: Python - Size: 884 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 21 - Forks: 2

code-kern-ai/sequence-learn
With sequence-learn, you can build models for named entity recognition as quickly as if you were building a sklearn classifier.
Language: Python - Size: 551 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 2

code-kern-ai/refinery-ui-angular 📦
UI for refinery. Used to interact with the whole system; to find out how to best work with the UI, check out our docs.
Language: TypeScript - Size: 2.61 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 2

code-kern-ai/rag-webinar-entwickler.de
Language: Jupyter Notebook - Size: 1.7 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

code-kern-ai/twitter-issues-classifier 📦
Since the twitter algorithm has been open-sourced, the issues section of their repository is being polluted. Let's try to fix that.
Language: Python - Size: 368 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 37 - Forks: 0

code-kern-ai/weak-nlp
With weak-nlp, you can integrate heuristics like labeling functions and active learners based on weak supervision. Automate data labeling and improve label quality.
Language: Python - Size: 912 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 1

code-kern-ai/chatgpt-webinar-entwickler.de
Language: Jupyter Notebook - Size: 997 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

code-kern-ai/docs-chatbot-demo
This is a demo of how we can leverage embeddings and LLMs to quickly build a chatbot so you can "chat" with a documentation.
Language: Python - Size: 13.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

code-kern-ai/util-scripts
Language: Python - Size: 5.86 KB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

code-kern-ai/hosting
Only used for documentation
Size: 1000 Bytes - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

code-kern-ai/refinery-template-functions
Overview of recurring functions you can use for your projects
Size: 121 KB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 1

code-kern-ai/.github
Size: 120 KB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0
