Hopsworks Feature Store

Commercial Information

Vendor Name

Hopsworks Feature Store

History

First developed at KTH University, now managed by startup Logical Clocks

Stand-alone vs. Platform

Part of the Hopsworks MLOps platform

Delivery Model

Open source, self-managed commercial, and fully-managed cloud service

Clouds Supported

AWS and Azure (managed service), GCP and on-prem (self-managed)

Pricing Model

Cloud service: consumption pricing

Self-managed: per node pricing

Open source: free

Service Level Guarantees

Uptime, Serving latencies

Support

24 x 7 support & response time guarantees

Feature Store Capabilities

Feature Definitions

Feature ingestion jobs managed in notebooks

Automated Transforms

Orchestration of ingestion jobs via Apache Airflow DAGs

Feature Ingestion

Spark/Pandas batch feature ingestion into offline & online store

Spark Streaming feature ingestion into online store
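
The split between the offline and online store can be pictured with a minimal sketch. It is a conceptual illustration only, not the Hopsworks API: the in-memory `offline_store` and `online_store`, the `ingest_batch` helper, and the column names are all hypothetical, and it assumes the offline store keeps the full history while the online store keeps only the latest row per key.

```python
# Hypothetical in-memory stand-ins for the two stores (not Hopsworks code).
offline_store = []   # append-only history of every ingested row
online_store = {}    # latest row per primary key, for low-latency lookups


def ingest_batch(rows, key="customer_id", ts="event_time"):
    """Write a batch to both stores: full history offline, latest value online."""
    for row in rows:
        offline_store.append(dict(row))
        current = online_store.get(row[key])
        # Only overwrite the online value if this row is at least as recent.
        if current is None or row[ts] >= current[ts]:
            online_store[row[key]] = dict(row)


ingest_batch([
    {"customer_id": 1, "event_time": 1, "avg_spend": 10.0},
    {"customer_id": 2, "event_time": 1, "avg_spend": 7.5},
])
ingest_batch([
    {"customer_id": 1, "event_time": 2, "avg_spend": 12.0},
])
```

After both batches, the offline list holds all three rows for training, while the online dictionary holds one row per customer for serving.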

Storage and Feature Processing Infrastructure

Online storage: RonDB

Offline storage: HopsFS on AWS S3, on Azure Blob Storage, or on Direct Attached Storage

Feature Processing: Spark and Python

Feature Sharing and Discovery

Web UI

Searchable feature catalog with metadata

Feature discovery including feature values

Feature versioning and dependency management

Training Dataset Generation

Dataset generated from HopsFS using Python SDK

Time travel to a single point in time in the past

Row-level time travel (on the roadmap)
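
One way to read the two time travel entries above is that the whole dataset can be read as of one timestamp, whereas row-level time travel would pick a different timestamp per row. The sketch below illustrates only the first case, under that assumption. The `VersionedFeatureGroup` class and its methods are hypothetical and do not reflect how Hopsworks stores or queries commits.

```python
from bisect import bisect_right


class VersionedFeatureGroup:
    """Toy versioned table: every commit stores the full state after it."""

    def __init__(self):
        self._commit_times = []  # strictly increasing commit timestamps
        self._snapshots = []     # table state (key -> features) after each commit

    def commit(self, commit_time, upserts):
        if self._commit_times and commit_time <= self._commit_times[-1]:
            raise ValueError("commit times must be strictly increasing")
        state = dict(self._snapshots[-1]) if self._snapshots else {}
        state.update(upserts)
        self._commit_times.append(commit_time)
        self._snapshots.append(state)

    def as_of(self, timestamp):
        """Return the table as it looked at `timestamp` (one point in time)."""
        idx = bisect_right(self._commit_times, timestamp)
        return dict(self._snapshots[idx - 1]) if idx else {}


fg = VersionedFeatureGroup()
fg.commit(100, {1: {"avg_spend": 10.0}})
fg.commit(200, {1: {"avg_spend": 12.0}, 2: {"avg_spend": 7.5}})
snapshot = fg.as_of(150)  # state after the first commit only
```

Every row in `snapshot` reflects the same timestamp. A row-level variant would instead need a separate lookup time for each training example.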

Online Serving

Python SDK for online data retrieval (or direct retrieval from RonDB)

Monitoring and Alerting

Data quality monitoring

Security and Data Governance

Data remains in end-user's cloud account

ACL and RBAC

SSO

Data encryption at rest and in flight

Integrations

Batch data: Any data source that can be read by Python or Spark

Streaming data: Any Spark streaming data sources

Hopsworks Feature Store Frequently Asked Questions

  • What is the Hopsworks Feature Store for Machine Learning?

    Hopsworks, together with its Feature Store, is an open-source, data-intensive AI platform for developing and operating machine learning models at scale.

  • Who is behind Hopsworks?

    Hopsworks started as an open-source collaborative project at KTH University and RISE, and has more recently been taken on by Logical Clocks. Several funding bodies have contributed to its development, including the European Commission (FP7, H2020), EIT, SSF, Vinnova, and Celtic-Next.

  • Is Hopsworks open source?

    Hopsworks is available under the AGPL-v3 license. In plain English, this means that you are free to use Hopsworks and even build paid services on it, but if you modify the source code, you should also release your changes and any systems built around it as AGPL-v3.

  • What is Logical Clocks?

    Logical Clocks is the company behind the Hopsworks project. It offers products such as RonDB and Maggy, and it can also run the Hopsworks Feature Store as a managed service.
