repositories Search Results · topic:spark org:databrickslabs fork:true

5 results

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used t…
  • Python
  • 291
  • Updated 2 days ago

Toolkit for Apache Spark ML for Feature clean-up, feature Importance calculation suite, Information Gain selection, Distributed SMOTE, Mo…
  • HTML
  • 191
  • Updated on Jun 1, 2021

HL7 Apache Spark Datasource
  • Scala
  • 57
  • Updated on Mar 15

Delta Sharing + MLflow for ML model & experiment exchange (arcuate delta - a fan shaped river delta)
  • Python
  • 21
  • Updated on Dec 27, 2023

Automated provisioning of an industry Lakehouse with enterprise data model
  • Python
  • 8
  • Updated 17 days ago