fastdbfs - An interactive command line client for Databricks DBFS.
-
Updated
Jun 10, 2021 - Python
fastdbfs - An interactive command line client for Databricks DBFS.
The project aims to develop a distributed architecture using Apache Spark and Databricks to optimize the management of the Rural Environmental Registry. It focuses on data migration and efficient analysis in the Databricks File System (DBFS).
Predicción de incumplimiento crediticio con algoritmo de Spark MLlib Gradient Boosting Trees, usando cluster de procesamiento de Databricks.
YCLS is a Python module for calculating loudness metrics for audio (or video) files, particularly aimed at determining the loudness level suitable for YouTube content.
Exploración los principios del Procesamiento de Datos a Gran Escala con talleres de Databricks y Spark. Aprender herramientas como Pandas y PySpark para el análisis eficiente de grandes conjuntos de datos. Impartidos por John Corredor en la Pontificia Universidad Javeriana.
Add a description, image, and links to the dbfs topic page so that developers can more easily learn about it.
To associate your repository with the dbfs topic, visit your repo's landing page and select "manage topics."