GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.
git clone https://github.com/daniel-dqsdatalabs/data-engineering-sandbox.git cd data-engineering-sandbox Create a .env file in the project root and add the following environment variables: POSTGRES_USER=your_postgres_username POSTGRES_PASSWORD=your_postgres_password MINIO_ROOT_USER=your_minio_root...
Course project. Sapphirine Big Data Repositories has 780 repositories available. Follow their code on GitHub.
spark-data-analysis-projects Public A collection of data analysis projects done using PySpark via Jupyter notebooks. Jupyter Notebook 10 7 personal-compute-cluster Public Software and tools for setting up and operating a personal compute cluster, with focus on big data. Jupyter Notebook 7 ...
BigData-Project: Supermarket Basket Analysis with Markovchain, Aprioi, XGBoost and RNN; M. Sc. Business Intelligence and Process Management, BSEL Berlin, Germany - floridene/bigdataproject
A Big Data Platform Prototype Project. Contribute to rouroucaicai/bdp development by creating an account on GitHub.
http://www.big-data-europe.eu/ info@big-data-europe.eu Overview Repositories108 Projects Packages People7 More PinnedLoading READMEREADMEPublic General README for the Big Data Europe project's sources 8313 docker-hadoop-spark-workbenchdocker-hadoop-spark-workbenchPublic ...
GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.
GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.
A docker cluster for developing your big data projects. Interact Hadoop, Hive, Spark and Postgres using Jupyterlab and Theia IDE - datainsightat/BigDatDevEnv_Docker