01. Install With docker

Docker install

The easiest way to start using Spark is through the Scala shell:

docker pull apache/spark:3.5.0
docker run -it apache/spark:3.5.0 /opt/spark/bin/spark-shell

Try the following command, which should return 1,000,000,000:

spark.range(1000 * 1000 * 1000).count()
res0: Long = 1000000000

Sources

Last modified January 3, 2024: spark hands-on (80fbb4b)