Tag - Apache Spark

50+ Data Science, Machine Learning Cheat Sheets, updated

Gear up to speed and have concepts and commands handy in __data Mining, and Machine learning algorithms with these cheat sheets covering R, Python, Django, MySQL, SQL, Hadoop, Apache Spark, Matlab, and Java. By Thuy T. Pham, U. of Sydney. comments Th...

Alteryx Reveals Newest Platform Release at Inspire 2018

Alteryx, Inc., revolutionizing business through data science and analytics, today announced the general availability of the newest version of the Alteryx platform  (2018.2) at its annual user conference, Inspire 2018. The release delivers new feature...

Apache Flink: The Next Distributed Data Processing Revolution?

Will Apache Flink displace Apache Spark as the new champion of Big Data Processing? We compare Spark and Apache Flink performance for batch processing and stream processing. comments By Kevin Jacobs, Data Blogger. Disclaimer: The results are valid on...

Apache Spark Introduction for Beginners

An extensive introduction to Apache Spark, including a look at the evolution of the product, use cases, architecture, ecosystem components, core concepts and more. comments By Vikash Kumar, Tatvasoft.com.au Businesses are utilizing Hadoop broadly to...

Apache Spark : Python vs. Scala

When it comes to using the Apache Spark framework, the data science community is divided in two camps; one which prefers Scala whereas the other preferring Python. This article compares the two, listing their pros and cons. comments By Preet Gandhi,...

Bigstep Introduces the First Open Data Exploration-as-a-Service

Bigstep, the big data cloud provider, announced the launch of Bigstep DataLab, a solution designed to enable data science and analytics at scale. Bigstep DataLab is an enterprise-ready data research service that gives domain experts, data scientists...