The popular benchmark banded around about Apache Spark‘s machine learning library MLlib is that it is ten times faster than Hadoop based Apache Mahout as an environment for building scalable machine learning applications and a hundred times faster than Hadoop MapReduce. It’s scalabilty is also proven in production to over 8,000 nodes with the ability to cache datasets in memory ...
Read More »Tag Archives: Spark
A Feast of Machine Learning Talks in New York City
Last week saw the New York City instalment of MLConf 2015 in a day which was bursting with insightful talks from respected experts who are at the leading edge of the commercial application of machine learning techniques, delivering more than fifteen sessions over the event. Corinna Cortes, Head of Research at Google kicked off the day with ‘Finding Structured Data ...
Read More »What’s Coming For Spark 2015 – Bay Area Spark User Group
San Francisco’s Spark Meetup group is massive, yet whilst going along to some of these events can be a little intimidating, especially for the more socially awkward attendees, the group really does preach to people of all levels. The first gathering of 2015 kicked off with a review of Spark by Co-organiser Patrick Wendell to get things going, briefly running over ...
Read More »