The 2nd Annual GraphLab Workshop on 7/1/13 in SF!

The GraphLab Big Learning Workshop is a meeting place for both academia and industry to discuss upcoming challenges of large scale machine learning and solution methods. The main goal for this year’s workshop is to bring together top researchers from academia, as well as top data scientists from industry with the special focus of large scale machine learning on sparse graphs.

We have secured talks and demos about the hottest graph processing systems including: GraphLab, Pregel (Google), Giraph (Facebook) , Cassovary (Twitter), Combinatorial BLAS (LBNL/UCSB), Allegro Graph (Franz) ,Neo4j, Titan (Aurelius), DEX (Sparsity Technologies), YarcData and others!

Eventbrite - 2013 Graphlab Workshop on Large Scale Machine Learning

Event Details

Schedule:

  • 8am – 9am: Registration, Coffee & Snacks
  • 9am – Presentations begin (See agenda below)
  • 5pm – 7pm Networking with hosted bar / appetizers

For updated agenda and detailed speaker info see the GraphLab Workshop site here.

Preliminary Agenda

Confirmed Speakers

for the GraphLab 2013 Workshop include:

Prof. Carlos Guestrin, University of Washington – Graphlab 2.2 and Beyond

Dr. Avery Ching, Facebook – Graph Processing at Facebook Scale

Prof. Vahab Mirrokni, Google - Clustering and Connected Components in Mapreduce and Beyond

Dr. Pankaj Gupta, Twitter – WTF: The Who to Follow Service at Twitter

Prof. Joe Hellerstein – Professor, UC Berkeley and Co-Founder/CEO, Trifacta - Productivity for Data Analysts: Visualization, Intelligence and Scale


Dr. Lei Tang – Walmart Labs - Adaptive User Segmentation for Recommendation

 


Dr. Derek Murray – Incremental, iterative and interactive data analysis with Naiad


Dr. Ralf Herbrich, Facebook – TBA


Prof. Mark Oskin, University of Washington, Grappa graph engine.

Featured Projects

Google’s Pregel is their Bulk Synchronous graph framework. Prof. Vahab Mirrokni is going to give an oral talk about graph processing @ Google.
Apache Giraph is the open source equivalent system to Google’s Pregel. Dr. Avery Ching, one of Giraph contributors, will give a talk about large scale graph processing @ Facebook.
Dr. Pankaj Gupta, the creator of Cassovary Graph Processing system @ Twitter will give a talk about Who To Follow (WTF) service in Twitter.
Naiad is a parallel data flow framework from Microsoft with the focus of incremental computation. Dr. Derek Murray from Microsoft Research will present Naiad.
GraphLab is CMU+UW open source graph processing system, which supports both bulk synchronous parallel as well as asynchronous computation. Prof. Carlos Guestrin will present the latest GraphLab project.
Allegro Graph is a high performance graph database with RDF support. Jans Aasman, the CEO of Franz, will give a demo of their newest graph database.
Combinatorial BLAS is a distributed memory parallel graph library from LBNL/UCSB. Dr. Aydin Buluc will present comb-BLAS.
Grappa is a distributed graph processing framework using commodity processors, from The University of Washington. Prof. Mark Oskin will present Grappa.
Titan is a distributed graph database. Dr. Matthias Broecheler will present Titan.
Neo4j is an open source distributed graph database in Java. Alex Averbuch from neo4j will present neo4j.
Infinite Graph from Objectivity is a distributed graph database.
DEX is a high performance and scalable graph database system. Dr. Noerert Martinez will present DEX.
YarcData, a Cray spinoff is creating customized hardware solutions for ultra fast graph processing.
Systap LLC is a startup working on speeding up graph algorithms using GPUs. Bryan Thompson from Systap will present preliminary results of applying the gather apply scatter model on GPU.

Other notable talks at the GraphLab workshop:

Trifacta is the hottest bay area startup out there, started by Prof. Joe Hellerstein from Berkeley and Prof. Jefferey Heer from Stanford. Prof. Joe Hellerstein will talk about Productivity for Data Analysts: Visualization, Intelligence and Scale.
Dr. Lei Tang from Walmart Labs will talk about adaptive user segmentation for collaborative filtering.
Alpine Data Labs is a Greenplum spinoff focusing on big data analytics. Seven Hillion will describe a case study of big data analytics on top of Hadoop. 

 

Gold Sponsors

 

Loogicblox logo

 

Pipefish logo

 

HiringSolved

LexisNexis
Alpine Data Labs

Technicolor Labs Logo


Media Sponsors

LDBC

For more Machine Learning events see mlconf.com and follow @mlconf on twitter.

 

Event Producers

 

For more Machine Learning events see mlconf.com and follow @mlconf on twitter.

Eventbrite - 2013 Graphlab Workshop on Large Scale Machine Learning

More



7/9 in SF: Big Learning Workshop with CMU, Twitter, Pandora, Netflix and many more!

MLconf presents:

Join us on Monday, July 9th in San Francisco for a full-day workshop on Large Scale Machine Learning. Featuring CMU’s Graphlab and including presentations from Twitter, Pandora, Netflix, Intel Labs, MapR, and many more.

The GraphLab workshop on large scale machine learning is a meeting place for both academia and industry to discuss upcoming challenges of large scale machine learning and solution methods. GraphLab is Carnegie Mellon’s large scale machine learning framework. The workshop will include demos and tutorials showcasing the next generation of the GraphLab framework, as well as lectures and demos from the top technology companies about their applied large scale machine learning solutions.
Eventbrite - Large Scale Machine Learning Workshop with CMU's Graphlab

The workshop will be held on Monday, July 9th in San Francisco. Register today to enjoy early bird registration fee!

 

Talks

  • GraphLab Version 2 Overview-  Carlos Guestrin, Carnegie Mellon University
  • Large scale ML challenges -  Theodore Willke, Intel Labs
  • TBD –  Alexander Smola, Yahoo! Labs
  • Large scale ML learning at MapR – Ted Dunning, MapR Technologies
  • Large scale ML at Pandora – Tao Ye, Pandora Internet Radio
  • TBD – Xavier Amatriain – Netflix
  • Cassovary Graph Processing System – Pankaj Gupta, Twitter

More talks from our program committee/ external contributors to follow!

Posters/Demos

  • Green Marl graph processing framework – Dr. Sungpack Hong, Oracle Labs
  • Machine learning benchmark framework – Nicholas Kolegraff, Accenture
  • TBD – Prof. Alexander Gray, Georgia Tech
  • Alpine and MADLib Demo – Steven Hilion, Alpine Data Labs

Platinum Sponsor

Intel Logo

Gold Sponsors

Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries. All other logos are trademarks of the companies who own them, respectivley.

 

For more Machine Learning events see mlconf.com and follow @mlconf on twitter.

More



geekSessions 2.2: Network and Infrastructure Scalability

geekSessions 2.2: Network & Infrastructure Scalability

In our last event we looked at the changing world of Data Scalability. For our 2.2 event we’re bringing in the experts to dive into the complex topics of Network & Infrastructure Scalability. In this Session we’ll look at the architecture of Web-scale infrastructures and investigate issues like Buy vs. Build, Cloud Technologies, Storage, and System Scalability. We’ll also give special attention to the State Of The Network in 2011 with issues like IPv4 exhaustion, more speed, more threats and the compromises between Performance, Reliability, Accessibility, Security and Cost to name a few.

REGISTER NOW and join us for a great discussion and a great networking event!

Follow @geekSessions on Twitter for ongoing announcements, FREE TICKETS, Speaker Conversations and more!

Event Details

What: geekSessions 2.2: Network & Infrastructure Scalability
When:
Tuesday, July 26th, 6:00pm-10:30pm

  • 6:00pm – Registration and networking W/ free beer!
  • 7:00pm – Speakers and Q&A
  • 8:30pm – Networking and giveaways (more free beer)

Where: Madrone Studios Google Map: 1417 15th San Francisco, CA 94103
How:
A limited number of tickets are now open to the public! Get them while they last.

Register for geekSessions 2.2: Network & Infrastructure Scalability in San Francisco, CA  on Eventbrite

About The Speakers

 

Cliff Moon
Co-Founder, Boundary

Cliff Moon is co-founder at Boundary where he uses both Scala and Erlang to build the most advanced network analysis platform in the world.  In a previous life, Cliff wrote Dynomite, one of the first open source dynamo clones.  He is a well regarded member of both the NoSQL and the Erlang communities, as well as a frequent organizer of Bay Area drinkups.

Mike Christian
Senior Director, Infrastructure Resiliency, Yahoo!

Mike has spent the last 8 years building highly available systems for Yahoo!, from global replication of petabyte data sets, to massively distributed CDN and traffic routing mechanisms dispersed to points throughout the world.  Prior to that, he spent 9 years building interactive television systems at Oracle and Thirdspace, wrestling bus sized parallel supercomputing systems and building fast lightweight DVR client applications.  He particularly enjoys solving unsolvable problems.

Allan Leinwand
Chief Technology Officer, Infrastructure Engineering, Zynga

Allan is currently Chief Technology Officer of Infrastructure Engineering at Zynga. In this role Allan is responsible for all aspects of technology infrastructure used in the delivery of Zynga’s social games including data centers, networking, compute, storage, content distribution and cloud computing.

Gleb Budman
CEO, Backblaze

Gleb co-founded Backblaze, an online backup service that provides unlimited storage for $5/month. The Backblaze team designed and open-sourced cloud storage hardware and built a 15+ petabyte cloud storage system without raising funding. Gleb has shared the process and how others can develop their own petabyte cloud storage. Previously, Gleb fought spam, worked to make search smarter, and built robots for nuclear facilities.

 Call For Speakers

If you are interested in speaking at this event contact us at gs22@geeksessions.com or @geeksessions


geekSessions 2.2
is sponsored by:

A10 Networks Logo

A10 NetworksA new generation Advanced Application Delivery Controller (ADC) and server load balancer, offering the industry’s best price/performance.


Juniper NetworksHigh Performance network infrastructure.

Force 10 Networks Logo

Force10 Networks - Force10networks.com

Presented by:


Follow @geekSessions
on Twitter for ongoing announcements, FREE TICKETS, Speaker Conversations and more!


More



geekSessions 2.1: Data Scalability –SQL or NoSQL? on 5/3/2011

geekSessions 2.1

Big Data, big challenge. In this session we’ll look at how companies are building high-performance systems manage, access, analyze, search, and share massive datasets that drive Web-Scale applications and other data intensive apps. We’ll compare relational vs. non-relational approaches and look at how these different paths impact the architecture of the system.

RDMS/SQL, NoSQL, Key Value stores, Wide Columns, Eventual Consistency, Massive Parallelism. We’re bringing together a great panel of speakers to discuss and share knowledge about the technology and techniques being used manage the ever increasing amount of data.

Join us for a great discussion and a great networking event!

Follow @geekSessions on Twitter for ongoing announcements, FREE TICKETS, Speaker Conversations and more!

Event Info

What: geekSessions 2.1: Data Scalability –SQL or NoSQL?
When: Tuesday, May 3rd, 6:00pm-10pm

  • 6:00pm – Networking, Snacks, Free Beer
  • 7:00pm – Panel discussion and Q&A (bar closed)
  • 8:30pm – More socializing

Where: MIGHTY in SOMA, Google Map: 119 Utah St., San Francisco
How:
A limited number of tickets are now open to the public! Get them while they last.

About the Speakers

Jason Lucas
Scalability Architect, Tagged
[click here for Jason's slides]

Ask me what is vps?

Jason Lucas is the scalability architect for Tagged (www.tagged.com), the third largest social networking system in the world.  Jason has worked for Google on large-scale, distributed systems and for Microsoft on the Visual C++ compiler.  He also spent almost ten years working on artificial intelligence systems for treating HIV/AIDS in Africa.  These days Jason focuses on problems in the NoSQL space, creating planetary-scale data services that are reliable, fast, cheap, and, if at all possible, easy to use.

 

Danny Bickson
Researcher, Machine Learning Department CMU
[click here for Danny's slides]
Danny Bickson is a postdoctoral researcher at the Machine Learning Department in Carnegie Mellon University, hosted by Prof. Carlos Guestrin (CMU) and Prof. Joseph Hellerstein (Berkeley). His most recent project, GraphLab, involves the design and implementation of a distributed programming abstraction that outperforms MapReduce, designed to support iterative and potentially asynchronous algorithms on big data. His research targets large scale distributed algorithms design and their deployment, spanning both the theoretical and applied aspects of large scale computing and applied machine learning.

Ted Dziuba
Senior Member of Technical Staff, eBay
[click here for Ted's slides]

Ted Dziuba was the co-founder and lead engineer behind Milo.com, an online comparison shopping engine. Milo was acquired by eBay in December, 2010, and Ted is now Senior Member of Technical Staff for eBay’s Local division. Previously, he worked at Google on internal tools and Pressflip, a machine learning startup. Today, he works with both SQL and NoSQL systems, from hardware and operational aspects to application development.


Eric Bieschke
Playlists, Pandora
[click here for Eric's slides]

Eric Bieschke runs playlist engineering for Pandora. As Pandora’s second employee he built small scale prototypes for many of Pandora’s systems and has grown them to service more than 80M users who’ve thumbed 8 billion songs while listening to billions of hours of music. Pandora has taken a hybrid SQL/NoSQL approach to data scaling with an architecture that leverages everything from Hadoop to Postgres to Redis and everything in between.

About the Moderator

Mike Panchenko
Infrastructure Engineer, SimpleGeo

Mike works at SimpleGeo, a company that provides a hosted spatial database. His primary responsibility is obsessing over the scalable storage infrastructure built on top of Apache Cassandra. He spends his time making data structures work in a distributed eventually consistent system, routing around failures, and making bad jokes about concurrency. Before SimpleGeo, Mike worked at Flickr, where data about some 6 billion photos is stored in one of the largest MySQL installs. SQL or not, he loves large storage architectures that can handle terabytes of data.

More



geekSessions 2.0: The Art and Science of UI – Feb. 22nd

geekSessions 2.0

For the next Session, we’re bringing together a great group of people to discuss the ever evolving world of UI development. We have retro game emulators written in HTML 5, Flash interpreters written in Javascript, new tools like Unity, Mobile platforms, Multi-touch, and more. The field has never been more compelling or more complex.

Thanks to geekSessions alumnus Jonathan Abrams, we’re teaming up with Founders Den to host the this event.

Also thanks to our friends at Terrabit Systems who will be giving away a shiny new iPad at the event! Best door prize evar!!!

Event Info

What: geekSessions 2.0: The Art and Science of UI
When: Tuesday, February 22nd, 6:30pm-10:30pm

6:30pm – Registration, Networking, Beer
7:30pm – Panel discussion and Q&A (bar closed)
9:00pm – More beer, more networking

Where: Founders Den, Google Map: 665 3rd St, San Francisco

How: A limited number of tickets are now open to the public! Get them while they last.

About the Speakers

Chris Smoak
Developer, Smokescreen

Chris made people Apple iOS users cheer in 2010 when he released the open source “Smokescreen” beta. Smokescreen automatically converts Flash (SWF) to Javascript/HTML5. Chris will give us the rundown on Smokescreen from a technical perspective and talk about it’s Flash to HTML5 magic!

-

Martin Kool
HTML5 Stunt Coordinator, Q42

Founder of Quplo. Wrote Sarien.net. Digital inventor. Technologist. Geek. Retro gamer. Father of 3. Husband of 1.

Martin created Sarien.net which is an HTML5 based retro game emulator and multiplayer social experience. Sarien.net enables anyone with an HTML5 capable device to play classic Sierra games like Space Quest (awesome on iPad) and Leisure Suit Larry. It also enables users to play these games in a multiplayer mode and interact with others in the game world.

David Kaneda
Creative Director, Sencha

David Kaneda leads the Sencha design team. He has over eight years of experience designing in a variety of fields, from architecture and fashion to education and software. Recently, David created Outpost, an iPhone app for Basecamp, and jQTouch, a Javascript framework for iPhone development. David also maintains WebKitBits, a site about the browser engine in Safari, Google Chrome, and the iPhone. David brings his wealth of design knowledge to Sencha, and is responsible for the look and feel of our websites and software.

Jeffery Kalmikoff
VP Product, SimpleGeo

Jeffrey spent nearly seven years as Chief Creative Officer of Threadless.com, focused on overall creative direction, design and product development/strategy. During his time there, they built the company from a profitable side-project into a multi-million dollar brand with an active, thriving online-community of over a million tee shirt and design enthusiasts. After Threadless, Jeffrey spent some time as Digg’s Director of Design, and is now working to make it less complicated for developers to add location features to their web and mobile applications as VP of Product at SimpleGeo.

Chelsea Howe
Director of Design, Social Chocolate

Chelsea Howe is currently Director of Design at Social Chocolate. Previously, she worked at Zynga as a Designer on FarmVille. Prior to her social gaming experience, she co-founded Proper Walrus to develop quirky, experimental indie games like Tipoli. She also worked at ActionXL designing and producing motion PC and mobile games, and at Cornell University as a game design instructor for the outreach portion of an NSF-funded research grant.

If you’re doing something great in the UI space and want to share, get in touch with us!

More



Announcing geek-biz-pixel Sessions Mixer

(geek-biz-pixel) Sessions Mixers!

Join us on July 28th for Sessions Mixer 1.0. This will be a social mashup of our 3 events, geekSessions, bizSessions, and pixelSessions. No speakers, no Q&A. Just pure networking with a great mix of technologists, business folks, and designers.

Got something to show off? We’ll have a room with tables and booths devoted demos. Technology Demo, Design Display, or Business Pitch, If you have something to show let us know! Demo space is free but limited so apply here now.

Sessions Mixer

Event Details

  • Tuesday, July 28th, 5:30pm
  • A social mashup: Technologists, Entraprenuers, Designers from geekSession, bizSessions, pixelSessions events!
  • Networking & Demos
  • Sponsor Giveaways!

Powered by

More



Next