Speakers
	Nils Grunwald
Schedule
Day	Sunday
Room	AW1.125
Capacity	76
Start time	10:20
End time	11:05
Duration	00:45
Info
Track	Graph Processing Devroom

Using Cascalog and Hadoop for rapid graph processing and exploration

Graphs are becoming increasingly popular as a way of modeling a wide variety of systems. As such, the label "graph processing" also covers a range of objectives and architectural constraints. At [Linkfluence][http://us.linkfluence.net/], we use graph processing on datasets produced by very different systems (Web crawler, Twitter and Facebook APIs, feed aggregator, etc.). We spend a lot of time doing exploratory programming, trying to use our eclectic datasets in interesting ways, and processing our data in asynchronous workflows.

We have come to see [Hadoop][http://hadoop.apache.org/] and the processing framework [Cascalog][https://github.com/nathanmarz/cascalog] as essential tools in our toolbox when dealing with graphs, since they give us architectural flexibility, scalability and the possibility of rapid prototyping.

Cascalog is an open source framework built on top of Hadoop and [Cascading][http://www.cascading.org/]. Its syntactic and semantic roots come from Datalog and Prolog, which have long been applied successfully to the manipulation of graphs. In addition, its ability to directly embed the expressive [Clojure][http://clojure.org/] language makes it very easy to define custom operations and ad-hoc processing.
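To give a feel for why Datalog-style rules suit graphs, here is a minimal sketch in plain Python. It is not Cascalog code and uses none of its API; the edge list and the `two_hop` function are hypothetical names chosen for illustration. It evaluates the rule `two_hop(X, Z) :- edge(X, Y), edge(Y, Z)`, where the shared variable `Y` acts as an implicit join.

```python
from collections import defaultdict

# Hypothetical edge list: (source, target) pairs, e.g. "a links to b".
EDGES = [("a", "b"), ("b", "c"), ("b", "d"), ("c", "d")]


def two_hop(edges):
    """Return every (x, z) such that edge(x, y) and edge(y, z) hold for some y.

    Datalog notation: two_hop(X, Z) :- edge(X, Y), edge(Y, Z).
    """
    # Index edges by source so the shared variable Y becomes a lookup.
    by_source = defaultdict(set)
    for src, dst in edges:
        by_source[src].add(dst)
    # Join each edge (x, y) with every edge that starts at y.
    return {(x, z) for x, y in edges for z in by_source.get(y, ())}


print(sorted(two_hop(EDGES)))
```

In a declarative query language the join on `Y` is stated once and the engine decides how to execute it; the explicit index above only spells out what such an engine might do on a single machine.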

In this talk, we will present the framework, consider its advantages and drawbacks compared to other approaches, show concrete examples of its use for graph processing, and explain how we use it to complement graph databases.

Concurrent events:

When	Event	Track	Where
09:45-10:25	Asterisk 10: New Features, New Testing	Telephony and Communications	H.2213
10:00-10:25	MySQL HA reloaded - old tricks and cool new tools to guarantee high availability to your MySQL Servers	MySQL and Friends	H.1309
10:00-10:30	Improving Firefox startup time on Android	Mozilla	UD2.218A
10:00-10:30	Dealing with JVM limitations in Apache Cassandra	Free Java	K.4.401
10:00-10:30	openSUSE on ARM	CrossDistribution	H.1301
10:00-10:40	Toolkits on Wayland - how we're doing!	CrossDesktop	H.1308
10:00-10:45	Introduction to pkgng	BSD	K.4.201
10:00-10:45	Anatomy of a role playing game	Open Source Game Development	AW1.120
10:00-10:45	Systems Management with Matahari	Configuration and Systems Management	K.3.601
10:00-10:50	Voice Applications for the Modern Open Source Hacker	Network and IO	K.1.105
10:00-10:50	CoApp: Packaging Open Source software for Windows	System	Janson
10:00-10:55	Ganeti: "how you can use it"	Virtualization and Cloud	Chavanne
10:00-11:00	Debtags.debian.net reloaded!	CrossDistribution	H.1302
10:00-11:00	The Next Steps for the Pharo Vision	Smalltalk	AW1.126
10:00-11:00	eLuaBrain: a 32-bit MCU based educational computer	Embedded	Lameere
10:00-11:00	KMS plane support in Wayland	X.org+OpenICC	K.3.401
10:00-12:00	OpenSC codesprint	Security	H.2214
10:05-10:45	Perlude: a taste of Haskell in Perl	Perl	AW1.121
10:10-10:55	Introduction of the Genode OS Framework	Microkernel OS	K.3.201
10:20-10:35	Threat Modeling Revolutionized!	Lightning Talks	Ferrer
10:30-10:55	MariaDB 5.3's query optimizer: taking the dolphin to where he's never been before	MySQL and Friends	H.1309
10:30-11:00	JRuby	Free Java	K.4.401
10:30-11:00	Tracking Firefox performance via Telemetry	Mozilla	UD2.218A
10:30-11:10	Mobicents TelScale and RestComm	Telephony and Communications	H.2213
10:30-11:30	OBS Cross Build	CrossDistribution	H.1301
10:30-12:15	LPI Exam Session 3	Certification	Guillissen
10:40-10:55	An introduction to EclipseRT	Lightning Talks	Ferrer
10:45-11:15	Boxes, use other systems with ease	CrossDesktop	H.1308
10:55-11:40	Data-Driven and Component-Based Game-Entities	Open Source Game Development	AW1.120
11:00-11:15	Powerful tools for Linux C/C++ developers based on Eclipse	Lightning Talks	Ferrer
11:00-11:25	Virtualization Management the oVirt way	Virtualization and Cloud	Chavanne
11:00-11:25	How to offload MySQL server with Sphinx	MySQL and Friends	H.1309
11:00-11:30	Xspice: Integrating spice-server into Xorg	X.org+OpenICC	K.3.401
11:00-11:45	Practical implementation of promise theory in CFEngine	Configuration and Systems Management	K.3.601
11:00-11:45	Towards a BSD C++11 Stack	BSD	K.4.201
11:00-11:45	Being a good upstream - the syslog-ng PoV	CrossDistribution	H.1302
11:00-11:50	Internet of Threads	Network and IO	K.1.105
11:00-11:50	Caching and Tuning fun for high scalability	System	Janson
11:00-12:00	Aseba robot programming	Embedded	Lameere
11:00-12:00	Amber, the Smalltalk for web developers	Smalltalk	AW1.126
11:00-17:00	LPI Item Writing Workshop	Certification	UA2.114

Next (up to 3) talks in the same room (AW1.125):

When	Event	Track
11:10-11:55	Birds of a feather - Graph processing, future trends!	Graph Processing
12:00-12:35	Works with persistent graphs using OrientDB	Graph Processing
12:45-13:15	Graphity: an efficient neo4j based graph model for retrieving the top k news feeds for users in social networks	Graph Processing

Events that start after this one (within 30 minutes):

When	Event	Track	Where
11:05-11:45	Perlito	Perl	AW1.121
11:10-11:55	Birds of a feather - Graph processing, future trends!	Graph Processing	AW1.125
11:10-11:55	Introduction to HelenOS	Microkernel OS	K.3.201
11:15-11:55	From zero to VoIP provider in 15 minutes	Telephony and Communications	H.2213
11:20-11:35	Amarok	CrossDesktop	H.1308
11:20-11:35	LibrePlan: Open Web Planning	Lightning Talks	Ferrer
11:30-11:55	oVirt Engine Core: Internals and Infrastructure	Virtualization and Cloud	Chavanne
11:30-11:55	Build simple and complex replication clusters with Tungsten Replicator	MySQL and Friends	H.1309
11:30-12:00	Can I legally do that?	Free Java	K.4.401
11:30-12:00	Wayland Q & A for toolkit developers.	X.org+OpenICC	K.3.401
11:30-12:00	Introducing the Metrics Data Ping	Mozilla	UD2.218A
11:30-12:30	Working with contributor communities (round table)	CrossDistribution	H.1301

fosdem.org
