Artima Developer Spotlight Forum - Google Open-Sources Data Serialization Tool


6 replies on 1 page. Most recent reply: Jul 18, 2008 8:55 AM by Rob Dickens

Frank Sommers

Posts: 2642
Nickname: fsommers
Registered: Jan, 2002

Google Open-Sources Data Serialization Tool

Posted: Jul 8, 2008 5:32 PM

Any application that relies on external data needs some serialization mechanism to read and write that data. Many serialization formats exist, the most popular being XML. XML has the benefits of being programming-language-agnostic and, to some extent, self-describing. However, transmitting and processing XML incurs a great deal of overhead.

Google has been using a serialization format called Protocol Buffers for that purpose instead. Today, the company released this tool under an open-source license. According to the project's documentation:

Protocol buffers are a flexible, efficient, automated mechanism for serializing structured data – think XML, but smaller, faster, and simpler. You define how you want your data to be structured once, then you can use special generated source code to easily write and read your structured data to and from a variety of data streams and using a variety of languages. You can even update your data structure without breaking deployed programs that are compiled against the "old" format.

Google's Kenton Varda provides additional details in a blog post:

Protocol Buffers allow you to define simple data structures in a special definition language, then compile them to produce classes to represent those structures in the language of your choice. These classes come complete with heavily-optimized code to parse and serialize your message in an extremely compact format. Best of all, the classes are easy to use: each field has simple "get" and "set" methods, and once you're ready, serializing the whole thing to – or parsing it from – a byte array or an I/O stream just takes a single method call...

One of Protocol Buffers' major design goals is simplicity. By sticking to a simple lists-and-records model that solves the majority of problems and resisting the desire to chase diminishing returns, we believe we have created something that is powerful without being bloated. And, yes, it is very fast – at least an order of magnitude faster than XML.
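To give a rough feel for the "extremely compact format" mentioned in the quote above, here is a minimal pure-Python sketch of how a single integer field is laid out in the Protocol Buffers wire format: a key byte built from the field number and wire type, followed by the value as a base-128 varint. The helper names (`encode_varint`, `encode_varint_field`) and the `<id>` XML element are hypothetical and exist only for illustration; in practice the generated classes do this work for you, and this sketch does not use or reproduce the actual library.

```python
def encode_varint(value: int) -> bytes:
    """Encode a non-negative integer as a base-128 varint (7 data bits per byte)."""
    if value < 0:
        raise ValueError("this sketch handles non-negative integers only")
    out = bytearray()
    while True:
        low7 = value & 0x7F          # take the lowest 7 bits
        value >>= 7
        if value:
            out.append(low7 | 0x80)  # high bit set: more bytes follow
        else:
            out.append(low7)         # high bit clear: last byte
            return bytes(out)


def encode_varint_field(field_number: int, value: int) -> bytes:
    """Encode one integer field as a key varint followed by the value varint."""
    wire_type_varint = 0             # wire type 0 marks a varint-encoded value
    key = (field_number << 3) | wire_type_varint
    return encode_varint(key) + encode_varint(value)


# Field number 1 holding the value 150, next to a hypothetical XML equivalent.
binary = encode_varint_field(1, 150)
xml = b"<id>150</id>"                # illustrative element name, not from the post

print(binary.hex(), len(binary), "bytes")   # 089601 3 bytes
print(xml.decode(), len(xml), "bytes")      # <id>150</id> 12 bytes
```

The size difference comes from identifying fields by number rather than by a repeated textual tag, which is also why the reader needs the message definition to interpret the bytes. A single field says nothing about parsing speed, so treat this only as an illustration of the encoding idea.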

APIs for reading and writing Protocol Buffers are available for C++, Java, and Python.

What do you think of Protocol Buffers as a data serialization tool? What are your preferred ways to serialize data?

johny boyd

Posts: 28
Nickname: johnyboyd
Registered: Apr, 2007