Tutorial: Use FlatBuffers in Rust

May 17, 2019 FlatBuffers Rust

This is a post in an ongoing series on FlatBuffers.

The FlatBuffers project is an extremely efficient schema-versioned serialization library. In this tutorial, you’ll learn how to use it in Rust.

To learn more about why we need yet another way to encode data, go read my post Why FlatBuffers.

FlatBuffers is a serialization format from Google. It’s really fast at reading and writing your data: much quicker than JSON or XML, and often faster than Google’s other format, Protocol Buffers. It’s schema-versioned, which means your data has integrity (like in a relational database). FlatBuffers supports thirteen programming languages: C++, C#, C, Dart, Go, Java, JavaScript, Lobster, Lua, PHP, Python, Rust, and TypeScript.

This post will show you how to set up FlatBuffers and then use it in a demo Rust program.

(Full disclosure: I maintain the Golang, Python, and Rust FlatBuffers codebases.)

This tutorial has seven short parts:

If you’d like to see all of the code in one place, I’ve put the project up at a GitHub repository.

1. Install the FlatBuffers compiler

First things first: let’s install the compiler.

The compiler is used only in development. That means you have no new system dependencies to worry about in production environments!

Installation with Homebrew on OSX

On my OSX system, I use Homebrew to manage packages. To update the Homebrew library and install FlatBuffers, run:

$ brew update $ brew install flatbuffers

(As is usual on my blog, I indicate CLI input with the prefix $ .)

Personally, I like to install the latest development version from the official Git repository:

$ brew update $ brew install flatbuffers --HEAD

If successful, you will have the flatc program accessible from your shell. To verify it’s installed, execute flatc :

$ flatc flatc: missing input files ...

Other installation methods

If you’d like to install from source, install a Windows executable, or build for Visual Studio, head over to my post Installing FlatBuffers for more.

2. Create a new Cargo project (if needed)

(If you’re adding FlatBuffers to an existing Rust project, you can skip this step.)

Create a basic Cargo configuration with the following command:

$ cargo new rust_flatbuffers_example Created binary (application) `rust_flatbuffers_example` package

There will now be a directory called rust_flatbuffers_example . Change the current working directory to that:

$ cd rust_flatbuffers_example

Now, note that the directory contains the following files:

$ tree . |-- Cargo.toml `-- src `-- main.rs

Finally, check that the Cargo package is properly configured. Do this by running the example program that Cargo automatically generated:

$ cargo run --quiet Hello, world!

If you do not see this output, then please have a look at the official documentation on setting up Rust and Cargo to troubleshoot your configuration.

3. Write a FlatBuffers schema definition

All data in FlatBuffers are defined by schemas. Schemas in FlatBuffers are plain text files, and they are similar in purpose to schemas in databases like Postgres.

We’ll work with data that make up user details for a website. It’s a trivial example, but good for an introduction. Here’s the schema:

// myschema.fbs namespace users; table User { name:string; id:ulong; } root_type User;

Place the above code in a file called myschema.fbs , in the root of your Cargo project.

This schema defines User , which holds one user’s name and id . The namespace for these types is users (which will be the generated Rust package name). The topmost type in our object hierarchy is the root type User .

Schemas are a core part of FlatBuffers, and we’re barely scratching the surface with this one. It’s possible to have default values, vectors, objects-within-objects, enums, and more. If you’re curious, go read the documentation on the schema format.

4. Generate Rust accessor code from the schema

The next step is to use the flatc compiler to generate Rust code for us. It takes a schema file as input, and outputs ready-to-use Rust code.

In the directory with the myschema.fbs file, run the following command:

$ flatc --rust -o src myschema.fbs

This will generate Rust code in a new file called myschema_generated.rs in the pre-existing src directory. Here’s what our project looks like afterwards:

$ tree . |-- Cargo.lock |-- Cargo.toml |-- myschema.fbs `-- src |-- main.rs `-- myschema_generated.rs 1 directory, 6 files

Note that one file is generated for each schema file.

A quick browse of src/myschema_generated.rs shows that there are three sections to the generated file. Here’s how to think about the different function groups:

Type definition and initializer for reading User data pub struct User { ... } pub fn get_root_as_user(buf: &[u8]) -> User { ... } pub fn get_size_prefixed_root_as_user(buf: &[u8]) -> User { ... } pub fn init_from_table(table: flatbuffers::Table) -> Self { ... }

Instance methods providing read access to User data pub fn name(&self) -> Option<&'a str> { ... } pub fn id(&self) -> u64 { ... }

Functions used to create new User objects pub fn create(_fbb: &mut flatbuffers::FlatBufferBuilder, args: &UserArgs) -> flatbuffers::WIPOffset { ... }

(Note that I’ve elided some of the lifetime annotations in the above code.)

We’ll use these functions when we write the demo program.

5. Install the FlatBuffers Rust runtime library

The official FlatBuffers Rust runtime package is hosted on crates.io: Official FlatBuffers Runtime Rust Library.

To use this in your project, add flatbuffers to your dependencies manifest in Cargo.toml . The file should now look similar to this:

[package] name = "rust_flatbuffers_example" version = "0.1.0" authors = ["rw <[email protected]>"] edition = "2018" [dependencies] flatbuffers = "*"

I use * to fetch the latest package version. In general, you’ll want to pick a specific version. You can learn more about this in the documentation for the Cargo dependencies format.

6. Write a demo Rust program to encode and decode example data

Now, we’ll overwrite the default Cargo “Hello World” program with code to write and read our FlatBuffers data.

(We do this for the sake of simplicity, so that I can avoid explaining the Cargo build system here. To learn more about Cargo projects, head over to the official documentation on Cargo project file layouts.)

Imports

To begin, we will import the generated module using the mod statement. Place the following code in src/main.rs :

// src/main.rs part 1 of 4: imports extern crate flatbuffers; mod myschema_generated; use flatbuffers::FlatBufferBuilder; use myschema_generated::users::{User, UserArgs, finish_user_buffer, get_root_as_user};

This usage of the mod keyword instructs the Rust build system to make the items in the file called myschema_generated.rs accessible to our program. The use statement makes two generated types, User and UserArgs , accessible to our code with convenient names.

Writing

FlatBuffer objects are stored directly in byte slices. Each Flatbuffers object is constructed using the generated functions we made with the flatc compiler.

Append the following snippet to your src/main.rs :

// src/main.rs part 2 of 4: make_user function pub fn make_user(bldr: &mut FlatBufferBuilder, dest: &mut Vec<u8>, name: &str, id: u64) { // Reset the `bytes` Vec to a clean state. dest.clear(); // Reset the `FlatBufferBuilder` to a clean state. bldr.reset(); // Create a temporary `UserArgs` object to build a `User` object. // (Note how we call `bldr.create_string` to create the UTF-8 string // ergonomically.) let args = UserArgs{ name: Some(bldr.create_string(name)), id: id, }; // Call the `User::create` function with the `FlatBufferBuilder` and our // UserArgs object, to serialize the data to the FlatBuffer. The returned // value is an offset used to track the location of this serializaed data. let user_offset = User::create(bldr, &args); // Finish the write operation by calling the generated function // `finish_user_buffer` with the `user_offset` created by `User::create`. finish_user_buffer(bldr, user_offset); // Copy the serialized FlatBuffers data to our own byte buffer. let finished_data = bldr.finished_data(); dest.extend_from_slice(finished_data); }

This function takes a FlatBuffers Builder object and uses generated methods to write the user’s name and ID.

Note that the name string is created with the bldr.create_string function. We do this because, in FlatBuffers, variable-length data like strings need to be created outside the object that references them. In the code above, this is still ergonomic because we can call bldr.create_string inline from the UserArgs object.

Reading

FlatBuffer objects are stored as byte slices, and we access the data inside using the generated functions (that the flatc compiler made for us in myschema_generated.rs ).

Append the following code to your src/main.rs :

// src/main.rs part 3 of 4: read_user function pub fn read_user(buf: &[u8]) -> (&str, u64) { let u = get_root_as_user(buf); let name = u.name().unwrap(); let id = u.id(); (name, id) }

This function takes a byte slice as input, and initializes a FlatBuffer reader for the User type. It then gives us access to the name and ID values in the byte slice.

The main function

Now we tie it all together. This is the main function:

// src/main.rs part 4 of 4: main function fn main() { let mut bldr = FlatBufferBuilder::new(); let mut bytes: Vec<u8> = Vec::new(); // Write the provided `name` and `id` into the `bytes` Vec using the // FlatBufferBuilder `bldr`: make_user(&mut bldr, &mut bytes, "Arthur Dent", 42); // Now, `bytes` contains the serialized representation of our User object. // To read the serialized data, call our `read_user` function to decode // the `user` and `id`: let (name, id) = read_user(&bytes[..]); // Show the decoded information: println!("{} has id {}. The encoded data is {} bytes long.", name, id, bytes.len()); }

This function writes, reads, then prints our data. Note that bytes is the byte vector with encoded data. This is the serialized data you could send over the network, or save to a file.

Running it

$ cargo run Arthur Dent has id 42. The buffer is 48 bytes long.

To recap, what we’ve done here is write a short program that uses generated code to write, then read, a byte slice in which we encoded data for an example User. This User has name “Arthur Dent” and ID 42.

7. Learn more and get involved

FlatBuffers is an active open-source project, with backing from Google. It’s Apache-licensed, and available for C++, C#, C, Dart, Go, Java, JavaScript, Lobster, Lua, PHP, Python, Rust, and TypeScript (with more languages on the way!).

Here are some resources to get you started:

And my additional blog posts on FlatBuffers.