Binpress Podcast Episode 11: Slava Akhmechet of RethinkDB

The Binpress Podcast

Binpress Podcast Episode 11: Slava Akhmechet of RethinkDB

September 16, 2014

This week, we chat with Slava Akhmechet, co-founder of RethinkDB, an open-source distributed database taking the development world by storm. Slava discusses why experimental work often doesn’t make the cut for commercial codebases, why you should focus on ideas instead of venture capital, and the biggest opportunity in commercial open source. He also covers how assumptions keep people from coming up withÂ great ideas, how Star Trek: The Next Generation explains why he built RethinkDB, and much more!

Listen to the podcast in the player above, or click here to download it directly. Subscribe on iTunes or do so manually by using this RSS feed.

Show notes

Slava Akhmechet: Website,Â Twitter, Github
RethinkDB: Website, Twitter, Github

Transcript

Alexis: Thank you, Slava, for taking time out of your schedule to join us here on the podcast.

Slava: Itâ€™s my pleasure. Iâ€™m looking forward to this.

Alexis:Â Before we dive into RethinkDB, tell us a little bit about yourself.

Slava: Well, my name is Slava Achmechet, Iâ€™m one of the founders at RethinkDB. I was born in Ukraine and moved with my parents to New York City when I was about 13 years old in 1996, and Iâ€™ve been programming basically for as long as I can remember. Back when I was a kid, I really loved to program and all my peers loved playing games and I love programming computer games so thatâ€™s how I got started. I can tell you a little bit more about that.

Alexis: Yeah, absolutely.

Slava: Then later on, I was always into computers and computer science. I graduated with a computer science degree and spent some time in grad school, again studying computer science and just starting RethinkDB has sort of been a very natural thing for me. I always wanted to start a tech company and build technology products that improve peopleâ€™s lives in some ways or make them happy and thatâ€™s how RethinkDB came along.

Alexis: So how did you first get interested in, â€œYou know, I want to build a database,â€ because thatâ€™s probably not the first thing that comes to peopleâ€™s minds. â€œI want to build a game,â€ or â€œI want to build an app,â€ that kind of thing.

Slava: Yes. I was in grad school at Stony Brook University. One year in, I passed all my qualifiers and I was looking at â€œWhich lab am I going to join?â€ I was playing around with a few ideas. So one was doing massively parallelized simulations on mammalian brains on super computers, and we had accessed the idea on BlueJimp. So it sounds pretty fancy but it was actually this big problem where neurons, human neurons or mammalian neurons have thousands of connections and they all interact with each other which is really hard to simulate and parallelize on the modern computer because you get metric bottlenecks.

So the thing I was working on, I was trying to figure out, â€œOkay, how do you take this simulation model and how do you make it work efficiently on modern computers?â€ And the second thing I was playing with is a file systems lab where people were building file systems for Linux and they were just experimenting with various ideas and thatâ€™s where I met my co-founder Mike who was actually into human-computer interaction.

So weâ€™re sitting around, bouncing ideas with each other and we thought, â€œHey, how can we combine our skills to build something interesting for people?â€ And the thing that we realized is that, so we have backgrounds in infrastructure and human-computer interaction and we realized that the way people are building applications and deploying applications, specifically web applications, basically changed dramatically in the past tenÂ years. And databases were designed 40 years ago back when none of this stuff existed. So just the colonel of the idea was what happens if you sit down and rethink, redesign these systems to work in the modern world? What assumptions would you throw out? What new assumptions will we deal with? And thatâ€™s how the idea came along and it was very exploratory at that time.

It wasnâ€™t that we wanted to build a database, itâ€™s that we thought, â€œWhat happens if we explore like how people build web applications and how can we apply our skills in the most effective way possible?â€ And I think thatâ€™s how the project came along.

Alexis: Was it with the intent of commercializing it at some point or was it more of an exploratory thing as youâ€™d previously mentioned?

Slava: I think it started out as an exploratory thing and then we realized that when we started talking to people about it. So we were in New York and we got into Y Combinator. So there was some kernelÂ of commercializing it. We thought, “We’re going to give it a try.”Â We moved to California, started talking to people about it and we realized that there was just a tremendous amount of excitement in just when we talked to developers and CEOs and even business leaders about the span of technology, so it just became clear that there is something there, and commercialization came out of that but I donâ€™t think it wasâ€¦ like initially we didnâ€™t think it through all that far. We just wanted to build something interesting.

Alexis: So after all that exploration, what resulted in RethinkDB? What did it form into?

Slava: So, RethinkDB is an open source distributor document database.

Alexis: You havenâ€™t said that a thousand times.

Slava: Thatâ€™s right. But I never get tired of saying it because I really love the product. But look, we let people do it, we let them build and scale reactive applications. And just to give you an idea what that means, if youâ€™ve ever used for example Gmail, and youâ€™re looking at the Gmail prep, just reading conversations and if a new email comes along, thereâ€™s this little notification bar that pops up on the bottom that says youâ€™ve gotten a new email.

Or another example of this is if youâ€™ve ever used a product like Quora for questions and answers, youâ€™re looking at a question and an answer and then someone else edits it in a different browser, you see an object instantaneously. So thatâ€™s real time, this type of new real time reactive experience is a relatively new thing and companies like Facebook and twitter and Google have basically trained consumers, they trained all of us to expect that kind of an experience but if youâ€™re a company and you have a website or an app and you are trying to build something like that, it turns out to be really time-consuming and really hard because there is a lot of innovation in the web world and server world like Node.JS and Socket.IO or Sockjs that lets people do that but there hasnâ€™t been very much in the web world to let people do that.

So the type of thing in RethinkDB, what we do is we allow people to write queries in a query language designed for JSON documents and itâ€™s very, very convenient to use if you use something like jQuery itâ€™s very similar. You can just write queries and get answers. You could scale it out in a click of a button to multiple machines.

But then what you could do is when you write a query on a .changes command and then you get a real time stream of updates to the results. So for example, you could say something like give me an average age of my user, plus this isnâ€™t actually a very useful goal, and you could type .changes at the end and youâ€™ll just get anytime someone inserts something into the database and the average age changes, you just get that update, the updated value. And it turns out that that makes the development experience of building this reactive application just dramatically easier for people, and thatâ€™s the core of RethinkDB. Thatâ€™s why people pick it up and use it for the most part.

And thereâ€™s a lot of different things around it. The user experience, we wanted it to be super easy to get started with and we wanted it to be super easy to scale out with their application scale to really care about the quality of the query language, all the stuff. But the fundamental core of the product is just redesigning a database in a way that lets people build and scale these reactive applications in the modern world.

Alexis: So listeners might be thinking, â€œAlright, Iâ€™ve got MongoDB or something like Cassandra.â€ What makes RethinkDB different in some other ways?

Slava: So RethinkDB is actually quite similar to MongoDB. So if youâ€™ve used MongoDB, I donâ€™t think you would be very surprised when you get started with Rethink. Itâ€™s a very similar experience, itâ€™s very easy to get started and itâ€™s familiar. So it certainly would be familiar to for example MongoDB users. Now, the fundamental differences in RethinkDB is this reactive scalability component. The fact that you could get an incremental real time update to any results or to most results, so thatâ€™s a huge differences. That absolutely changes the way of program because instead of writing of a query and then pulling it letâ€™s say every five seconds, the database pushes the updates out to you. That just makes building applications just dramatically different and dramatically easier.

So thatâ€™s the fundamental part and then thereâ€™s a lot of things around the edges. So for example, RethinkDB supports distributor joints. There are very few NoSQL databases that do that and you can use joints the way you would use them with their relational, traditional database, you could use that with JSON documents which gives people enormous flexibility in how to structure their data. And all of this comes from just our love and care for the user experience and I donâ€™t mean just the administrator council, I mean the whole thing, how the people write applications because developers spend 8 hours a day in this environment and we really wanted to make that pleasant for people.

So all these things come from a really pleasant, well-designed query language thatâ€™s chainable and you write it on something like Ruby or Python or JavaScript, pretty much any language that youâ€™re used to. It looks like JQuery, and itâ€™s very similar like if you use Linux Bash where data kind of flows from left to right when you do pipes, so you start out saying my table is letâ€™s say users, so it says table users, and then you say dot, and then you can say other things. Letâ€™s say you want to join it with another table, so you say .join table like game scores or something and then you can say dot again and you can chain things perpetually.

So that query then gets sent to server, compiled on the server, and then gets distributed across the distributed systems of nodes where the computation happens and then as a user, you donâ€™t have to worry about any of that at all, you just get the result, and the system takes care of where is the data, how do I send a query, how do I parallelize it, how do I make it efficient. So from the userâ€™s point of view, you could use this extremely flexible query language to write your application the way you want and then you can get a real time stream of updates which is just as dramatic change in how you would go about building web and mobile applications.

Alexis: Youâ€™ve already kind of answered this question, but we asked some users on Reddit, Twitter, Facebook and the like for questions for you. Could youÂ give us some examples, use cases? They ask, “When is RethinkDB a better choice for building an app than other databases?”

Slava: So I just talked a little bit, we talked a little bit about an example like Quora where youâ€™re like at an answer in your browser and someone else does something else, they make a change, and you see the answer right away, so that will be just a wonderful, wonderful use case for RethinkDB. If youâ€™re building an application like that and you want to bring such a sophisticated user experience to your customers or to your users, RethinkDB makes that really easy.

But let me give you another example of how that could work. So some of RethinkDB users are using it for games, so RethinkDB is the background store for game worlds. Imagine if you have this game where in game youâ€™re selling items to the game players, so you have this end game economy which is a very common kind of thing right now. So it turns out if you talk out to the people who build these games, whatâ€™s really, really interesting about it is that when you, so if you measure demand for particular items in the game and you start changing how many items of a particular type youâ€™d flash into the game world, you could dramatically change, like maximize sales, so you could quadruple your revenue just by paying attention to the in-game economy.

And if you look at how people do this traditionally, they take a snapshot of their game world, letâ€™s say every 24 hours, they transfer it into HadoopÂ and then in HadoopÂ they do some data crunching, and then they figure out â€œOkay, this is how we want to modify our economyâ€ and then they affect the game world. So if you have a functionality where your core database that the game is built on supports real-time queries and real-time update streams on these queries, you can modify this economy in real-time. So you donâ€™t have to wait 24 hours to do this back and forth. You could just do it immediately and the queries can be quite sophisticated. So you can write a MapReduce query and then say give me an incremental set of updates to this MapReduce query.

You could write something like group users by location in the game world and figure out what theyâ€™re buying between these times, and the query can get progressively more complicated and sophisticated and then you just say .changes and you can affect your game world, you can modify your game world in real-time which is just a dramatic change in howâ€¦ I guess people would call this analytics but it isnâ€™t quite analytics because itâ€™s merging this gap between analytics and the real-time sort of experience. RethinkDB is just great for all kinds of use cases like this. Any time you want to do any kind of real-time or reactive experience or you need results to queries in real-time, RethinkDB is great for those kinds of use cases.

Alexis: How long has it been since you started RethinkDB?

Slava: We started the company in May of 2009 so itâ€™s been about five years, a little more than five years.

Alexis: At what point did you get into Y Combinator?

Slava: May 2009. So weâ€™ve been throwing around some ideas before that but really, like the fundamental aspects of the company came out after we joined Y Combinator, so I sort of think of it as May 2009 as the germination time for the company.

Alexis: So whatâ€™s your monetization model for RethinkDB?

Slava: Well as I already mentioned, RethinkDB is open source and we want anybody to be able to use it. If you canâ€™t afford to pay, itâ€™s a free software. You can download it on the internet and start using it. What happens with free software or open source software is if something thatâ€™s at the core of what youâ€™re doing, at the core of your business and itâ€™s sufficiently sophisticated, people absolutely traditionally need to pay for operational support. So people download RethinkDB, they pick it up, they build an application within their organization on top of the product, then they hand it off to operations. And operations are the people, basically the guys that have to wake up at 3 a.m. in case something goes wrong. They generally pay for operational support for products which are the core of their stuff, and RethinkDB is one of those products.

We havenâ€™t actually opened up monetization publicly so you canâ€™t publicly for RethinkDB right now but we will be doing that very soon and thatâ€™s the fundamental revenue stream for the product. We want anybody to be able to use it but when people need sophisticated support, they can pick up the phone and call someone, get someone who is really knowledgeable in the other end of the line who can help them out with their problem.

Alexis: Are you interested in monetization when it comes to RethinkDB as a service, like having it online and having folks spin up their own instances of RethinkDB?

Slava: Right, so you could do right now with EC2Â and we offer people various ways of doing it but we donâ€™t want to be a service company because itâ€™s almost a different business together. If you look at the development process and actually the marketing and just how the business works of building a software company, a software that you ship to people, itâ€™s really dramatically different in many ways from building a service company. And I donâ€™t think you can effectively do that under one roof without significantly splitting focus. So I donâ€™t think weâ€™re going to do the service thing for a while but there has been a couple of services that were built it by our community members and thatâ€™s only going to get better. So we really kind of, we donâ€™t quite outsource it but itâ€™s just something that gets built in the ecosystem around the product by other people, and weâ€™re perfectly happy for that to happen.

Alexis: I usually ask folks about what theyâ€™ve learned about pricing but when it comes to services, when it comes to support, itâ€™s a bit harder to answer and nail down. You donâ€™t have set prices, I guess, everything is pretty fluid, but have you learned anything that might be applicable to other folks who are providing support for their software?

Slava: Yeah, definitely. This particular advice I donâ€™t think would apply to products that you sell to consumers, but when you sell products to businesses, for beginners at least if itâ€™s the first business youâ€™ve started itâ€™s very hard to estimate the value of the product for other people from their point of view, so what people just end up doing is they say â€œWell it costs us this much money to develop this and this much money to support itâ€ and they figure out how much it costs them to build the product, and then they mark it up by some amount so they make some money by letâ€™s say 30 percent or 200 percent or whatever their margins are, and then they go out and charge people that.

But it turns out that if you build a really valuable product for people, you can charge 10 times as much or a hundred times as much and people will be perfectly happy to do that, and theyâ€™ll feel like theyâ€™re getting a lot of value out of it because itâ€™s really important to them and I found that itâ€™s really hard for people that estimate that early on especially because when companies get big, the kind of fundamental assumptions change. So for example there are companies where time is way more important than money for them, like they have a lot of cash, but they need to get the market quicker, and theyâ€™re perfectly happy to trade their cash for time. When youâ€™re selling product, you generally donâ€™t think about it that way right? You just think about how to get your startup off the ground or something.

So that would be the lesson I would tell people, is to make sure to sit down with the customer and try to figure out, and you can just ask people, theyâ€™ll be happy to tell you. You could ask, â€œHow valuable is this to you? How much money would you be willing to pay?â€ And people are kind of afraid to do it or sometimes they think, â€œWell, the other party has the incentive to give you a low-ball number,â€ but in practice I think if you sit down with these people and they are honest and genuine and youâ€™re offering a good product, people are happy to pay way more than you usually think they would.

Alexis: How did you spread the word about RethinkDB in the early days?

Slava: So, I think open source has helped a lot and you canâ€™t just open source a product and expect people will know about it. It doesnâ€™t quite work that way. But open source has been the core of like our beliefs. We really care about it. Everyone who works at RethinkDB, we all love open source and we use open source software our entire lives, so itâ€™s kind of a fundamental part of our culture. When you do that, itâ€™s not just the software is somewhere on some FTP server or something where you could download it. The development process isnâ€™t set-up, the issue tracker isnâ€™t set-up so any user can come in and comment on a feature or a feature request or a bug or when we have technical discussions about features, anybody could come in and comment. So we always think ourselves as just a part of the RethinkDB the system, we just happen to get pain for doing it.

So the product, itâ€™s not just the source code. When we say itâ€™s free or open source, the whole thing is fundamentally open, like the development process is open, the company is open and we really care about maintaining that kind of culture and I think when you do that and you use social media or like Twitter, people begin to really identify with the product, they care about it, making it good because they know they will be heard, and then people starting other need-ups or they themselves write and we can help them out by sending them information how to do it and little gifts, things like that. That kind of thing works really, really well because you get, basically itâ€™s word of mouth because people really care about the product and you just to have help them out a little bit.

Alexis: So how do you get devs to try a new database and more importantly not just try it but push it into production?

Slava: People try new things all the time. Itâ€™s actually a really nice aspect of our users because developers love to tinker and when people first pick up RethinkDB, they might not necessarily think, â€œOh, Iâ€™m going to do this. Iâ€™m going to build this big project and put it in production. They start out saying, â€œI want to build a weekend appâ€ and the project is really is to get started, so they start out, they try it out, they build some app, and then they just absolutely fall in love with the product and then theyâ€™d go to their organization and they find something new get built, they say, â€œIâ€™ve used this amazing product. Itâ€™s a lot of fun to use. Itâ€™s really easy and it solves a lot of problems,â€ and then people look at it and thatâ€™s how things get adopted.

This isnâ€™t the new idea that hobbyists can open drive markets and they can make tremendous difference. So that was the premise of RethinkDB, it was all bottom-up growth. We really wanted to make it just absolutely amazing for people who are tinkering and building apps and then from there, they just go out and spread the world and if the product is valuable, then people will pick it up.

Alexis: You have any examples of who uses RethinkDB that you could share with us?

Slava: Weâ€™re actually going to announce this pretty soon so I donâ€™t want to share specific names of companies. So weâ€™re going to announce pricing and things like that in the next couple of months, so weâ€™re going to do all of that together. So Iâ€™ve been very surprised because we originally built it for web developers and web applications but itâ€™s been used by municipalities and by certain federal agents, this has been used in the financial industry, itâ€™s been used in biotech where people do DNA analysis, and of course itâ€™s used in the web and mobile, but itâ€™s very exciting because it turned out that the product is just really, really horizontal. You could apply it like almost anytime you build something with data or for the internet, you could start using it which at this point is almost anybody who is building software.

Alexis: What have you learned from interacting with the community over these years?

Slava: There have been I think a lot of really interesting lessons. The main one is that what you think people care about isnâ€™t necessarily what people will actually care about, and thatâ€™s really cool because for us, once we got the development process and get-up and people came in and started commenting, we realized, â€œOh man, their problems are, theyâ€™re not the same thing as we think their problems are.â€ So you really have to go out and talk to people and measure things. So I think people who build products, theyâ€™re really good at coming up with like the initial germination or initial vision for what itâ€™s going to be, but then your users and your community is really, really good for refining it.

And itâ€™s always this two-step process where you build something but then you really need the real world to refine it. So that was the first lesson, just what people care about is not necessarily the same thing that you think they care about. The second thing we learned that we felt was very exciting is, and itâ€™s kind of interesting maybe unfortunate in some ways, is that like how hard something is to build is completely uncorrelated to how valuable people will think it will be. So you could do something really simple and people will perceive it as something extremely valuable, or maybe theyâ€™ll perceive it as something extremely difficult.

Iâ€™ll give you one example of this. One example is the backup tool for RethinkDB. So RethinkDB of course allows you to export and back up data, and people cared about this a lot, they always thought itâ€™s really important and it was a missing piece, this was a while back, and we couldnâ€™t quite understand why that is because RethinkDB supports this flexible query language and it takes it 10 minutes to write a script that will let you back up your data. But somehow people felt that this feature is missing something and then we built out this backup command, and you can say RethinkDB backup and it will back up your data and you can restore it back.

It wasnâ€™t quite 10 minutes work. Iâ€™m of course oversimplifying because it had to deal with failures and it had to be efficient, parallelize, all that stuff, so maybe it took a week of work or so but it still seemed really, really simple but people thought of it as a really valuable tool and with this we realized like, â€œOh, you could spend 20 percent of the time that you would think youâ€™d have to spend and people will find it immensely valuable.â€ So I think that was the second biggest lesson and really all of these is just one big point, you have to listen to your users really, really, really carefully and find out what they care about because when you build anything with creative people generally really care about the think that theyâ€™re building, itâ€™s a very emotional thing.

They love the creative process and they love the software but I think in reality, itâ€™s not about you. Itâ€™s about the people who are using the things that youâ€™re building. So you have to love your community more than you love your product if that makes sense. You have to really, really sit down and care about how people perceive it and what problems youâ€™re solving for them and how they think about it, which is totally unobvious and itâ€™s kind of a platitude. Itâ€™s almost like people say well you have to eat right and exercise and it sounds really easy but itâ€˜s really hard to do on practice.

Alexis: Right. Continuing this community thread…. A lot of open source projects, when it comes to hiring, they look towards their contributors particularly the very active ones. Is this something that you all do and what qualities do you look for when hiring other than just contributions?

Slava: So we did hire some contributors to the RethinkDB ecosystem that doesnâ€™t quite work as well at RethinkDB as with might for other projects or other companies. So our target users are people building web and mobile applications typically. That is very different from building a distributed database like C++. So a lot of people who contribute to RethinkDB, they contribute to kind of the outside of RethinkDB, so that contributes to the client drivers which are written in Python or Ruby or JavaScript, node JS or whatever language youâ€™re using. They contribute to the implementation, they contribute to the web UI and thatâ€™s really easy to do. We have tons and tons of contributions from people.

But thereâ€™s this qualitative jump from there to like the core server and the distributed infrastructure, and because thatâ€™s been changing pretty rapidly and there is a large of internal knowledge and internal muscle memory built out around it, it isnâ€™t well documented and it isnâ€™t easier for people to just get into it from the outside. You really have to spend a lot of time to understand how the system works, and we just havenâ€™t quite gotten it to the point where thatâ€™s easy to do because it necessarily hasnâ€™t been a priority. So we did hire contributors from the community when they contributed to the drivers but itâ€™s different. Again itâ€™s not the same thing as the database, the core database. So that would be the first question.

The second one was what qualities to look for. So we never look for people with database experience or anything like that, we really look for just traditional things. So you have to have the passion for what the group is building, because if people care aboutâ€¦ If you care about building an amazing database experience for your users and youâ€™re hiring someone who cares about functional programming languages, that doesnâ€™t necessarily look good because they kind of have a different agenda, they want a different thing. So the number one thing we care about is do people want to build this amazing database experience for their users? Are you passionate about it, are they curious about it? Do they do have a lot of energy around it, like is this something they care about.

And then once people have that, we just look at very traditional computer science knowledge like algorithms, knowing the tool chain, being able to just code out solutions to problems, things I like that. I donâ€™t think itâ€™s very different from any traditional interview process youâ€™d see at Google or Facebook or any other high-tech company.

Alexis: Okay. Returning to funding for a little while, you all have raised more than 10 million dollars, what kind of wisdom could you impart to folks who are beginning to walk down the venture capital road?

Slava: So I think you can raise money in two ways. You can raise money with reputation because youâ€™ve already done something amazing before. Actually three ways. So you could raise money on reputation because youâ€™ve done something amazing before and then people just respect you, they trust you, they know that if they give you capital, youâ€™re going to go and do something amazing or at least attempt going to do something amazing. You could raise money because your product is really taking off and youâ€™ve built something great already, and you get a lot of users, a lot of growth, and then people want to invest into that business and you could raise money just on an idea because you find someone who absolutely falls in love with your idea and just wants to fund it.

And the third one is the hardest because investors fundamentallyâ€¦ So again you have to look at it from their point of view and their businesses, they get money from their limited partners and itâ€™s a financial instrument so they have to return money to their investors and they really care about business that are succeeding. When it comes to venture capital, I think it can be immensely useful just from the point of view of getting capital to grow your business and getting advice from investors on how to build the business that is super useful, but I think again Iâ€™d focus on your users. I think if you build something amazing, everything else is almost an afterthought, thatâ€™s just going to happen. But if you start focusing on venture capital and saying â€œHow do I raise money to build my business,â€ that typically doesnâ€™t work so well.

And I think right now, development has gotten way easier and way cheaper and you can always build something, you could always find the ways to build something the people love on very little capital. So I would go after first. I tried to figure out what is the most important company I could start or the most important product I could build, and once you do that, if you really do that, I think venture capital is just something that follows. Itâ€™s much more about figuring out â€œHow do I pick a good startup idea? How do I actually get on things like that?â€ Iâ€™d focus on that rather than venture capital itself.

Itâ€™s almost like actually if you ever read Paul Grahamâ€™s essay on The Python Paradox, he said, â€œYou want to hire people who are learning programming languages because they care about the programming languages and paradoxically they are learning these but they canâ€™t get jobs doing it,â€ so the paradox is itâ€™s easier to get a job if you learn something where there are not very many job offers for that language.

I think itâ€™s something similar with companies. You want to build a company because you really care about the users even though there may not necessarily very much venture capital excitement around it. Because if there is a lot of excitement around it, it means itâ€™s probably already been built, and if you build the company like that then everything else follows.

Alexis: In your past five years at RethinkDB, whatâ€™s one mistake that youâ€™d rather not repeat?

Slava: Oh man.

Alexis: Weâ€™ve got time for several if you want.

Slava: Let me think about this for a second. So I think the most important thing that Iâ€™ve learned is thatâ€¦ So I really believe, after building RethinkDB for a while, I really believe in the idea of efficient markets and efficient markets for ideas to be specific. And what I mean by that is, so if you look at how people start companies or how they build features or how they do anything, creative people, they typically look at an idea and they say, â€œOh, Iâ€™m going to build apps and apps doesnâ€™t exist now, and Iâ€™m going to go and try to do that.â€ I think thatâ€™s just a fundamental thing that creative people do, like they look at the world and they say, â€œSomething doesnâ€™t exist, Iâ€™m going to go build that and I think itâ€™s going to be successful.â€ I think itâ€™s really important to try and find out why it doesnâ€™t exist because nine times out of 10, there are fundamental structural reasons for why the world is the way it is. Does that make sense? Iâ€™m not sureâ€¦

Alexis: Absolutely yeah. It sounds like itâ€™s part of the saying, â€œDonâ€™t built it if it isnâ€™t useful or if itâ€™s not really needed.â€

Slava: Yeah. So once you realize, once you start asking these questions, say, â€œWhy doesnâ€™t it exist?â€ Basically 90 percent of ideas turned out to be not worth building. And they werenâ€™t building if youâ€™re doing it as a hobby right? You have to be honest with yourself, â€œAm I doing it because itâ€™s fun or am I doing this because itâ€™s useful to people?â€

So I think itâ€™s really, really important. So then you get to this one idea out of ten where you find out, â€œOkay, it doesnâ€™t exist for this structural reason and I think I can overcome this structural reason and it will be very valuableâ€ and then you can go on and do it. So thatâ€™s a mistake that weâ€™ve made I think quite a bit. We build a feature and we say, â€œOh man, this feature will be super cool for people,â€ and then we discovered it doesnâ€™t exist for a particular reason, like thereâ€™s reason why it doesnâ€™t work that way. But once you find this one thing out of ten where the reason is something thatâ€™s changed in the world, like there has been a fundamental change that happened and now the old reason is no longer valid. Then youâ€™ve got something really really valuable.

Because people, like human beings are really good at internalizing answers to things, theyâ€™re really good at internalizing cultures, and theyâ€™re really bad at reevaluating these things and reconsidering them. So when you look at the world and you say, â€œOkay, there has been this fundamental change.â€ Something about the world changed, like more people on the internet for example. What does that mean for everything we already know, for everything we believe, and then you go back from there and examine these beliefs and you donâ€™t even know what to examine. You very often donâ€™t even know you have certain beliefs.

Alexis: Yeah. The fish doesnâ€™t know itâ€™s in water, yeah.

Slava: Yeah, exactly, and thatâ€™s the hardest thing to do and we screw that up probably more than anyone else at the beginning, but I think that is extremely useful so I wouldnâ€™tâ€¦ the one mistake is donâ€™t assume that something doesnâ€™t exist because no one tried it. Usually, at least ten teams tried it and there was a reason why itâ€™s not there, so you really have to stumble on that one thing that happened because the world has changed in some fundamental way and people havenâ€™t realized it yet.

Alexis: Thatâ€™s a very anthropological way of looking at things.

Slava: I guess so.

Alexis: Do you have any tips for how to kind of separate yourself from these assumptions that youâ€™ve already made without knowing that youâ€™ve made them?

Slava: So thatâ€™s really hard. Itâ€™s really difficult. Iâ€™ve kind of been thinking about that a lot and I havenâ€™t quite puzzled it out. Iâ€™d really love to write a blog post on it at some point, and I tried a couple of times and I havenâ€™t come up with anything like pragmatic and useful. So one thing I can think of is you could look at trends, like you could look at companies that are still small but are looking like theyâ€™re growing quickly, and then you could ask yourself, â€œOkay, what would it mean if this company takes over the world? What is the next best thing to build?â€

So for example, when GitHub got started, there were all these other companies saying â€œOh, itâ€™s really cool and valuable to build a hosting and collaboration service around the source control system.â€ So Bitbucket went out and did it from Mercurial, I think,Â and there were a lot of others. So what happens is people start to emulate. Actually, for all the Bitbucket fans, itâ€™s not all implausible that Bitbucket was there first so donâ€™t kill me for this. But I hope you see where the bigger sort of point.

So what happens is when something is beginning to succeed, people start to emulate it and they think they can out compute it, I think itâ€™s much more valuable to say, â€œOkay, this thing is succeeding. Suppose it just takes over the world, what is the next thing you would build?â€ And then you go and build that. And by the time that company has succeeded, like youâ€™ve got something really valuable because youâ€™ve sort of looked at the assumptions differently. Does that make sense?

Alexis: Yeah. Absolutely, yeah.

Slava: So that would be one way of doing it. I would like to think of some more but I canâ€™t think of any at the top of my head.

Alexis: Well, so email me when you got that blog post written. On the flipside, instead of focusing on the negative things, whatâ€™s one decision that youâ€™re particularly proud of?

Slava: Iâ€™m really, really proud of the team that we built here. Iâ€™m really, really proud of the product itself, and itâ€™s really been this creative kind of collaboration of different kinds of people in the company that has worked phenomenally well. So for example, Iâ€™m a systems person and a programming language person, I really love programming languages and I ended up just kind of inadvertently hiring other people to care about that. And my co-founder is someone who really cares about the users experience and heâ€™s thought me a tremendous amount about it and really like inadvertently hired people who cared about that.

And then there are people in the company who care about security, who care about just various other aspects, performance, things like that, so when you take people who care about different things deeply and you put them in a room together and thereâ€™s this battlefield of ideas in a very creative, constructive way, then I think something wonderful happens and you build something real and amazing. So Iâ€™m super proud of the team we put together and how people work together, how respectful they are, and at the same time how critical they are with each otherâ€™s ideas. I think thatâ€™s kind of been the key to building a really pleasant and useful product for people.

Alexis: Speaking of the team, how large has it grown?

Slava: We are 17 people right now.

Alexis: Wow. Are you allÂ distributed or located in one office?

Slava: Right now, weâ€™re all local.

Alexis: Okay, was that an opposition to a distributed team or was it just a fact that you just prefer being local?

Slava: So I think it depends on what you are building, and there is sort of a lot of conventional wisdom now being questioned around local teams versus distributed teams and people talk about collaboration a lot and how some companies are entirely distributed. So for me, that hasnâ€™t actually worked because if you donâ€™t have, human beings are very much, I mean, weâ€™re still human beings, right? Weâ€™ve evolve in a certain way and when you put two people in a room, the kind of creative spark you get out of that is not the same thing as people are in different cities. I think geography still matters to us immensely.

Itâ€™s been very deliberate. I think for a product like this where you require just immense amount of collaboration between people, it will be very hard to get something of this quality if people were distributed. And Iâ€™m not saying it wouldnâ€™t work for other companies or other projects. Itâ€™s just not something that would work particularly well for us.

Alexis: Now, are you sure you studied computer science and you didnâ€™t sit in a few anthropology or sociology classes because thereâ€™s a very humanistic analysis and consideration to some of your questions.

Slava: I learned all of that here in the process of building RethinkDB and making mistakes. Yeah, Iâ€™m definitely an engineer at heart but then software is this very careful and creative mix of engineering and people. You have to care about engineering and you have to care about the people who are building it, funding it, using it, who are writing about it, talking about, so really, Iâ€™ve learned to care about that. And actually, you used the word anthropology a couple of times. I really think what happens is if you take an engineer like a scientist, someone who measures things, and you put him in a world of people and he has to succeed in that world but he knows nothing about it, well heâ€™s going to fall back in what he knows. Heâ€™s going to observe them and create hypothesis and test them, and thatâ€™s really what Iâ€™ve done.

Alexis:Â Applied anthropology, okay. So whatâ€™s the biggest opportunity that you see now in open source?

Slava: I think there have been a lot of companies built like Microsoft or Oracle, around close source software that built infrastructure, and I donâ€™t think that can happen again. So early adaptors basically like made open source just so much the prerequisite. If youâ€™re going to build infrastructure software and developers are a fundamental part of your audience, you have to make it open source. I think that there will be a lot more infrastructure companies built around it, and the reason why I say infrastructures because if you build plan site tools, you canâ€™t make money from it. People will just download it and use it for free and people wonâ€™t really pay for support because itâ€™s not a fundamental part of your business. Itâ€™s something that you kind of use to build as opposed to the core of it.

So I think there will be a lot of really interesting infrastructure companies around it, and you can look at Docker, CoreOS and theyâ€™re building just fundamental pieces of infrastructure, just rethinking all of the old assumptions again. Before, like Red Hat has been the open source company offering operating systems, and Canonical was another one. And now, enough has changed in the world that you could go out and build a new operating system that does things differently and CoreOS is doing that. Same thing for Docker and VMware or Xen.

So I think thereâ€™s a lot of opportunities to look at infrastructure and say, â€œHow has the world changed where all assumptions donâ€™t apply and then go build that.â€ I think there will be a lot of open source companies doing it. So thatâ€™s one thing.

Another aspect of this is that Canonical I think used to be like this leading open source company that everyone would look to for community values and just propagating open source, the idea of open source and philosophy of open source.

Alexis: The shining beacon on the hill for open source.

Slava: Thatâ€™s right, and I think theyâ€™ve lost their way a little bit. When I ask people â€œIf you could think of one company that does that, who can you think of?â€ and people donâ€™t really name Canonical anymore. So Mozilla is doing that now to a large degree but Mozilla isnâ€™t quite a business in the same way. They have a different mission and itâ€™s amazing, but I think thereâ€™s an opportunity for another company to arise as the shining beacon of open source and really return to these philosophical and human values of what open source represents.

Alexis: Now this changes from situation to situation but in general, what are some ways that you consider to be the best way to sustain open source?

Slava: Do you mean an open source business, the company or the movement in general?

Alexis: Both but in specific, I was more asking for a person who has an open source project and theyâ€™d love to make a living on it or help that support.

Slava: So open source is really interesting because from of a business point of view, itâ€™s actually, unless youâ€™re doing things very specifically, itâ€™s not necessarily a really good way of making money. So Iâ€™ll give you an example. Traditionally, if you took to economists and we mentioned anthropology and now weâ€™re switching to economist a little bit and I really care about it too, but if you talk to economist, they would generally say that money is a really good way and probably the only way to measure value.

Traditionally that has been really true. When you build a product or you build a service and you figure out how much money that service is making, well youâ€™ve just figured out how valuable it is to people. But I think again something changed in the world where this isnâ€™t specifically true and one good example of this is Wikipedia. Wikipedia does not really make money but itâ€™s just how valuable is that for humanity, like if you took Wikipedia away, I think it weâ€™ll just be set back so tremendously. So there are things that are getting built now that are extremely valuable but we canâ€™t charge for it for various reasons. And I think open source, for a lot of open source projects, thereâ€™s something similar going on.

Again you have to be really honest with yourself, â€œAm I in it for the money or am I in it because I really care about the things that Iâ€™m building and itâ€™s a hobby for me,â€ or a philosophical reason or whatever reasons people have. So if youâ€™re in it for the money, I think starting a hobbyist open source project is probably not the best way of doing it.

But if you have started an open source project and you really care about it, you can do it I think in two ways. So if itâ€™s a plan tool, you really canâ€™t make a business out of it. You have to just ask people for donations and you have to structure it as a non-profit and use Kickstarter or use social media and say, â€œHey, we need that much money to fund this product for the next year. If you care about it, please donate.â€

And people love doing that and you can give them incentives too. So again back to economics, incentives work really well so you could sell mugs or sell t-shirts, like sell little kind of trinkets that exude the philosophy of your project and mark them up and be honest about it and people will be super happy to do that. So thatâ€™s for plan site stuff but if youâ€™re building infrastructure, you could charge people for support and they will be happy to do that but thatâ€™s hard to do for an individual contributor because you have to work on the projects and then it basically becomes consulting.

Alexis: A pair of questions from the mailbag before we wrap things up. One Reddit user says, â€œHas your approach to working with large code bases or architecture changed over the course of working on the RethinkDB project? Any stylistic best practice or architectural ideas that youâ€™ve adopted or rescinded?â€

Slava: Yes, the process has changed dramatically. So one thing Iâ€™ve realized, and this is kind of similar to the question you asked before about what we’ve learned. So when we got started, we did it under theÂ premise like, â€œHey, there are so many papers in academia about really interesting ideas so letâ€™s go out and implement some of them and see if theyâ€™re valuable to people.â€ But it turns out that once you it get to the big system, you canâ€™t really do that because thereâ€™s so much just engineering low-hanging fruit, there is so much work to do. You really have to keep things like super simple and if youâ€™re making something complicated, you better have a good reason for why youâ€™re making it complicated because itâ€™s probably not going to work.

And this is kind of the philosophy that Linus adopted for the Linux kernel, like there are so many research papers on how to improve the Linux kernelÂ and so many experimental algorithms and none of them make it into the kernelÂ because theyâ€™re just too complicated for people to understand. So thatâ€™s one thing that weâ€™ve learned, just keep things super, super simple, like once the code base gets big, you just canâ€™t afford to put complicated things around it. You simply canâ€™t. You have to keep things simple. And I think thatâ€™s what a lot of people that are writing papers, so you could write papers because itâ€™s interesting and thatâ€™s great, youâ€™re just building up knowledge for humanity, but you have to understand that you typically just can’y put that into a big focus,Â empirically it doesnâ€™t work. Itâ€™s too hard for people to do.

So thatâ€™s lesson number one. Lesson number two is codeÂ reviews. This is again like diet and exercise, like everyone knows youâ€™re supposed to do it but very few people actually do it.

Alexis: And flossing, you canâ€™t forget flossing.

Slava: Thatâ€™s right, and floss. So halfway in, we implement the codeÂ review policy and we also learned that there are some things like flossing that you canâ€™t do halfway, like you either do it everyday or you may as well not do it at all. We codeÂ review absolutely every single commit. And codeÂ reviews are amazing for two reasons. One is quality and two is dissemination of knowledge among the team because knowledge is not just one person who knows how this piece of code works but itâ€™s at least two people, and that worked phenomenally well, and also that helps to keep things simple because if you built something really complicated, the reviewer isnâ€™t going to understand it and theyâ€™re going to force you to simplify it. So codeÂ reviews have been just like absolutely amazing for this kind of thing.

Alexis: And it keeps everybody intimately familiar with the code, yeah?

Slava: Yep, yep. It really, really helps because youâ€™ve got more than one person looking at a piece of code, more than one person knows how this works. If I sit alone in a room and think of something, like we really need other human beings to test ideas and refine ideas. Just like a product, you need users to refine the ideas for your product. For code, you need other people to look at it to refine your ideas and your codeÂ base or a feature or whatever or the class that youâ€™ve built, so codeÂ reviews have been just incredibly useful and I encourage everyone. Like if youâ€™re not doing codeÂ review in every commit and youâ€™re doubting like, â€œMan, is this worth spending the time?â€ like the answer is yes, itâ€™s worth it.

Alexis: Alright, now, wrapping up the mailbag here, these are the last two questions from listeners out there. Whatâ€™s next for RethinkDB?

Slava: So weâ€™ve been working on the product for quite a while and RethinkDB has a big surface area. Thereâ€™s just like a lot to do right there as the storage engine and the distributor system around it and the client drivers and the administrative UI, thereâ€™s a lot to do and weâ€™ve been doing it for a while and weâ€™ve been working super hard on polishing the product and making it absolutely the best product there is.

So right now, I got to the point where the product is good enough that itâ€™s extremely useful to people and people like absolutely love it. So now what Iâ€™m working on is just I just want to go out to the world and get as many people to know about it as possible, and I just tell people exactly what it is, how it was built, what itâ€™s for, what it will do for them, and I think youâ€™ll be hearing about RethinkDB a lot more because I switched my personal focus on doing that. And Iâ€™m very excited about it because for the users of the product, itâ€™s very useful because the more people know about it, the more stuff gets built, the more resources around there is, so I think the community is about to get a lot. Itâ€™s already very vibrant and itâ€™s very intimate, itâ€™s great to be a member of it, but I think itâ€™s going to get a lot more vibrant pretty soon. So thatâ€™s from just the community point of view.

From features point of view, there are a couple of things that Iâ€™m really, really, really, excited about. I canâ€™t wait. Iâ€™ve been playing with prototypes and itâ€™s absolutely amazing, so Iâ€™ll tell what those are. Weâ€™re shipping geospatial queries and geospatial indexes hopefully next week.

Alexis: This will be very neat.

Slava: Itâ€™s very neat and itâ€™s very exciting. Itâ€™s just absolutely awesome to play with. Itâ€™s very visual and itâ€™s very useful. If youâ€™re building anything with maps or locations, itâ€™s just awesome, so Iâ€™m very excited about it, and a lot of users have been asking for it for a long time, we just took the time to do it right. The thing thatâ€™s shipping after that, itâ€™s going to take a little bit of time but what Iâ€™ve been playing is the current version and itâ€™s awesome, is the cluster management and the administration API, and just to give you very briefly just what this is. So when you are using RethinkDB right now and you want to shard your database or shard your table, itâ€™s very easy to do. You could say I want 3 shards and this cluster will automatically partition everything or add replicates or whatever. It will just work and itâ€™s great. But right now it doesnâ€™t have very much visibility. So itâ€™s not clear to people how that happens internally. Itâ€™s pretty hard to understand and you canâ€™t easily make programmatic changes. You either have to do it by the web UI or scripted by a commandÂ line tool.

So weâ€™re integrating all of these into the recall programming language so youâ€™ll be able to write queries that manipulate your cluster and look at its status. And weâ€™ve put an enormous amount of effort into taking like these super complicated aspects of the distributed system and making it just dramatically simple for people. So the API is probably going to be the easiest thing to use or as easy as anything else in RethinkDB, but itâ€™s this thrown into a super complicated thing underneath and weâ€™ve worked a lot, itâ€™s like a crucible where you take these complicated concepts and make them simpler and simpler and simpler until the thing is really beautiful. So I think it will be super useful to our users because a lot of people have been running clusters of RethinkDB and have been working around these issues, so itâ€™s just useful pragmatically for programmatically changing clusters and monitoring what they look like, monitoring performance, and itâ€™s also really, really beautiful from the distributed system point of view and making it simple to understand, so I think people are going to love these two features and Iâ€™m super excited about the upcoming two releases.

Alexis: One question I ask everybody on the show is what is your text editor of choice?

Slava: Emacs. No doubt about it. I do everything in Emacs.

Alexis: Which text editor do you think is in the lead so far?

Slava: In the lead, I think itâ€™sâ€¦

Alexis: Among I should say our guests.

Slava: Well, I think in our company itâ€™s Vi. Itâ€™s about three quarters vi. The rest is Emacs and a couple of other editors that people use. I think in the Linux world and the Unix world, itâ€™s undoubtedly vi but itâ€™s not going to make me switch. You can pry Emacs from my cold, dead hands.

Alexis: Mitchell Hashimoto was a switch-over from Emacs to Vim and now Sublime Text which, not an official count, it seems to be winning among our guests.

Slava: Sublime?

Alexis: Sublime, yep.

Slava: Oh really? Cool. I didnâ€™t know that. So actually just for the record, I use vi for like one-off tasksÂ and stuff and I love that editor too but like I spend 90 persent of my time, itâ€™s split between Emacs and Gmail.

Alexis: Alright. And one last surprise question unique to you. I noticed on your website, you linked to a clip of Star Trek: The Next Generation. I was wondering why you chose that scene and if you could explain to the listeners which one it is.

Slava: Oh are you talking about the Commander Data?

Alexis: Yes.

Slava: Yeah. So I love Star Trek and we could probably spend another hour just talking about that show. I think itâ€™s absolutely amazing. By the way, I think we need a new Star Trek show that reexamines the world and does Star Trek for the new world and I think JJ Abrams has done a pretty good job with the movies. As a Star Trek Iâ€™m not entirely happy about it but at least weâ€™ve got something and the mainstream audience cares about it. But I think Star Trek is amazing and I know itâ€™s not perfect and itâ€™s campy and I get all that but I think it envisions aÂ society thatÂ would be great if we all kind of try to move towards. So I love the show.

You mentioned anthropology and they do this really well, like they look at humanity and theyâ€™ll say â€œWell, if humans act like this in this situationâ€¦â€ Itâ€™s almost like youâ€™re watching national geographic, like itâ€™s about yourself, itâ€™s great. So I love that show, and the specific clip youâ€™re referring to is where Commander Data which is this android, heâ€™s an artificial being. He meets his creator and he asked his creator like â€œHey, why did you create me?â€ The creator goes, â€œWhy does a boxer box or why does a painter paint?â€

And the reason why I picked that clip is because peo

Download Episode

Building a business on open-source and commercial software.