One thing Twitter could learn from Google

Sep 2, 2009 by     8 Comments    Posted under: Social Media

If you don’t know, I think that Google is a evil company. If you read Wired,  you probably read the article about the company being under scrutiny for monopoly practices.

Yesterday, when GMail went down, millions of people had a wake up call regarding the faith that they put on a service that when it goes down leaves the users lost without  access to one of the most important tools used today: e-mail.

Having said this, it is my opinion that Twitter has a lot to learn from Google and one thing it could learn, right now is how to communicate.
Let’s face it: Twitter Status’ informations are frustrating:  They are updated hours after the events occur (usually making people turn to Facebook to know if Twitter is down and not to the official Twitter Status website) and when problems are solved nothing is explained.

You would expect that a bigger company like Google would treat it’s users in the same way, or worse, but that is not the case.

When GMail went down – reports from some countries talk about 5h in a row – alongside taking care of things,  Google started to communicate with it’s users. And after the problem was solved a full explanation was given coming from Google’s VP for Engineering and Site Reliability Czar Ben Treynor:

Gmail’s web interface had a widespread outage earlier today, lasting about 100 minutes. We know how many people rely on Gmail for personal and professional communications, and we take it very seriously when there’s a problem with the service. Thus, right up front, I’d like to apologize to all of you — today’s outage was a Big Deal, and we’re treating it as such. We’ve already thoroughly investigated what happened, and we’re currently compiling a list of things we intend to fix or improve as a result of the investigation.

Here’s what happened: This morning (Pacific Time) we took a small fraction of Gmail’s servers offline to perform routine upgrades. This isn’t in itself a problem — we do this all the time, and Gmail’s web interface runs in many locations and just sends traffic to other locations when one is offline.

However, as we now know, we had slightly underestimated the load which some recent changes (ironically, some designed to improve service availability) placed on the request routers — servers which direct web queries to the appropriate Gmail server for response. At about 12:30 pm Pacific a few of the request routers became overloaded and in effect told the rest of the system “stop sending us traffic, we’re too slow!”. This transferred the load onto the remaining request routers, causing a few more of them to also become overloaded, and within minutes nearly all of the request routers were overloaded. As a result, people couldn’t access Gmail via the web interface because their requests couldn’t be routed to a Gmail server. IMAP/POP access and mail processing continued to work normally because these requests don’t use the same routers.

The Gmail engineering team was alerted to the failures within seconds (we take monitoring very seriously). After establishing that the core problem was insufficient available capacity, the team brought a LOT of additional request routers online (flexible capacity is one of the advantages of Google’s architecture), distributed the traffic across the request routers, and the Gmail web interface came back online.

What’s next: We’ve turned our full attention to helping ensure this kind of event doesn’t happen again. Some of the actions are straightforward and are already done — for example, increasing request router capacity well beyond peak demand to provide headroom. Some of the actions are more subtle — for example, we have concluded that request routers don’t have sufficient failure isolation (i.e. if there’s a problem in one datacenter, it shouldn’t affect servers in another datacenter) and do not degrade gracefully (e.g. if many request routers are overloaded simultaneously, they all should just get slower instead of refusing to accept traffic and shifting their load). We’ll be hard at work over the next few weeks implementing these and other Gmail reliability improvements — Gmail remains more than 99.9% available to all users, and we’re committed to keeping events like today’s notable for their rarity.

Even if you just browse over this information, get lost in the technical details or don’t read it at all, the fact is that the information is there and it gives the user an insight what the company had to make to solve the problem and what measures they are putting into place for this not to happen again. (Still I would not rely on a GMail account as my main e-mail account)

In the case of Twitter, instead of an explanation from the company about the causes, effects and solutions all you get is:

Trends for Search Unclickable

We’re aware and currently working on an issue in which trends in search are unclickable; updates as we resolve the issue.

Update (2:36pm) Trend links are now fixed.

As you can see there are no updates as the issue is solved, no explanations, zilch, nothing.

It is ironic, to me, that a company that makes people communicate with each other doesn’t know how to communicate with its users. And this is something that also should be corrected.
That and spam. But that is another post all together.

8 Comments + Add Comment

  • #Twitter New Blog Post: "One thing Twitter could learn from Google": http://tr.im/xIsy

  • RT @fjfonseca: #Twitter New Blog Post: "One thing Twitter could learn from Google": http://tr.im/xIsy

  • RT @fjfonseca: #Twitter New Blog Post: "One thing Twitter could learn from Google": http://tr.im/xIsy

  • @booksbelow Hi, when you have the time: #Twitter New Blog Post: "One thing Twitter could learn from Google": http://tr.im/xIsy

  • It’s very true that twitter is very poor about communication with it’s users. I wrote a post not long ago asking what responsibilities a private company has when they have become a de-facto utility, with many people dependent on them. Communication with their users should surely be at the top of the list! And it really baffles me that better communication costs them nothing, but poor communication results in very bad PR. (ie this post! :-) Thanks for a great piece.

  • RT @fjfonseca: #Twitter New Blog Post: "One thing Twitter could learn from Google": http://tr.im/xIsy

  • @shellykramer Something for you to read Dear, when you have the time: "One thing Twitter could learn from Google": http://tr.im/xIsy :-)

  • @richgonzalez @shellykramer @drnelk As I point here: http://low.cc/KW9s8 it is a problem of how Twitter is communication with users as well.

Got anything to say? Go ahead and leave a comment!

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>