Softpedia
 


December 29th, 2010, 15:02 GMT

Skype Explains 'Perfect Storm' of Issues Behind the Massive Outage Last Week



Skype for Windows client bug led to the massive outage of last week
Last week, as many of you may be aware, Skype suffered a major outage lasting up to a day. While the service did not shut down completely, many users had trouble connecting. Skype has now provided a detailed explanation of why this happened and what it is doing to prevent it from happening again.

As is often the case, the massive outage was due to a combination of problems. The way the system is set up made it prone to a domino effect, in which the initial issue led to another, which led to yet another, with few ways of stopping the progression.

The initial cause was an overload of some servers responsible for offline messages. Because of the overload, messages sent to the clients were delayed.

For most client versions, this wasn't a problem, but a bug in Skype for Windows 5.0.0.152 led to crashes when the delayed messages were received.

The bug only affected this particular version of the Skype client. Skype for other platforms and the older Skype 4.0 for Windows did not have this problem, and neither did the newer Skype 5.0.0.156, to which few users had upgraded.

As it happens, Skype 5.0.0.152 was the most popular client at the time, with about 50 percent of users running it. Of those, 40 percent were affected by the bug, which works out to about 20 percent of all Skype users.
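The 20 percent figure is simply the product of the two shares quoted above. A minimal sketch of that arithmetic follows; the variable names are illustrative and do not come from Skype.

```python
# Figures quoted in the article; variable names are illustrative only.
share_on_5_0_0_152 = 0.50  # fraction of all users running version 5.0.0.152
share_hit_by_bug = 0.40    # fraction of those users whose client crashed

# Fraction of the whole user base affected by the crash.
share_of_all_users = share_on_5_0_0_152 * share_hit_by_bug
print(f"{share_of_all_users:.0%} of all Skype users affected")
```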

This wouldn't have been such a big issue if it weren't for Skype's peer-to-peer infrastructure. Skype uses p2p for communications, meaning that clients actually 'talk' directly to each other without going through a server. This method enables Skype to offer free VoIP services, but it is also a weakness.

Even though the system is p2p, some clients act as "supernodes," which coordinate the actions of hundreds of clients. Skype shut down the overloaded servers, effectively removing the cause of the crashes, but 25 to 30 percent of supernodes were taken down by the crashes affecting the Windows clients.

As a result, the remaining supernodes had to handle all of the communications. This was made worse by the fact that users kept restarting the crashing clients, which led to a huge increase in traffic.

Skype says that traffic to the remaining supernodes was about 100 times higher than normal. Supernodes have a built-in safeguard that shuts them down before they take up too many resources on the client machine.

With the load increase, more supernodes shut down, putting even more strain on the remaining ones. Eventually, most of the supernodes became unavailable, effectively preventing users from connecting.
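The feedback loop described above can be illustrated with a toy model. This is only a sketch under simplifying assumptions, not Skype's actual algorithm: traffic is assumed to be split evenly across the live supernodes, each node is assumed to have a fixed load limit above which it shuts itself down, and the function name and numbers are hypothetical.

```python
def simulate_cascade(thresholds, total_load, max_rounds=100):
    """Return the number of live supernodes after each round.

    thresholds: hypothetical per-node load limits; a node shuts itself
                down when its share of the traffic exceeds its limit.
    total_load: total traffic, assumed to be split evenly across the
                nodes that are still alive.
    """
    alive = list(thresholds)
    history = [len(alive)]
    for _ in range(max_rounds):
        if not alive:
            break  # every supernode is gone
        per_node = total_load / len(alive)
        survivors = [limit for limit in alive if limit >= per_node]
        if len(survivors) == len(alive):
            break  # no node is overloaded, so the system is stable
        alive = survivors
        history.append(len(alive))
    return history


limits = [10, 12, 14, 16, 18, 20]
print(simulate_cascade(limits, 60))  # load of 10 per node: stable
print(simulate_cascade(limits, 66))  # 10 percent more traffic: collapse
```

In this toy setup, a load of 60 leaves all six nodes running, while a load of 66 knocks out the weakest node, which pushes the per-node share past the next node's limit, and so on until none remain. Each shutdown raises the load on the survivors, which is the same dynamic the article describes.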

Skype intervened by artificially adding more supernodes to the system, which it dubbed "mega-supernodes." It did this by diverting resources from the group video call feature. Skype continued to add these mega-supernodes until the system started to restore itself. The outage lasted for about 24 hours.

Skype says that it is now working on several ways of preventing this from happening in the future. One option it is considering is automatic updates for minor version patches.

Skype is also looking at ways it can detect problems and act on them sooner. The company says that the testing procedures will get an overhaul as well.

Skype has become indispensable to millions of users and, while the basic service is offered for free, people have come to expect that the services they use, particularly one as important as communications, work regardless of what they are paying for them.
FILED UNDER: Skype, outage, VoIP


READER COMMENTS:


Comment #1 by: Mike on 01 Mar 2011, 05:54 UTC

Skype is not too bad. However, the truth is, there are many material misrepresentations in this newest explanation provided by Skype!

1. "Skype has now provided a detailed explanation of why this happened?"
The truth is, Skype already made an attempt to explain it before 12/24/2010. That time it was a geeky explanation of supernodes. And now they came up with this explanation that blames the outage on something else?

2. "The outage lasted for one day?" The truth is, it lasted for TWO days.

3. "Version 5.0.0.152 was the most popular?" The truth is, at Skype you cannot choose what you like! At Skype you don't choose your version! Skype keeps upgrading your software, and they do it without your permission!

4. "Skype 4.0 for Windows didn't have this problem and neither did the newer Skype 5.0.0.156, which not a lot of users had upgraded to?" The truth is, my version was 5.0.0.156. And I didn't choose it. Skype upgraded to it, without my permission. And the truth is, YES, it did have the same problem!

5. "Skype 4.0 for Windows didn't have this problem?" NOT true! My own brother was able to hang on to a version 4.0 of the Skype software. And the truth is, Skype version 4.0 had the same problem!

6. The truth is, the Skype software always has had many, many bugs. Probably because customer service is zero, and probably because user feedback is completely ignored by Skype. And probably because network testing at Skype is near-zero. Therefore it's a miracle that these outages don't occur more often.

7. Personally, the Skype outage cost me $110.00 cash and other expenses, all of which I can prove. Skype further costs me approx. $20.00 cash per month, every month, and I can prove this, too. Therefore, after the 2-day outage, how did they compensate me? They awarded me a grand total of $1.00 cash to compensate me for all my losses. What a generous company!

Copyright © 2001-2012 Softpedia.
