
Technology
Get flash to fully experience Pearltrees
Tablet Devices
Transistors
For online auction powerhouse eBay, big data is serious business. The company has 100 million active users globally, 300 million live listings at any time (and it archives them all), receives 2 billion page views daily, and handles 250 million search queries and 75 billion database calls a day. How does eBay make sense of all this activity?
Under the covers of eBay’s big data operation — Cloud Computing News
Payments
API
Analytics
POS
ParkFree
NoSQL Pioneers Are Driving the Web's Manifest Destiny — Tech News and Analysis
Twitter has scaled back its plans to store billions of tweets using Cassandra, a non-relational database project that Facebook created and open sourced. Friday night, Twitter said that it will still use Cassandra in a new real-time analytics project it is building, but the decision to move away from plans to migrate tweets from its current MySQL database to Cassandra is seen by some as a blow to startups and open-source projects that are attempting to move beyond relational databases. But in reality, the level of interest about what database architecture some popular startup is using goes beyond Twitter and Cassandra, and touches on the changing nature of both the web and the software that underlies it. In short, the story here isn’t about Cassandra or databases themselves, but about groups of pioneering programmers reacting to the new ways they can build software in a world where computing is cheap.Cassandra NoSQL Database an Apache Top Level Project | Web Builder Zone
Saying Yes to NoSQL; Going Steady with Cassandra | Digg About
Cassandra @ Twitter: An Interview with Ryan King • myNoSQL
How Twitter Uses NoSQL - ReadWriteCloud
InfoQ has released a video of Twitter 's Kevin Weil speaking at Strange Loop earlier this year on how the company uses NoSQL. Weil is quick to point out that Twitter is heavily dependent on MySQL. However, Twitter does employ NoSQL solutions for many purposes for which MySQL isn't ideal. According to Weil, Twitter users generate 12 terrabytes of data a day - about four petabytes per year. And that amount is multiplying every year. Read on for our notes on Weil's talk.NSA open sources Google database mimic • The Register
Talk on eBay architecture
Randy Shoup and Dan Pritchett gave a talk on scaling eBay, "The eBay Architecture", at SD Forum 2006. The slides are available ( PDF ). The parallels with Amazon are remarkable. Like Amazon, eBay started with a two-tiered architecture.Data Hegemony
This is a wonderfully informative Amazon update based on Joachim Rohde's discovery of an interview with Amazon's CTO. You'll learn about how Amazon organizes their teams around services, the CAP theorem of building scalable systems, how they deploy software, and a lot more. Many new additions from the ACM Queue article have also been included. Amazon grew from a tiny online bookstore to one of the largest stores on earth.
High Scalability - High Scalability - Amazon Architecture
Amazon Technology"
The massive technology core that keeps Amazon running is entirely Linux-based . As of 2005, Amazon has the world's three largest Linux databases, with a total capacity of 7.8 terabytes (TB), 18.5 TB and 24.7 TB respectively [ ref ]. The central Amazon data warehouse is made up of 28 Hewlett Packard servers, with four CPUs per node, running Oracle 9i database software. The data warehouse is roughly divided into three functions: query , historical data and ETL ( extract, transform, and load -- a primary database function that pulls data from one source and integrates it into another). The query servers (24.7 TB capacity) contain 15 TB of raw data in 2005; the click history servers (18.5 TB capacity) hold 14 TB of raw data; and the ETL cluster (7.8 TB capacity) contains 5 TB of raw data.Twitter is arguably the most heavily used Ruby on Rails application in the world. Almost since its inception, Twitter has fostered a wildly passionate cult following. Also from the beginning, Twitter has suffered from chronic outages under that load. In the past month, record downtime has prompted fresh outcry within its ever-growing user base.

