background preloader

The Cooperative Association for Internet Data Analysis

The Cooperative Association for Internet Data Analysis

http://www.caida.org/home/

Related:  Data mining

Eureqa Eureqa is a breakthrough technology that uncovers the intrinsic relationships hidden within complex data. Traditional machine learning techniques like neural networks and regression trees are capable tools for prediction, but become impractical when "solving the problem" involves understanding how you arrive at the answer. Eureqa uses a breakthrough machine learning technique called Symbolic Regression to unravel the intrinsic relationships in data and explain them as simple math. Using Symbolic Regression, Eureqa can create incredibly accurate predictions that are easily explained and shared with others.

The 100 Best, Most Interesting Blogs and Websites of 2012 (Categorized) Editor’s note: 2015’s list of the best, most interesting websites has arrived! I don’t know about you, but I’m constantly running into so many cool websites I want to remember or recommend to people but I don’t always have time to make a note of them, add them to Evernote, bookmark them in Delicious, etc. So this is a good example of a post I created as a personal (and completely unscientific) resource that other people will probably benefit from as well. Some of these sites are pretty new while others have been around for a while now, but they are all awesome right now in 2012. So whether you are looking for a way to catch up on the latest Internet trends, want to find some exciting new content to load into your favorite news aggregating app or are just bored and need to waste some time online, this list represents hours of potential discovery.

HTTP Compression use by Alexa Top 1000 Yesterday, frontend madman and performance nut Paul Irish reached out to me asking if I had any stats on the use HTTP compression. I’ve written a bunch about the benefits of HTTP compression, as well as the challenges in implementing it. Surprisingly, I realized that, no, I did not have any figures about HTTP compression usage by major sites. 5 tools to transform or enhance text-heavy articles Credit: Image by JM Some rights reserved. During the news:rewired conference last week, Ezra Eeman of Journalism Tools gave delegates a list of essential tools journalists should be aware of when creating enganging content for their audiences. Some of these tools can be applied directly to text-based articles to add a multimedia element and, overall, a more colourful story. There are 50,000 news apps every month, not counting new web online tools found around the internet which journalists can use, said Eeman, and it can be quite difficult to sift through them.

Weka 3 - Data Mining with Open Source Machine Learning Software in Java Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes. Found only on the islands of New Zealand, the Weka is a flightless bird with an inquisitive nature.

The 100 Best, Most Interesting Blogs and Websites of 2013 Editor’s note: 2015’s list of the best, most interesting websites has arrived! Welcome to one of the web’s most anticipated lists of the year: The 100 Best, Most Interesting Blogs and Websites of 2013. This is the second such list I have made since founding DailyTekk and it’s very special. A website can only be featured ONCE in this annual series, so make sure to check out last year’s list so you don’t miss 100 other amazing sites and blogs. I’ve broken this monster list down into 19 categories and limited myself to five links per category (except for the last category) to ensure I whittled away all but what I thought were the most deserving candidates of right here and right now in 2013.

Alpha Personal Analytics Connect with Faceook, sign in for free, and get unique, personalized information anad analysis on your social data-computed by Wolfram|Alpha Clustering of your friends What are the groups of friends that make up your network? Guardian Australia: lessons in online-only publications When the Guardian launched its online-only Australian edition in May 2013, the site faced competition from already established national sites such as ABC, and the 18-strong editorial team operating out of its Sydney office were working around the clock to get the outlet up and running. Little over 12 months later, Guardian Australia is one of the country's top ten news sites, according to Nielsen Online, and receives 5.55 million monthly unique visitors, according to the latest figures from the Audit Bureau of Circulation. The team, headed up by editor-in-chief Katharine Viner, who is also deputy editor of the Guardian, has more than doubled to almost 50, including 29 reporters in addition to commercial and technical staff.

Data Mining, a useful tool in Business Intelligence In many occasions we have heard about Data Mining but, what is it exactly and when do we have to use it?. Well, I am going to start with some basis definitions I have collected from different sources and authors and I have made a nice combination (from my point of view) that I will share in this post. What is it? Center for Complex Networks and Systems Research We study the structure and dynamics of Web traffic networks based on data from HTTP requests made by users at Indiana University. Gathering anonymized requests directly from the network rather than relying on server logs and browser instrumentation allows us to examine large volumes of traffic data while minimizing biases associated with other data sources. It also gives us valuable referrer information that we can use to reconstruct the subset of the Web graph actually traversed by users.

Related: