
Data Analysis with Open Source Tools - O'Reilly Media.
• The underlying properties of data
• The ways to represent the current status of the data
• The criteria to select relevant data and attributes
• The algorithms to analyze the selected data and attributes
• The ways to report the conclusions of the performed data analysis


The author, Philipp K. Janert, takes a designer's approach rather than an implementer's approach. That means you will gain important suggestions and tips for proposing a data analysis plan, rather than instructions for building an entire or partial information infrastructure with open source tools like Python, R, PostgreSQL and Weka. For some developers, then, the lack of full programming constructs may be disappointing. However, I feel that Philipp K. Although the implementer's approach is not fully covered, you'll be able to understand how analytical demands can be satisfied using the programming languages Python and R specifically, given their speed of execution, numerical analysis capabilities and cross-platform support.

Search results for indekspot on Delicious. Indekspot (indekspot) Solr Lucene implementation and commercial-grade support. Apache Solr hosted search is now live on the site. Acquia Announces Hosted Solr Search Product. Apache Solr Search Integration. Hosted Solr site search for Drupal is on the way. The search technology area is highly important to people with websites.


As a result, I've spent serious time looking at it. Several things have come from this time spent:
• The important thing: We'll soon be adding "hosted site search" capabilities to the Acquia Network for our subscribers. More about this below.
• The unimportant thing: Search was modestly influential in the selection of our company name. ah-kwe-eh is the (native American) Navajo WW II code talker word for "Locate." I reasoned that this made sense because most websites are built to help site visitors locate what they're looking for - either information, people, products, or information about people or products, etc.
Why bother with offering hosted site search with Google around?

A ton of this value can simply be obtained by using knowledge that Drupal has about its content as facets, e.g. when a page (node) was created, by whom, what its taxonomy / folksonomy tags are (and that these are tags), etc. Hosted SOLR or Lucene service. Just putting the question out to the blogosphere (love that word!)


– is there any interest in a hosted Lucene or SOLR search service? It may be a non-starter, given that Google and Atomz have wrapped up the 'hosted web search' market segment already, but perhaps not. Most people need to search through their web data, true, but Google, Atomz, etc. search the content after it's published. Would there be much (or any) benefit in being able to index and search data before it's published to the web (perhaps with extra metadata that is not necessarily easily publishable)?
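To make the pre-publication idea a little more concrete, here is a minimal sketch of what a client of such a hosted service might prepare: a document carrying extra, unpublished metadata, and a faceted query restricted to unpublished items. The field names (`published`, `internal_note`, `author`, `tags`) and the helper functions are hypothetical, and nothing is sent over the network. The query uses Solr's standard `q`, `fq`, `facet` and `facet.field` parameters, and the payload is a plain JSON array of documents, which is an assumption about how the service would accept updates.

```python
import json
from urllib.parse import urlencode


def build_update_payload(docs):
    """Serialize documents as a JSON array (assumed update format)."""
    return json.dumps(docs)


def build_facet_query(text, facet_fields, only_unpublished=False):
    """Build a query string with one facet.field entry per requested facet."""
    params = [("q", text), ("facet", "true")]
    params += [("facet.field", field) for field in facet_fields]
    if only_unpublished:
        # Hypothetical boolean field marking content not yet on the web.
        params.append(("fq", "published:false"))
    return urlencode(params)


# A draft with metadata that would never appear on the public page.
drafts = [
    {
        "id": "draft-1",
        "title": "Hosted search pricing",
        "author": "alice",
        "tags": ["search", "solr"],
        "published": False,
        "internal_note": "awaiting review",
    }
]

payload = build_update_payload(drafts)
query = build_facet_query("search", ["author", "tags"], only_unpublished=True)
print(query)
```

Because the draft is indexed with its own fields, a search can be narrowed by author or tag before anything is published, which is exactly what a crawler-based web search cannot do.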

Or perhaps searching data that is used for other, non-web-publishing activities? I'm currently working on a book for web freelancers, covering everything you need to know to get started or just get better.

Hosted full text search solutions. Powerful, hassle-free full-text search for your app.

• Chromium, $50/mo: up to 5 managed indexes, up to 500,000 documents, shared cluster architecture, production replicated.
• Platinum, $100/mo: up to 10 managed indexes, up to 2,000,000 documents, shared cluster architecture, production replicated.
• Palladium, $200/mo: up to 20 managed indexes, up to 5,000,000 documents, shared cluster architecture.

These plans include:
• No setup fees
• Web-based index provisioning
• Customizable Solr schema
• Opinionated and economical shared cluster architecture
• Email & ticket support (Business Hours PST)

Apache, Apache Solr, and Solr are trademarks of the Apache Software Foundation.

Ruby Cloud Platform as a Service.