Google updates BigQuery with SQL-like queries, grouping of distinct values, and support for Timestamp data Google on Thursday announced a slew of new features for Google BigQuery, its service for quickly analyzing large amounts of data, to let analytic teams deliver what organizations really need: “actionable and data-driven business insights.” In short, Google has added new capabilities to help businesses work effectively with large amounts of data over a greater range of query and data types. Here are the three new features Google wants to highlight: Big JOIN (use SQL-like queries to join very large datasets at interactive speeds); Big Group Aggregations (perform groupings on large numbers of distinct values); and Timestamp (native support for importing and querying Timestamp data). The new Big JOIN feature gives users the ability to produce a result set by merging data from two large tables by a common key: you can skip a data transformation step by simply specifying JOIN operations using SQL. Google has also added the ability to add new columns to existing BigQuery tables.
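To make "merging data from two large tables by a common key" and "groupings on distinct values" concrete, here is a minimal in-memory sketch in Python. It only illustrates the two operations; it is not how BigQuery implements them, and the table contents, column names, and helper names (`hash_join`, `count_by`) are hypothetical.

```python
from collections import defaultdict


def hash_join(left, right, key):
    """Inner-join two lists of dict rows on a shared key column."""
    # Index the right-hand table by key so each left row is matched in O(1).
    index = defaultdict(list)
    for row in right:
        index[row[key]].append(row)

    joined = []
    for row in left:
        for match in index.get(row[key], []):
            merged = dict(match)   # copy the right-hand columns
            merged.update(row)     # then add the left-hand columns
            joined.append(merged)
    return joined


def count_by(rows, column):
    """Group rows by the distinct values of one column and count each group."""
    counts = defaultdict(int)
    for row in rows:
        counts[row[column]] += 1
    return dict(counts)


# Hypothetical sample tables, invented for illustration only.
orders = [
    {"user_id": 1, "item": "book"},
    {"user_id": 2, "item": "lamp"},
    {"user_id": 1, "item": "pen"},
]
users = [
    {"user_id": 1, "country": "US"},
    {"user_id": 2, "country": "DE"},
]

joined = hash_join(orders, users, "user_id")
per_country = count_by(joined, "country")
```

In SQL terms this corresponds roughly to joining `orders` to `users` on `user_id` and then grouping by `country`; the point of the announced features is that such queries can be written directly in SQL without a separate transformation step.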
Easy Facebook Scripting in Python « Heterogenous Mixture UPDATED: fbconsole Pypi Package and Github Repository Sometimes you just want to write a little script using Facebook’s api that updates your status, or downloads all your photos, or deletes all those empty albums you accidentally created. In order to streamline my writing of one-off facebook scripts, I created a micro api client that implements the client-side authentication flow and has a few utility functions for accessing the graph api and fql.
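As a rough idea of what a utility function for accessing the Graph API might do, the sketch below builds a request URL with the Python standard library. It does not use fbconsole and performs no authentication; `graph_url` is a hypothetical helper and the access token is a placeholder, so treat it as an assumption-laden illustration, not the package's actual interface.

```python
from urllib.parse import urlencode

GRAPH_BASE = "https://graph.facebook.com"


def graph_url(path, access_token, **params):
    """Build a Graph API URL for `path` (hypothetical helper, not fbconsole's API)."""
    query = dict(params)
    query["access_token"] = access_token  # placeholder token supplied by the caller
    # Sort the parameters so the resulting URL is deterministic.
    return "{}/{}?{}".format(
        GRAPH_BASE, path.lstrip("/"), urlencode(sorted(query.items()))
    )


photos_url = graph_url("/me/photos", "TOKEN", limit=5)
```

A real script would first obtain a token through the client-side authentication flow the post describes and then issue an HTTP request against a URL like this one.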
Rackspace Launches OpenStack-based Private Cloud Software: Enables Businesses to Install, Test and Run Private Clouds in Minutes Available as a free download, the software empowers customers to deploy private clouds San Antonio, TX – August 15, 2012 – Today, Rackspace® Hosting (NYSE: RAX), the open cloud company, announced the release of Rackspace Private Cloud Software, powered by OpenStack, making it simple and easy for companies to install, test and run a multi-node OpenStack-based private cloud environment. The software, code named “Alamo,” uses the same OpenStack compute platform, Nova, used to run Rackspace clouds and is available as a free download from the Rackspace website. This software is based upon Rackspace’s experience in deploying and operating OpenStack-based public and private clouds in a variety of environments including in Rackspace’s own datacenters as well as in external datacenters.
Enterprise IT Monitoring for Networks, Applications, Virtual Servers & the Cloud We pride ourselves on delivering powerful, easy-to-use network monitoring software which gives you deep visibility into your entire network infrastructure – backed by fantastic customer service and all at a highly affordable price. Built for both service providers and enterprise customers, Opsview offers flexible pricing and deployment models that can easily be aligned to your business need. Whether you’re monitoring 50 devices or 20,000, Opsview gives you the deep visibility required to monitor the health and performance of your network and make informed decisions to prevent costly downtime. Get up and running in minutes with Opsview Core open source network monitoring software (free), or if you need to monitor more complex business environments choose Opsview Pro network monitor software (for SMEs) or Opsview Enterprise network monitoring tools (for deployments monitoring up to 20,000 devices). Try out Opsview Core now, our free, open source network monitoring application.
ghettoVCB.sh - Free alternative for backing up VM's for ESX(i) 3.5, 4.x+ & 5.x This script performs backups of virtual machines residing on ESX(i) 3.5/4.x/5.x servers using a methodology similar to VMware's VCB tool. The script takes snapshots of live running virtual machines, backs up the master VMDK(s) and then, upon completion, deletes the snapshot until the next backup. This script has been tested on ESX 3.5/4.x/5.x and ESXi 3.5/4.x/5.x and supports the following backup mediums: LOCAL STORAGE, SAN and NFS. The script is non-interactive and can be set up to run via cron.
Deep Information Sciences Takes Big Data a Quantum Leap Forward PORTSMOUTH, N.H.--(BUSINESS WIRE)--Deep Information Sciences, Inc. (Deep) made its debut today with the introduction of its groundbreaking new general purpose database, DeepDB, and the announcement of $10 million in Series A funding to support the company’s strategy to transform the Big Data market. Enabling simultaneous transactions and analytics in real time on the same data set, DeepDB brings unprecedented scale, simplicity and speed to data-driven decision making at large organizations. “Our powerful technology, combined with our leadership bench strength, positions us for a fast path to market leadership in Big Data. We look forward to the opportunity to help our customers overcome the challenges of Big Data and seize competitive advantages through a next-generation approach to data management.”
Software defined storage Software-defined storage (SDS) is a term for computer data storage technologies which separate storage hardware from the software that manages the storage infrastructure. The software enabling a software-defined storage environment provides policy management for feature options such as deduplication, replication, thin provisioning, snapshots and backup. By definition, SDS software is separate from the hardware it manages. That hardware may or may not have abstraction, pooling, or automation software embedded.
Big Data and Data Science – what’s really new? Big Data is hype. It’s also a buzzword. Maybe a trend? Down-to-earth people could say it’s just mass data called “big”. Although there are many very large data warehouses in the BI world, data science seems obsessed with handling “big data – when the size of the data itself becomes part of the problem.”
RemoteStorage-2011.10 - Unhosted Web Community Group Introduction Adding WebFinger, OAuth and Cross-Origin Resource Sharing (CORS) to an online storage service makes it usable as per-user storage for web apps. This specification describes a common interface for such per-user online data storage.