background preloader

FUSE: Filesystem in Userspace

FUSE: Filesystem in Userspace
Related:  User interface design

YAGO2 - D5: Databases and Information Systems (Max-Planck-Institut für Informatik) Overview YAGO is a huge semantic knowledge base, derived from Wikipedia WordNet and GeoNames. Currently, YAGO has knowledge of more than 10 million entities (like persons, organizations, cities, etc.) and contains more than 120 million facts about these entities. YAGO is special in several ways: The accuracy of YAGO has been manually evaluated, proving a confirmed accuracy of 95%. Every relation is annotated with its confidence value.YAGO combines the clean taxonomy of WordNet with the richness of the Wikipedia category system, assigning the entities to more than 350,000 classes.YAGO is an ontology that is anchored in time and space. YAGO is developed jointly with the DBWeb group at Télécom ParisTech University.

Deploying a Django App on Dokku - Gun.io Guest post written by Michael Herman, hacker at Gun.io and co-founder of Real Python, about deploying a Django app on Dokku. What is Dokku? A few days ago I was pointed towards the project Dokku. Dokku is a "Docker powered mini-Heroku" that you can deploy on your own server to serve as your own private PaaS. Why would you want your own mini-Heroku? Dokku ties Docker together with Gitrecieve and Buildstep into one package that is easily deployed, forked/hacked, and updated. What You Need To Get Started You could use anything from AWS to a computer on your own private network. The requirements for hosting Dokku are simple: Ubuntu 13.04 x64SSH Capabilities, andJeff suggests having a domain name on hand so it's easy to point to your apps. First, I signed up for an account on DigitalOcean. Next, I created my first "droplet" (spun up a node) by clicking "Create Droplet". Installing Dokku Now that our host is all set up it's time to install and configure Dokku. For example, I used: Finally, PUSH to:

Ureadahead Ureadahead (Über-readahead) is used to speed up the boot process. It works by reading all the files required during boot and makes pack files for quicker access, then during boot reads these files in advance, thus minimizes the access times for the harddrives. It's intended to replace sreadahead. Requirements Ureadahead needs a kernel patch to work, which is no longer available on the AUR. The user-space package is called ureadahead. How it works When run without any arguments, ureadahead checks for pack files in /var/lib/ureadahead, and if none are found or if the packfiles are older than a month, it starts tracing the boot process. Otherwise, if the file is up to date, it just reads the pack file in preparation for the boot. It works for both SSDs and traditional harddrives and automatically optimizes the pack files depending on which you have. Using ureadahead First you need the patched kernel. ureadahead() { /sbin/ureadahead --timeout=240 & } add_hook sysinit_end ureadahead Configuration

Named Entity Demo About the Named Entity Demo Named entity recognition finds mentions of things in text. The interface in LingPipe provides character offset representations as chunkings. Genre-Specific Models Named entity recognizers in LingPipe are trained from a corpus of data. The examples below extract mentions of people, locations or organizations in English news texts, and mentions of genes and other biological entities of interest in biomedical research literature. Language-Specific Models Although we're only providing English data here, there is training data available (usually for research purposes only) in a number of languages, including Arabic, Chinese, Dutch, German, Greek, Hindi, Japanese, Korean, Portuguese and Spanish. LingPipe's Recognizers LingPipe provides three statistical named-entity recognizers: Sentence Annotation Included The demos use the appropriate sentence models. Named Entity XML Markup First-best output N-best output Per tag confidence output Named Entity Demo on the Web

Working with Schema Names in Entity Framework Code-First Design | Entity Framework content from Dev Pro Related: "Improve Performance with Entity Framework 5" and "Thoughts on Microsoft's Entity Framework." In my last column about Entity Framework's code-first design feature, I explained to readers about my session at PASS Summit 2011. One of the questions I didn't have time to explore in my session is how you can work with database schema names other than being the database owner (DBO). On one hand, the options aren't very obvious, but on the other it's a capability that would be hard to imagine the Entity Framework developers overlooking. Fortunately this capability wasn't overlooked, and after you see how the basic feature works, schema names become pretty easy to work with. I'll use the same sample application from last month's column, Ordering.sln, which is a simple ordering system with Customer, Address, and Order entities. Figure 2: Database Schema in SQL Server Management Studio Figure 4: Updated Database Schema So far, so great. Voilà! Figure 5: Final Schema

protobuf - Protocol Buffers - Google's data interchange format What is it? Protocol Buffers are a way of encoding structured data in an efficient yet extensible format. Google uses Protocol Buffers for almost all of its internal RPC protocols and file formats. Latest Updates Documentation Read the documentation. Discussion Visit the discussion group. Quick Example You write a .proto file like this: message Person { required int32 id = 1; required string name = 2; optional string email = 3;} Then you compile it with protoc, the protocol buffer compiler, to produce code in C++, Java, or Python. Then, if you are using C++, you use that code like this: Person person;person.set_id(123);person.set_name("Bob");person.set_email("bob@example.com"); fstream out("person.pb", ios::out | ios::binary | ios::trunc);person.SerializeToOstream(&out);out.close(); Or like this: Person person;fstream in("person.pb", ios::in | ios::binary);if (! For a more complete example, see the tutorials.

Entity Extractor SDK Finds People, Places, and Organizations in Text Big Text represents the vast majority of the world’s big data. Lying hidden within that text is extremely valuable information, unable to be accessed unless read manually—a challenge compounded when foreign languages are involved. This hidden data often comes in the form of entities—names, places, dates, and other words and phrases that establish the real meaning in the text. Rosette® Entity Extractor (REX) instantly scans through huge volumes of multilingual, unstructured text and tags key data. As linguistics experts with deep understanding at the intersection of language and technology, Basis Technology continually improves the Rosette product family with language additions, feature updates, and the latest innovations from the academic world.

Create web forms and calculators for ASP.NET from Excel-SpreadsheetConverter Create calculating and good-looking web pages and web forms in minutes using Microsoft Excel. Convert to ASP.NET or classic ASP format with SpreadsheetConverter. Receive submitted forms directly in your Inbox. Let a business expert create a calculator or web form as an Excel spreadsheet including all necessary formulas. Either use the page as it is or open it in Visual Studio for further processing. While the user moves on to edit the next field in the form, the server dynamically instructs the browser to update any fields or charts that were affected by the previous data entry. See examples of what SpreadsheetConverter can do for you. Free offer: Send us a spreadsheet and we’ll send it back as a web page. It’s so easy, you can do it yourself! SpreadsheetConverter ASP.NET lets you connect your website to internal IT resources behind the firewall. A time-saving calculator makes your web site stand out from the competition. Example: Simple home loan calculator with slider and live chart

e4rat e4rat (ext4 – reduced access time) est un outil permettant d’accélérer le démarrage de votre distribution Ubuntu en déplaçant certains fichiers de démarrage en début du disque dur réduisant considérablement le temps de démarrage. C'est donc une alternative à ureadahead utilisé par défaut par Ubuntu. Attention, cet outil n'est pas officiellement supporté par Ubuntu et modifie en profondeur votre système: utilisez-le à vos risques et périls. Pour configurer e4rat, il vous faut redémarrer votre ordinateur et lorsque le menu de grub-pc apparaît, appuyez sur la touche “e” pour l'éditer. À la fin de la ligne kernel /vmlinuz26 root=/dev/disk/by-uuid/… ou de la ligne linux /boot/vmlinuz-… ajoutez ceci: init=/sbin/e4rat-collect et appuyez sur <Ctrl> + <X> pour lancer Ubuntu avec la nouvelle option. Une fois dans votre session et pendant 2 minutes, e4rat va collecter et enregistrer dans le fichier /var/lib/e4rat/startup.log tout ce que vous faites comme lancer Firefox, Thunderbird, … single

GitSetup < Main < TWiki Git repositories allow for many types of workflows, centralized or decentralized. Before creating your repo, decide which steps to follow: Create A Local Repository If you will be working primarily on a local machine, you may simply create a git repo by using cd to change to the directory you wish to place under version control, then typing: git init To initialize a git repo in that directory. Create A Remote Repository If you will be working with your code primarily on patas, you will likely want to create your initial repository there. ssh to patas.ling.washington.edu cd to the directory you wish to place under version control type "git init" in this directory. Cloning the Remote Repository If you wish to maintain a local copy of your code, you can clone the repository from patas by doing the following: Create a Shared Repository on Patas If you will be working with a group, using Patas as a centrally-located server to coordinate your checkins is a good idea, but takes some setting up.

Getting Started This tutorial will teach you the basics of building an ASP.NET MVC 5 web app using Visual Studio 2013. Download the completed project. This tutorial was written by Scott Guthrie (twitter@scottgu ), Scott Hanselman (twitter: @shanselman ), and Rick Anderson ( @RickAndMSFT ) You need an Azure account to deploy this app to Azure: You can open an Azure account for free - You get credits you can use to try out paid Azure services, and even after they're used up you can keep the account and use free Azure services.You can activate MSDN subscriber benefits - Your MSDN subscription gives you credits every month that you can use for paid Azure services. Getting Started Start by installing and running Visual Studio Express 2013 for Web or Visual Studio 2013. Visual Studio is an IDE, or integrated development environment. Creating Your First Application Click New Project, then select Visual C# on the left, then Web and then select ASP.NET Web Application. Click F5 to start debugging.

Bootchart Lucene - Apache Lucene Core Apache LuceneTM is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform. Apache Lucene is an open source project available for free download. Please use the links on the right to access Lucene. Lucene offers powerful features through a simple API: Scalable, High-Performance Indexing over 150GB/hour on modern hardwaresmall RAM requirements -- only 1MB heapincremental indexing as fast as batch indexingindex size roughly 20-30% the size of text indexed Powerful, Accurate and Efficient Search Algorithms Cross-Platform Solution Available as Open Source software under the Apache License which lets you use Lucene in both commercial and Open Source programs100%-pure JavaImplementations in other programming languages available that are index-compatible The Apache Software Foundation

Related: