background preloader

Data Mining Software

Facebook Twitter

Weka 3 - Data Mining with Open Source Machine Learning Software in Java. Weka is a collection of machine learning algorithms for data mining tasks. It contains tools for data preparation, classification, regression, clustering, association rules mining, and visualization. Found only on the islands of New Zealand, the Weka is a flightless bird with an inquisitive nature. The name is pronounced like this, and the bird sounds like this. Weka is open source software issued under the GNU General Public License. We have put together several free online courses that teach machine learning and data mining using Weka.

Weka supports deep learning! Weka - home.

Machine Learning

Visualisation. Graph. Graph Software. Text Analysis. Понятия и термины. Text analyzis software. xMarkup. Minicorpus. Предварительные замечания Предлагаемая методика использования xMarkup, mystem и MS Access позволяет выполнять следующее: автоматическое разделение текста на абзацы, абзацев – на предложения, а предложений – на слова (словоупотребления);автоматическое получение частотного списка словоупотреблений;автоматизированная лемматизация словоупотреблений (получение списков <словоупотребление> – <лексема><лексема> – <часть речи>);автоматическое получение частотного списка лексем;автоматизируемое присвоение семантических категорий лексемам или выделение лексико-семантических вариантов одной лексемы;автоматическое построение картотек и конкордансов для словоупотреблений, лексем, слов одной части речи или семантической категории;автоматизированное выделение грамматически связанных сочетаний слов, расположенных как контактно, так и дистантно. Пояснения по поводу порядка работы с текстом Обработка исходного текста Более чем вероятно, что вы обнаружите нужный вам текст на сайте библиотеки Мошкова.

В начало. Xmwin-setup-2.1.7.

Pymorphy

Лемматизатор Mystem. Документация. Загрузить mystem для некоммерческого использования. Simple Concordance Program. Niederländische Philologie FU Berlin. TextSTAT is a simple programme for the analysis of texts.

Niederländische Philologie FU Berlin

It reads plain text files (in different encodings) and HTML files (directly from the internet) and it produces word frequency lists and concordances from these files. This version includes a web-spider which reads as many pages as you want from a particular website and puts them in a TextSTAT-corpus. The new news-reader, too, puts news messages in a TextSTAT-readable corpus file. TextSTAT reads MS Word and OpenOffice files. No conversion needed, just add the files to your corpus... Documentation: For a first introduction to TextSTAT, please refer to the Quickstart Guide to text analysis with TextSTAT from the 'Humanities Resource Centre' at Princeton University.

There is also a nice video tutorial by Zarah Weiß, available via YouTube. NEW: TextSTAT 3 (beta) There are some drastical changes in this new version of TextSTAT, most of them internal. TextSTAT now works with Python 2 (>= 2.7) and - even better - with Python 3 (>= 3.4). Corpus Linguistics: Investigating Language Structure and Use (Cambridge Approaches to Linguistics): Books: Douglas Biber,Susan Conrad,Randi Reppen. Text Content Analyser. Generate text statistics and analyse the content of a text.

Text Content Analyser

Use our free text analysis tool to generate a range of statistics about a text and calculate its readability scores. Adam "I simply wanted to thank you on the great text analyser and its robustness. It has certainly sped up my work. " Manar "I've been using this website for a long time and it's always been the best place to answer all my questions, so thanks so much for your help that deserves millions of thanks. " Get Detailed Text Statistics. Our text analyser will show you statistics about your text to help you understand its complexity and readability. Word CountUnique WordsNumber of ParagraphsNumber of SentencesWords per SentenceNumber of CharactersCharacters per WordNumber of SyllablesSyllables per Word Test Your Readability.

WordSmith Tools. TextAnalyst. CLOC collocations concordance and wordlists.

Обзоры ПО по анализу текста