Natural Language Processing *

Computer analysis and synthesis of natural languages


SergeyBPshenichnikov Mar 28 2021 at 19:09

Converting text into algebra

10 min

1.5K

Search engines*Semantics*Algorithms*Natural Language Processing*

Translation

Algebra and language (writing) are two different learning tools. When they are combined, we can expect new methods of machine understanding to emerge. To determine the meaning (to understand) is to calculate how the part relates to the whole. Modern search algorithms already perform the task of meaning recognition, and Google's tensor processors perform the matrix multiplications (convolutions) that an algebraic approach requires. At the same time, semantic analysis relies mainly on statistical methods. Using statistics in algebra, for instance when looking for divisibility rules for numbers, would simply be strange. An algebraic apparatus is also useful for interpreting the results of calculations when recognizing the meaning of a text.

snakers4 Dec 5 2020 at 12:55

Playing with Nvidia's New Ampere GPUs and Trying MIG

11 min

3.9K

Image processing*Big Data*Machine learning*Computer hardware*Natural Language Processing*

Every time the essential question arises of whether or not to upgrade the cards in the server room, I look through similar articles and watch such videos.

The channel with the aforementioned video is very underrated, but its author does not deal with ML. In general, when analyzing comparisons of accelerators for ML, several things usually catch the eye:

The authors usually take into account only the "adequacy" of new cards for the US market;
The ratings are out of touch with ordinary users and are made on very standard networks (which is probably good overall) without details;
The popular mantra of training ever more gigantic models makes its own adjustments to the comparison.

The answer to the question "which card is better?" is not rocket science: cards of the 20* series did not become very popular, while the 1080 Ti from Avito (the Russian Craigslist) is still very attractive (and, oddly enough, does not get cheaper, probably for this reason).

All this is fine and dandy, and the standard benchmarks are unlikely to lie too much, but I recently learned about Multi-Instance GPU (MIG) technology for A100 cards and native TF32 support on Ampere devices, and I got the idea to share my experience of actually testing cards on the Ampere architecture (3090 and A100). In this short note, I will try to answer the following questions:

Is the upgrade to Ampere worth it? (spoiler for the impatient: yes);
Is the A100 worth the money? (spoiler: in general, no);
Are there any cases where the A100 is still interesting? (spoiler: yes);
Is MIG technology useful? (spoiler: yes, but for inference and for very specific training cases).

veesot Jul 7 2020 at 10:06

How to find an English teacher. Part 2

4 min

889

Python*Programming*Data visualization*Machine learning*Natural Language Processing*

This is a continuation of the story about using data science to find an English teacher. If you have not read the first part yet, this is an opportunity to become familiar with it.

Briefly: we had information about language teachers and tried to apply some basic ideas using pandas and our own expectations. Unfortunately, we got stuck on the third step, because there was not enough information to satisfy our last requirement: we need no more than 3 candidates at the end.

Disclaimer

This approach is based on my own experience and may not suit your point of view, ideas, or principles.

veesot Jul 3 2020 at 15:33

How to find an English teacher. Part 1

5 min

1.6K

Python*Programming*Data Mining*Data visualization*Natural Language Processing*

In the modern world, ideas about using data science for extra benefit keep appearing. For instance, Google can use the history of watched videos to recommend new ones. Online shops use recommendation systems to increase your order total. However… if companies use data for their benefit, could we do the same for our own needs, such as looking for an online English teacher?

Disclaimer

This approach is based on my own experience and may not suit your point of view, ideas, or principles.

veesot Nov 9 2019 at 13:16

Machine Learning for your flat hunt. Part 3: The final push

7 min

2.1K

Python*Programming*Data Mining*Machine learning*Natural Language Processing*

Photo by Dugan Arnett on Boston Globe

Are you still looking for a new flat? Ready to make the last attempt? If so, follow me and I will show you how to reach the finish line.

SergKremen1984 Oct 6 2019 at 14:52

Keyword Tree: graph analysis for semantic extraction

3 min

1.7K

Data visualization*Machine learning*Natural Language Processing*

This post is a short summary of full-scale research focused on keyword recognition. The semantic extraction technique was initially applied in social media research on depressive patterns. Here I focus on the NLP and math aspects, without psychological interpretation. It is clear that analyzing single-word frequencies is not enough. Randomly shuffling the collection, even many times, does not affect the relative frequencies but totally destroys the information: the bag-of-words effect. We need a more accurate approach to mining semantic attractors.
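The bag-of-words effect mentioned in this teaser can be illustrated with a minimal sketch. The sample sentence, the `relative_frequencies` helper and the fixed random seed are all assumptions made for illustration; none of them come from the original research.

```python
import random
from collections import Counter


def relative_frequencies(tokens):
    """Return each word's share of the total token count (hypothetical helper)."""
    total = len(tokens)
    counts = Counter(tokens)
    return {word: count / total for word, count in counts.items()}


# Illustrative sample text, not taken from the research itself.
text = "the dog bit the man and the man ran"
tokens = text.split()

# Shuffle a copy of the tokens; a fixed seed keeps the run reproducible.
shuffled = tokens[:]
random.Random(0).shuffle(shuffled)

# The word order changes, but the relative frequencies stay identical.
same_frequencies = relative_frequencies(tokens) == relative_frequencies(shuffled)
print(same_frequencies)
print(" ".join(shuffled))
```

The two frequency tables are equal because shuffling only permutes the tokens, so any analysis based solely on single-word frequencies cannot tell the original sentence from its scrambled version.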