Posts

Posts from December 2017

Wandex or Nine things you didn't know about search engines

Figure: the Big Three. Or...? We take search engines for granted. They exist because they must exist. Without them, finding relevant and reliable information among millions of web pages would be an almost impossible task. Here are nine facts about search engines that you most likely did not know. 1. Invented in 1936? The idea that eventually led to the invention of hypertext, along with the argument for developing fast retrieval of data from information stored this way (the equivalent of today's search engines), was published in 1945 by the American engineer and science administrator Vannevar Bush. His essay "As We May Think" may have been written as early as 1936. It introduced the concept of the memex, a memory-extending device, and contained original ideas that eventually evolved into the Internet. 2. The Magic Automatic Retriever of Text The first real search engine was created in the 1960s by Gerard Salton.

Justifying blocks evenly across the width

Continuing my "CSS digging", I had a new idea: to examine another relevant topic, the even justification of blocks across the width. I have already published my detailed research on my blog, but since the Habr community liked my previous work, I decided to write a short overview here so that no Habr reader would miss it. So, in the words of Yuri Gagarin: "Let's go." In layout work, there are times when you need to justify a list across the width of the screen. The items in the list should be spaced evenly, with the outermost items flush against the container edges and equal gaps between them. The figure shows the items justified across the width, flush against the side walls, with equal spacing between them. How does it work? In essence, we need the same result that text-align: justify produces with text. Since the behavior of our blocks is very similar to the result of the alignment of
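The spacing described in this excerpt comes down to simple arithmetic: the free space left in the container is split equally between neighbouring items. Below is a minimal Python sketch of that calculation only; it is not how a browser implements text-align: justify, and the function name and the sample widths are made up for illustration.

```python
def justified_positions(container_width, item_widths):
    """Return (gap, left offsets) for items spread evenly across a container.

    The first item touches the left edge, the last item touches the right
    edge, and every gap between neighbours is the same.
    """
    if len(item_widths) < 2:
        raise ValueError("need at least two items to justify")
    free_space = container_width - sum(item_widths)
    if free_space < 0:
        raise ValueError("items do not fit into the container")
    gap = free_space / (len(item_widths) - 1)

    positions = []
    x = 0.0
    for width in item_widths:
        positions.append(x)
        x += width + gap
    return gap, positions


# Hypothetical example: four 100px blocks in a 600px container.
gap, positions = justified_positions(600, [100, 100, 100, 100])
print(gap, positions)
```

With these sample numbers the 200px of free space is divided into three equal gaps, and the last block ends exactly at the right edge of the container.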

Habr, karma, and practical cybernetics

Following recent topics (here and here) and reading the reaction to the change in the karma calculation algorithm, a thought unexpectedly occurred to me: what an interesting experiment this turned out to be. A self-organizing system (Habr) was knocked out of equilibrium (the previously familiar distribution of money) by an external influence (a change in how karma is calculated and a general reduction of its level). The system responded with an effort to restore the lost balance (Habr users actively upvoted one another, aiming to get as close as possible to the old, familiar karma values). Habr behaves like a single artificial organism. Homeostat A homeostat is a self-organizing system that models the ability of living organisms to keep certain quantities (e.g. body temperature) within physiologically acceptable bounds. The homeostat is capable of self-organization, that is, it can to a certain extent learn and adapt its behavior toward a stable balance with the envi

PageRank predicts Nobel prize winners

Ranking scientists by the number of citations to their work is a thankless job. Anyone can name several weaknesses of this system. 1. Not all citations are equal. The importance of the citing work is an important factor. 2. Scientists in different fields use citations and references differently. A paper in the life sciences is cited six times, a paper in physics three times, and a paper in mathematics only once. 3. Breakthrough papers may be cited less often because they concern niche research areas at an early stage of development. 4. Important papers often stop being cited once they make it into textbooks. The pattern of cross-references between scientific papers forms a complex network similar to the network of hyperlinks on the Internet. Could this be the key to finding a better way to assess the importance of a particular paper? Sergei Maslov of Brookhaven National Laboratory in New York and Sidney Redner of Boston University asked themselves the same quest
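To make the analogy between citations and hyperlinks concrete, here is a minimal Python sketch of the standard textbook PageRank iteration applied to a tiny, invented citation graph. It is an illustration under assumptions: the paper names, the damping factor of 0.85, and the handling of papers that cite nothing are choices made for this sketch, and it does not claim to reproduce the exact variant used in the study the excerpt mentions.

```python
def pagerank(citations, damping=0.85, iterations=50):
    """Textbook PageRank over a citation graph.

    citations maps each paper to the list of papers it cites.
    Papers that cite nothing spread their rank evenly over all papers.
    """
    papers = set(citations)
    for cited in citations.values():
        papers.update(cited)
    papers = sorted(papers)
    n = len(papers)

    rank = {p: 1.0 / n for p in papers}
    for _ in range(iterations):
        new_rank = {p: (1.0 - damping) / n for p in papers}
        for p in papers:
            targets = citations.get(p, [])
            if targets:
                share = damping * rank[p] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                share = damping * rank[p] / n
                for t in papers:
                    new_rank[t] += share
        rank = new_rank
    return rank


# Hypothetical graph: A and B cite C, and C cites D.
ranks = pagerank({"A": ["C"], "B": ["C"], "C": ["D"], "D": []})
for paper, score in sorted(ranks.items(), key=lambda kv: -kv[1]):
    print(paper, round(score, 3))
```

The point of the sketch is that a citation from a well-ranked paper counts for more than a citation from an obscure one, which addresses the first weakness in the list above.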

Habr etiquette

This set of rules was drafted after the discussion "What is bad?" on Habr. It mostly focuses on rules that reflect the reasons for which one may and may not change karma and message ratings. These rules are not meant to be enforced by order; they are rules of etiquette, and to some they may seem obvious, but they are still worth reading and following, which is not that hard. The set of rules is not final; moreover, it is not even an alpha version, and I hope that our synergy and collective intelligence will help us extend and modify this list. Habr etiquette Any person has the right to use the rating system for messages and people, but has no right to restrict the personal freedom of others. Do not abuse your privileges. You need to understand that a minus is very different from a plus. A minus oppresses a person and does not help them grow. Give a plus when you like something, but think before giving a minus if something is not like

A collection of Nigma features, 2009

It is time to celebrate the New Year and take stock of the year gone by. For Nigma.ru, 2009 was even more productive than 2008. We developed and launched more than a dozen useful new services and, in response to your numerous requests, updated the design :)) We take this opportunity to congratulate all Habrahabr users on the coming year 2010! We wish you new achievements and peace of mind! It is time to see off the old year, so here is a collection of the main Nigma features of 2009: 1. Search for chains of chemical reactions: Ag -> X -> AgNO3 -> X -> Ag(NH3)2OH -> X -> Ag. 2. Miracle auto-completion, which answers a question even before the user has finished typing it into the search bar. 3. A new site indexing algorithm that can extract structured information. 4. A chemical equation balancer that automatically places the coefficients in chemical reactions. 5. Crisis product search: the list of goods in

[42]magnets: a one-day startup

Entertainment in the format of a One-Day Project or Funky Friday is gaining popularity among geeks. The idea is to develop and release a project in a very short time. The benefit is not only that you get to feel like a real startup and test your own and your team's strength. This format also lets you test ideas and find the pitfalls before you devote yourself completely to a project. An additional bonus is that you learn to break an idea into parts and single out the Most Important Thing. In one day it is impossible, and unnecessary, to implement every idea; you need to concentrate on the essentials to finish on time. For us this was the second project developed in this format. With the first, we did not come even close to meeting the deadline. It taught us to assess our strength better and to choose our tools. We approached the second idea more seriously. The idea was worked out in advance, the minimum functionality was defined

The Chinese government launches its own search engine

On Thursday, the Xinhua News Agency and China Mobile announced a deal to create a new search engine. Both companies are state-owned. China Mobile is still the largest and most valuable mobile operator in the world, with 508 million subscribers. Naturally, it is also highly valued on the New York and Hong Kong stock exchanges. Xinhua is a news agency that reflects the official policy of the Party and reports directly to the Propaganda Department of the Communist Party. The government under Hu Jintao has already criticized national television channels for broadcasting what it considers narrow-minded programs that encourage a craving for material wealth and frivolous behavior. However, these programs are very popular with the public, as their high ratings show (advertising has not been banned in China). Many Chinese use the domestic search engine Baidu and, to a lesser extent, Google. The latter is used by those who are bolder and

Google's competitors in Europe: Yandex and Seznam

A few weeks ago, the Czech online industry was suddenly abuzz. Google had allegedly overtaken the search engine Seznam in market share. Given that Google dominates almost all of Europe, the Czech Republic and Russia are in a unique position. In these countries, Google is not the leader in search. Instead, local search engines hold the leading positions. Seznam disputed the reports that dominance had passed from Seznam to Google. And it seems that it is quite right. The figures were provided by the web analytics service Toplist and are based on traffic to Toplist sites. This method is not an entirely correct way to calculate market share. However, Toplist's quickly circulated press release attracted a lot of attention. It also raised another interesting question: what allows Seznam and Yandex to be among the few that can stand up to Google in their regions? Let's look at both search engines. Seznam First, let's look at the Czech giant. What makes it

Buying as a game

Remember the boom in penny auctions a couple of years ago? For those who don't know, this is a type of auction where you pay for each bid, and with each bid the price of the item either decreases or increases (while the starting price is very low and a timer is running). By now most of them have lost popularity. That is understandable: there is no way to guarantee the store's honesty, that is, the absence of artificial bids. In other words, you place a bid, and whether you will actually be allowed to buy is unknown. So, smart people figured out how to do everything openly and honestly. At lowbuy.ru you pay not for a bid but for the opportunity to learn the price of the product (which is guaranteed to be below the known starting price). Then you have 15 seconds to make a decision. If you buy, the product is yours, with no conditions, unlike in those same penny auctions. Part of the price of each bid reduces the cost of the item. And so on until the item is bought. Yes, there is a game element. But all

Google Analytics and Google Webmaster Tools add social network statistics

And I must say, just in time: many webmasters had to invent all sorts of "crutches" to find out how many visitors come to a site from social networks. Now developers at Google have created an interesting tool (how convenient and effective it is, time will tell) that makes it possible to track this without any problems. Google Webmaster Tools now has a "+1 Metrics" section, which shows the effect of "+1" buttons on search results. The new Analytics can also show the button's effect on indicators such as CTR. Of course, the information is displayed not only as text but also as graphs, which is very convenient. In addition, Google has added the "Activity report" and "Audience report" sections to Webmaster Tools. The first shows how many "+1" button clicks were made on the pages of your website. The second shows a report on users, including demographic and geographic characteris

The largest database in the world belongs to Yahoo! And it runs on PostgreSQL.

Yahoo claims that it has broken a world record by creating the largest and busiest database in the world! The database, launched a year ago, has reached 2 petabytes in size. The system is designed for analytical purposes and holds the history of web users' behavior (data on half a billion users is reportedly stored each month). In addition, the Internet giant states that this is not only the largest database in the world but also the busiest: it records information about 24 billion events per day. And now for the most interesting part. This monster is managed by a modified PostgreSQL. This is the result of the acquisition of the startup Mahat Technologies, which originally worked with PostgreSQL, the most advanced open source database system. The "Postgres" code was modified to work with such huge volumes of information (one of the major changes: a focus on columnar storage instead of the traditional row-based storage, which slows down disk writes but provid

What does the force of innovation overcome?

Many correct words and wonderful thoughts have been devoted to describing the conditions for successful innovation. But the question of the nature of resistance to innovation has, in my opinion, been unfairly overlooked. From this vagueness of views come many funny and sad consequences: inventors rely on the power of ideas, investors expect proposals with obvious advantages, and successful implementers call the replication of something existing innovation... It seems to me that no force hinders innovation other than the inertia of consciousness. Accordingly, the only object the innovator needs to interact with is the minds of other people. Why do janitors sweep paths? Even in our time there are still conscientious janitors. And there are far more citizens who are always late for something and have absolutely no time to walk along the sidewalks. Whether they remember from school that the hypotenuse is shorter than the sum of the other two sides, or whether intuition tells them t

The collapse of Yandex shares, a letter to investors, and Chrome

Unfortunately, an interesting story passed Habr by: the first major collapse of Yandex's share price, on Wednesday. Let's fill in the gap. Shares of the domestic search engine fell by 13.1% to US$22.9 per share, almost half the peak market price of $42.01 per share reached after the triumphant IPO in May of this year. The decline in Yandex shares was especially noticeable against the background of rising U.S. stocks. Payback for honesty The collapse was preceded by a meeting between Yandex management and representatives of several hedge funds, who asked, among other things, about Yandex's falling share of the search market in Russia (according to Liveinternet, Yandex's share of queries fell in September to 61.4% from 63%). The hedge fund representatives did not like what they heard and began selling Yandex shares. Immediately afterwards, Yandex sent shareholders a letter in which it said that the

Remart: how I built my business

Part III: So, after a long journey in search of ideas and investors, and after making our decisions, we have reached the most difficult and interesting stage of our project. There is no way back, only forward. The most important thing now is to bring in the right people, that is, to build a great team. How do you organize a team and infect all its members with a common purpose? I was constantly tormented by the question: "Why don't they understand?" First you need to push yourself and work hard to get a result, and only then can you think about relaxing. Obvious! Yet not many think so; I would even say a minority. It turns out that most people want everything right now. They could not care less that you are just starting out, that you do not have the resources to turn their lives into a fairy tale, and that they need to cut back on expenses for a while. There is a concept called "uncertainty avoidance", and it turns out that most people are inclined to avoid this uncertainty. Wor

16% of each day's search queries are new

In one of its presentations, Google published an interesting figure. It turns out that every day the search site handles 16% new search queries that it has never seen before. Where do they come from, one might ask, if all the words in the dictionary, with all possible typos and all combinations, have already been exhausted? But if you think about it, it is easy to see where they come from. First, a couple of hundred languages other than English contribute. Second, users very often look for specific information with long, unique search queries (addresses of specific establishments, names and biographies, quotes from songs and literary works, pieces of code, and much more). Finally, there are meaningless keystrokes that accidentally reach the search engine, as well as unfinished queries from dynamic search, which updates results as the user types. Perhaps the statistics count not the total volume of queries but distinct queries. For example, if one million people se

Trigram index, or "search with errors"

One day at work, the need arose to add to the site search, alongside all the usual features, a "Did you mean..." service, or "search with errors". I started thinking about how to implement it. I did not want to use third-party services and APIs, because of the round-trip time to someone else's servers, and in general it is not a great approach. Conveniently, there was the pg_trgm module, which finds words similar to the query word using a trigram index. Implementation To begin with, how it is used. To search for similar words, you need to create a list of valid options. Create a table with a text field, on which we will later put a trigram index. CREATE TABLE "public"."tbl_words" ( "word" VARCHAR(255) NOT NULL ) WITHOUT OIDS; Fill the table in any way you like; for ourselves, we settled this question as follows: Zaliznyak's dictionary (~90 thousand words), dictionary of Russi
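To show what "similar by trigrams" means, here is a small Python sketch of the idea behind trigram matching: split each word into three-character pieces and compare the sets. It only approximates what pg_trgm does (the padding with two leading spaces and one trailing space and the 0.3 threshold follow the module's documented behaviour for simple words, but this is not its actual code), and the function names and the sample word list are invented for illustration.

```python
def trigrams(word):
    """Set of three-character pieces of a word, padded pg_trgm-style."""
    padded = "  " + word.lower() + " "
    return {padded[i:i + 3] for i in range(len(padded) - 2)}


def similarity(a, b):
    """Share of trigrams the two words have in common (0.0 to 1.0)."""
    ta, tb = trigrams(a), trigrams(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def suggest(query, dictionary, threshold=0.3):
    """Dictionary words similar enough to the query, best match first."""
    scored = [(similarity(query, word), word) for word in dictionary]
    return [word for score, word in sorted(scored, reverse=True)
            if score >= threshold]


# Hypothetical word list standing in for the dictionary table.
print(suggest("serch", ["search", "table", "index"]))
```

In the database itself the same comparison is done by the module, and the trigram index is what keeps the lookup fast on a dictionary of tens of thousands of words.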

Designing search engine algorithms: the path to success in building and optimizing websites

Introduction The easiest way to develop methods for promoting and developing a site for a specific search engine (SE) is to develop your own SE. I am not talking about implementing complex algorithms; we need an abstracted solution. You can simply imagine a simplified model of the algorithm and work with it. It is important to try to derive all the associated parameters, for example the estimated implementation time, the server load, and the running time of the algorithms. By estimating these parameters, you can obtain much more information and use it to your advantage. Most novice webmasters and SEOs start from "I want". I want the SE to give great weight to all links, I want non-unique content to be indexed well, and so on. But in reality things are different, and many consider this wrong. At the same time, they do not think about what the Internet would be like if all their suggestions worked, especially at scale. There is only one way to fight this: the tran

A Tokyo court decided to block Google search suggestions

Today it became known that a Tokyo court has ruled to prohibit Google's search suggestions for Japanese users. According to the court, such suggestions directly violate the laws on Japanese citizens' right to privacy. Interestingly, the proceedings began with an incident that seemed insignificant at first glance. A Japanese man filed a complaint against Google because, when his name was entered in the search box, autocomplete linked his name to a murder committed by another person with the same surname. As a result, the man was dismissed and, moreover, could not find a new job. Every time his name was typed into the search engine, the same suggestion popped up. The man grew so tired of all this that he went to court. Japanese lawyers checked all possible search combinations associated with his name and found that the ill-fated suggestion really does appear together with the name of the plaintiff. The Tokyo court si

A billion tables?!

Thanks to Selena Deckelmann's presentation at pgCon, some of us got drawn into a discussion about how many tables PostgreSQL can theoretically handle. We hastily wrote a script in a bold attempt to create one billion tables of the following form: CREATE TABLE tab_### ( id SERIAL NOT NULL PRIMARY KEY, value TEXT NOT NULL ); Note that such a definition will, among other things, also create a billion sequences, indexes, and constraints, and two billion columns. The Perl script ran on GoGrid cloud hosting in 4 parallel processes. It worked tolerably well, producing about 300,000 tables per hour, until the disk space ran out. Since 100,000 empty tables take up almost 2 GB of disk space, after almost 3 million tables had been created the server came to a sticky end. After that, PostgreSQL would start only with fsync disabled: fsync=off Who would have thought
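The original script was written in Perl and is not shown in the excerpt, so the following is only a Python sketch of the same idea: generate the CREATE TABLE statements in batches and estimate the disk space from the figures quoted above. The naming scheme tab_1, tab_2, ... is an assumption standing in for tab_###, and the estimate treats "almost 2 GB per 100,000 tables" as exactly 2 GB, so it is an upper bound.

```python
def create_table_sql(number):
    """DDL for one table of the form shown in the excerpt."""
    return (
        f"CREATE TABLE tab_{number} "
        "(id SERIAL NOT NULL PRIMARY KEY, value TEXT NOT NULL);"
    )


def batch_sql(start, count):
    """Statements for tables start .. start + count - 1, one per line."""
    return "\n".join(create_table_sql(n) for n in range(start, start + count))


def estimated_disk_gb(tables, gb_per_100k=2.0):
    """Rough disk usage, scaling the quoted figure linearly."""
    return tables / 100_000 * gb_per_100k


print(batch_sql(1, 3))
print(estimated_disk_gb(3_000_000), "GB for 3 million empty tables")
print(round(1_000_000_000 / 300_000 / 24), "days to reach a billion at the quoted rate")
```

Under these assumptions, 3 million empty tables already need about 60 GB, which is consistent with the disk filling up long before the billion mark.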

Customize your Google

After writing this post, I became interested in the search parameters you can use to work more conveniently. When I looked into it, I did not find more or less complete information on the Russian-language Internet. But! It turns out Google documented everything long ago. Let's start simple: www.google.com/search?q=Запрос is the simplest query, in which all options are either disabled or taken from the parameters stored in your cookies. Next I will cover the most interesting and most used parameters. If you want to know more, look here. Although these are called search parameters, I will use the word token. To test any of the tokens, append it to the end of the URL (if your URL has the same form as the one just above). Country search token &cr=countryRU With this token, we get results from the specified country. A list of the abbreviations used. Example: Asus Finland UPDATE (thanks to Soutlan for noticing the error) Results language token &lr
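Appending tokens by hand is easy to get wrong once the query contains spaces or non-Latin characters, so here is a minimal Python sketch that assembles such a URL with the standard library. Only the q and cr parameters from the excerpt are used; the helper name and the https:// scheme are assumptions for this sketch, and whether Google still honours a given token today is not something the code checks.

```python
from urllib.parse import urlencode


def google_search_url(query, **tokens):
    """Build a search URL from a query plus optional tokens such as cr."""
    params = {"q": query}
    params.update(tokens)
    # urlencode percent-encodes spaces and non-ASCII characters for us.
    return "https://www.google.com/search?" + urlencode(params)


# Country token from the excerpt: results from Russia only.
print(google_search_url("Asus", cr="countryRU"))
```

Any further token can be passed the same way as a keyword argument, which keeps the query itself and the tokens clearly separated.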