The Guardian

In March this year, a man with a passion for Portuguese football, living in a city in Florida, was drinking heavily because his wife was having an affair. He typed his troubles into the search window of his computer. "My wife doesnt love animore," he told the machine. He searched for "Stop your divorce" and "I want revenge to my wife" before turning to self-examination with "alchool withdrawl", "alchool withdrawl sintoms" (at 10 in the morning) and "disfunctional erection". On April 1 he was looking for a local medium who could "predict my future". But what could a psychic guess about him compared with what the world now knows? This story is one of hundreds, perhaps tens of thousands, revealed this month when AOL published the details of 23m searches made by 650,000 of its customers during a three-month period earlier in the year. The searches were actually carried out by Google - from which AOL buys in its search functions.

The gigantic database detailing these customers' search inquiries was available on an AOL research site for just a few hours before the company realised that substituting numbers for users' names did not really protect their identities enough. The company apologised for its mistake - and removed the database from the internet. The researcher who published the material has been sacked, as has his manager, and last week AOL's chief technology officer, Maureen Govern, resigned. But those few hours online were enough for the raw data files to be copied all over the internet, and there are now four or five sites where anyone can search through them using specialised software.

What was published by AOL represents only a tiny fraction of the accumulated knowledge warehoused within Google's records - but it has given all of us, as users, a dramatic and unsettling glimpse of how much, and in what intimate detail, the big search engines know about us.

The number of searches Google carries out is a secret, but comScore, an independent firm, reckons that the search engine performed 2.7bn searches by American users alone in July this year. Yahoo, its main rival, conducted around 1.8bn American searches in the same month; Microsoft's MSN around 800m and AOL 366m.

All of this information is stored. Google identifies every computer that connects to it with an implant (known as a cookie) which will not expire until 2038. If you also use Gmail, Google knows your email address - and, of course, keeps all your email searchable. If you sign up to have Google ads on a website, then the company knows your bank account details and home address, as well as all your searches. If you have a blog on the free blogger service, Google owns that. The company also knows, of course, the routes you have looked up on Google maps. Yahoo operates a similar range of services.

All this knowledge has been handed over quite freely by us as users. It is the foundation of Google's fortune because it allows the company to target very precisely the advertising it sends in our direction. Other companies have equally ambitious plans: an application lodged on August 10 with the US Patent & Trademark Office showed that Amazon is hoping to patent ways of interrogating a database that would record not just what its 59 million customers have bought - which it already knows - or what they would like to buy (which, with their wish lists, they tell the world) but their income, sexual orientation, religion and ethnicity. The company, of course, already knows who we are and where we live.

Even though the search logs that AOL released were made anonymous, by assigning a number to each user, it is not difficult in many cases to discover somebody's name from their search queries. And it is easy to follow exactly what users were thinking as they sat at their computers, in the apparent privacy of their own homes, since the time and date of every search is given.

On April 4, for instance, user 14162375, the melancholy Portuguese-American in Florida, seems to have passed out on the keyboard at 6.20pm, when he asked, suddenly, "llllfkkgjnnvjjfokrb" then "vvvvbmkmjk" and "vvglhkitopppfoppr". An hour later he had recovered enough to search for variations on his wife's name - he thought she might have moved to New England. On the evening of April 16, matters came to a head. "My cheating wife," he typed; and then, five times, "I want to kill myself," and then "I want to make my wife suffer," followed quickly by "Kill my wifes mistress," "My wifes ass," "A cheating wife". Two days after that he was back looking for audio surveillance and bugging equipment and four weeks later he seemed to have cheered up and was looking for motorcycle insurance.

The story stops abruptly there, at the end of May, because that is when the three months' worth of released AOL search records came to an end.

One of the first researchers to demonstrate that we will tell anything, however intimate, to a computer, was Joseph Weizenbaum of MIT, who in 1966 wrote a programme called "Eliza" that parodied non-directional psychotherapy. If the user typed anything in, Eliza would appear to ask a question based on that cue. In no time at all, unhappy students were telling the computer all their troubles as if there were a real and sympathetic person behind the screen. Stories and jokes about this circulated for decades, but the men most successful at turning this concept into a fortune were the founders of Google, Larry Page and Sergei Brin. As users, we think that the Google search engine is a way of supplying us with information about what's on the web. But the flow of information is two way. We ask Google things that we would hesitate to ask anyone living. The price for the answers is that Google remembers it all.

Take user 11110859 of New York City, who fell in love and then was sorry. She was up early on March 7 to buy hip-hop clothes from G-Unit; by March 26, however, there was more excitement in her life. Searches on "losing your virginity" were followed by three weeks of frantic worry about whether she was pregnant: stuff she might have hesitated to tell her best friend or her mother is all quite clear from the Google searches. But by the end of April the pregnancy scare was over and had been replaced by a broken heart. Even before she had stopped asking "Can you still be pregnant even though your period came?" she was asking "Why do people hurt others" and this was the theme of almost all her questions throughout May, culminating on the afternoon of the 19th, when she asked "How to love someone who mistreated you?"; "What does Jesus say about loving your enemies?" "What does God mean when he says bless those who spitefully use you?" Then she spent a couple of days trying to buy Betty Boop postage stamps, and the next thing we know, she was asking first for directions to the New York prison on Rikers Island, then "What items are we allowed to bring at Rikers Island" and finally for "uncoated playing cards".

User 11110859 was not the only person interested in the prison but she seems to have been the youngest and, in some senses, the most innocent. User 3745417 laid out her thoughts in detail just as graphic: on March 6 she made eight searches on child molestation and similar phrases. A week later she was trying to find a prisoner in Rikers Island - nine searches in one evening - a subject she returned to at 9.30am on March 25, when she made another eight searches. Between March 27 and March 29 she made 34 successive searches for M&M chocolates in the early evening, followed on the 30th, at 10pm, by four searches for "Kid Party Games". By 10.15pm she was searching for "Whitney Houston"; then, in the course of the next hour, 29 searches on "black porn for women" and similar subjects.

By the end of April, she was looking for a legal aid lawyer in New York City, a swimsuit, a credit card and a holiday in the Bahamas.

These stories, with all the revealing information they contain, cannot always easily be tied to a specific individual, but sometimes they can. The social security number, with which all Americans are issued, conforms to a recognisable pattern which is easy to search for in the data that AOL released. So, too, are telephone numbers. On the internet, you can buy anything from anywhere, but there are some things, such as pet care, which people mainly buy locally, so it is easy to spot where they live. People often search for their own names, which can then be cross- referenced with the telephone book.

At least one person in the AOL group, a blameless grandmother in Alabama, was identified by the New York Times within days of the AOL data release. And though it may be hard to identify complete strangers, it is very much easier to recognise in the AOL data details of someone you may already know. A church lady in the midwest, whose quest for Christian quilted wall hangings was interspersed with inquiries about vibrators and arousing frigid wives, is probably easy for anyone in her congregation to identify.

This is knowledge beyond the dreams of any secret police in history. Earlier this year Google fought a lawsuit to keep a week's worth of random search data out of the hands of the US government, but other search companies have handed over their data without complaint and nobody has yet discovered what deals have been struck between search engines and the Chinese government. China is generally thought of as attempting to censor the internet, which it does; search engines that do business in China must censor their own results if they are to succeed. But the real power for a totalitarian government is no longer just censorship. It is to allow its citizens to search for anything they want - and then remember it.

No western government, so far as we know, has gone that far. But if one ever does, it will know where the information is kept that will tell it almost everything about almost everyone. This morning, as I logged in to Googletalk, to chat with my sister, the programme silently upgraded itself. "Would you like to show friends what music you're playing now?" it asked.

From spying on the wife to motorcycle insurance

This edited list of searches by Florida AOL user 14162375 shows what intimate details are held by internet databases

March

marriage counseling 2006-03-19 17:50:31

spy on the wife 2006-03-19 17:52:47

spy on the wife 2006-03-19 17:52:47

spy on the wife 2006-03-19 17:52:47

spy on the wife 2006-03-19 17:52:47

spy on the wife 2006-03-19 17:58:58

spy recorders 2006-03-19 18:02:34

signs of cheating 2006-03-19 18:05:52

videos 2006-03-20 17:56:16

postal service stamps 2006-03-21 09:27:46

tracking cell phone numbers 2006-03-21 11:00:13

divorce 2006-03-23 14:10:27

divorce lawyers 2006-03-24 00:38:47

cheating wives 2006-03-24 06:07:00

cheating wives 2006-03-24 06:07:00

divorce lawyers 2006-03-24 13:10:32

saving a marriege 2006-03-24 13:42:04

saving a marriege 2006-03-24 15:02:24

saving a marriege 2006-03-24 15:02:24

saving a marriege 2006-03-24 15:20:13

fitness gyms 2006-03-24 16:32:50

womes wellness 2006-03-24 16:35:33

hypertension 2006-03-24 17:07:33

e-cards 2006-03-26 23:40:56

saving a marriage 2006-03-26 23:50:11

saving a marriage 2006-03-26 23:50:11

saving a marriage 2006-03-26 23:50:11

sexual techiques 2006-03-27 10:39:27

greenting cards 2006-03-27 12:45:53

standar times 2006-03-27 23:09:25

news papers 2006-03-27 23:09:56

stop your divorce 2006-03-27 23:49:06

stop your divorce 2006-03-27 23:53:30

stop your divorce 2006-03-28 00:06:53

alchool withdrawl 2006-03-28 10:43:51

alchool withdrawl sintoms 2006-03-28 10:45:38

disfunctional erection 2006-03-28 10:46:46

cheating therapy 2006-03-30 16:49:56

women's urine blood 2006-03-30 18:21:16

spy from a distance 2006-03-31 21:11:29

spy from a distance 2006-03-31 21:11:29

spy from a distance 2006-03-31 21:15:55

spy from a distance 2006-03-31 21:15:56

listentrough walls 2006-03-31 21:16:22

listen through walls 2006-03-31 21:16:25

car sound recorder 2006-03-31 21:20:07

car conversation spy 2006-03-31 21:20:24

April

spy on wife 2006-03-31 21:21:29

phico card readers 2006-04-01 22:03:08

bruchas 2006-04-01 22:04:17

phyco card readers 2006-04-01 22:06:43

phyco card readers 2006-04-01 22:07:10

predict my futur 2006-04-01 22:20:24

psychic 2006-04-02 10:14:07

i want my wyfe back 2006-04-02 23:14:28

i want revenge to my wife 2006-04-02 23:27:54

i want revenge to my wife 2006-04-02 23:27:54

get revenge from a wife cheater 2006-04-02 23:41:22

munchies 2006-04-03 11:54:59

lisbon jobs 2006-04-03 11:58:20

divorce and kids 2006-04-03 12:19:46

llllfkkgjnnvjjfokrb 2006-04-03 18:20:11

vvvvbmkmjk 2006-04-03 18:20:36

vvglhkitopppfoppr 2006-04-03 18:22:04

www.whitepages 2006-04-06 06:14:07

my wife wants to leave me 2006-04-07 16:35:03

how do i get my wife love me again 2006-04-08 17:10:55

need help getting my wife back 2006-04-08 19:27:2

i need my wife to get back to me 2006-04-08 19:29:11

i need my wife to get back to me 2006-04-08 19:29:11

my wife doesnt love animore 2006-04-08 19:30:58

i still live whith my wife can i get her bach 2006-04-08 19:32:15

i want revenge towards my wife 2006-04-08 19:32:59

i want revenge towards my wife 2006-04-08 19:32:59

i want revenge towards my wife 2006-04-08 19:32:59

i want revenge towards my wife 2006-04-08 19:36:58

making my wife suffer as i do 2006-04-09 13:19:54

get my wife back 2006-04-09 14:03:28

avoid breaking up 2006-04-09 14:04:11

avoid breaking up 2006-04-09 14:04:11

stop breaking up 2006-04-09 15:10:47

get even with my wife 2006-04-09 15:15:16

husband revenge 2006-04-09 15:23:37

husband revenge 2006-04-09 15:23:37

husband revenge 2006-04-09 15:23:37

how to harm my wifes lover 2006-04-10 13:11:28

infidelity 2006-04-10 14:32:02

whow to talk on the phone with youor wife 2006-04-10 14:43:07

catch your wife aving an affair 2006-04-10 14:44:32

baby monitors 2006-04-15 17:15:31

baby monitors 2006-04-15 17:15:31

my cheating wife 2006-04-16 16:48:06

my cheating wife 2006-04-16 16:48:06

my cheating wife 2006-04-16 16:48:06

i want to kill myself 2006-04-16 19:55:51

kill my wifes mistress 2006-04-16 20:26:49

my wifes ass 2006-04-16 20:38:37

cheating wives 2006-04-18 16:45:12

recording home survellence 2006-04-18 16:54:43

recording home surveillance 2006-04-18 16:54:53

audio roome surveillance 2006-04-18 16:55:40

audio roome surveillance 2006-04-18 16:55:43

sore muscules 2006-04-23 17:32:00

sore muscles 2006-04-23 17:32:06

sore muscles 2006-04-23 17:32:06

sore muscles 2006-04-23 17:32:06

alcoolism 2006-04-24 08:10:53

men acting like winners 2006-04-25 16:02:09

make the infidelity suffer 2006-04-25 16:03:20

the portuguese mafia 2006-04-25 16:24:04

May

motorcycle inurance 2006-05-29 18:31:19

motorcycle insurance 2006-05-29 18:31:29

private eye 2006-05-30 21:12:07

video surveillance 2006-05-30 21:20:18

video surveillance 2006-05-30 21:21:05

video surveillance 2006-05-30 21:21:24

white pages 2006-05-31 05:55:41

âˆ‘ AOL user search history data, released by AOL, August 2006.