Yahoo Buys Web 2.0
"We're in the midst of buying Dogg (a Web 2.0 cross between Digg and Dogster "Where Every Dog Has A Webpage"), and you know what? It's a lot of work. Buying up Web 2.0 companies here and there in piecemeal fashion gets old after a while.
It's a lot of work on our Corporate Development team, the Public Relations team, and really isn't that efficient.
So after some long discussions with Tim O'Reilly, Michael Arrington, and other Web 2.0 experts, we've decided to just buy Web 2.0.
All of it. All the people, the round cornered boxes, crazy business ideas, and pastel colors." (Yahoo Search Blog)
So if you want to be a Yahoo company, just create a Web 2.0 startup and you'll automatically be a part of Yahoo's big family.
Read some great tips to create a Web 2.0 site:
"If the name of your app isn't short and catchy, it won't take off. Try recording yourself reading paragraphs from Geoffrey Chaucer's The Canterbury Tales, then play the audio backwards, slowly, for inspiration.
Make access to the app invite-only, but don't actually invite anyone. Nothing creates more desire for a product than its exclusivity. If no-one has it, everyone will want it. Simple!
Ask an A-list blogger to review your app (linking to your own "review", so that you drive traffic to both your web app and the site that reviews it). While it's unlikely that they'll do so without actually logging in and trying out your app for real, they might change their tune if you offer them some kind of incentive. Send them some Photoshopped screenshots showing tag clouds and images of your app in use, and blame your data centre for the server being down -- the perfect excuse for why they can't login just now. Then promise them 20% of your profit when you sell the app to Yahoo! Negotiate as required -- everyone has their price."
Related:
Yahoo buys a company before its foundation
Google acquires Writely
The best Web 2.0 software of 2005
Just what is Web 2.0?
Google Rooms
Related:
Google Browser launched
Yahoo buys all Web 2.0
Google Browser
You want a browser that doesn't use all your RAM.
Firefox is a memory hog.
You want a secure browser.
Internet Explorer is not secure.
You want a fast browser.
Opera is fast, but not that fast.
You want a cross-platform browser.
Safari is definitely not cross-platform.
You want a browser that doesn't care about JavaScript.
Netscape cares about JavaScript - they invented JavaScript.
Presenting Google Browser, a fast, versatile and secure text browser for the Internet. Google Browser is free, already out of beta, and the installer is only 1.68 MB.
Just open it, type g (that's from Google), enter a URL, press enter and visit the web through Google's eyes.
Download Windows version
Download Mac version
Download Linux version
Tutorial
Google mentions that Linux users should compile the downloaded package. Google Browser is the first major Google software released for Linux.
Note: This post was written for April Fools' Day, so there's no Google Browser. At least not yet.
New Google Talk with Funny Avatars
You can also change your chat theme to include your contact's avatar.
Another new feature in this build is a Gmail counter that shows how many new emails you have, so you don't have to right-click the tray icon.
You can download Google Talk 1.0.0.91 from here (this download is not linked at Google Talk's website).
[Via Google Blogoscoped].
Related:
Google Talk with Video
Gmail Users Need Fingerprint Readers
Many users think that even if someone knows their Gmail password, that person shouldn't be able to delete the Gmail account. But that's absurd: if you log in to Gmail, you can select emails (in batches of up to 100) and delete them. That's almost the same thing as deleting the account.
Until identification systems evolve from usernames and passwords to biometric authentication, there will always be problems like these. Biometric solutions use unique biological or behavioral characteristics (like fingerprint matching) to verify identity. A growing number of notebook PCs and computer peripherals are coming to market with built-in fingerprint readers, including keyboards, mice, external hard drives, USB flash drives and readers built into PC card and USB plug-in devices. For example, the Wireless IntelliMouse Explorer from Microsoft has a fingerprint reader that allows you to log on to your PC and your favorite Web sites with your fingerprint.
Read more:
Gmail horror story: Gmail account deleted
What Happens When You Google Google?
Google Local Shows Ads on Maps
If you search for "New York books" and "Ralph Lauren New York" on Google Local, you will find some small icons on the map that represent a coffee cup, a shopping bag, a grocery cart, a flower or something related to the business. If you click on the link, you'll see more information about the place, the address, driving directions, the site.
Related:
Google Local ads
Google Local Mobile
Windows Live Local
Google AdWords Blog
Improve Time Management: Overcome Procrastination
* start with a written plan of action to avoid getting distracted
* keep your plan simple and straightforward
* start with the one thing you must get done today to feel productive
* should be a manageable item you can complete in 10-15 minutes
* your tasks should match your values or purpose
* bring each task into congruence with your basic mission
* if you can't, take it off of your list
* don't put any "to-do" on your list that takes more than 30 minutes
* if it takes longer, it's actually a series of smaller "to-do's"
* don't try to do everything perfectly
* perfectionism often causes procrastination
* any small step toward completion is an accomplishment
* do the worst job (or part of the job) first and get it out of the way
* set a time limit -- "I'll file papers for 5 minutes"
* alternate unpleasant jobs with tasks you enjoy
* delegate out items you can't make yourself do
* interruptions tend to occur in identifiable patterns
* notice when interruptions occur, by whom, and why
* take steps to prevent those interruptions before they occur
* if they can't be prevented, learn how to delegate to someone else
* if they can't be delegated, learn how to delay until you are finished
* make the project and environment as pleasant as possible
* give yourself the best tools and work space for the project
* take a few minutes to organize your work space
* schedule a regular time to check in with a friend or colleague
* rewarding your accomplishments encourages productivity
From Online Organizing.
Google Sandbox and TrustRank Algorithm
Jon Galloway explains how Google changed their ranking system:
Jagger's [Google index update from October 2005] main change is the switch from the elegant but overly trusting PageRank system to the more realistically cynical TrustRank system, which is designed to only count votes from sites it trusts.
TrustRank imitates human behavior - if a stranger on a train recommends a movie, I'm going to value it a lot less than a recommendation from a close friend or movie critic, both of whom have earned my trust by either how long I've known them or by their reputation. Trust comes from two sources - site age and links from trusted sources. From my movie recommendation analogy above, site age is the close friend who has gained trust through the age of the relationship, whereas trusted sources are sites that have been granted a position of authority by links from a small seed group of trusted sites.
Another way to look at this is from the point of view of a content publisher with a new site. At first, your links will be untrusted and will not contribute to the Page Rank of the page they link to. The site has to undergo an aging delay before it is considered authoritative, which has led to discussion of the "Sandbox" (or the "Trustbox"). The idea is that new sites are sandboxed so they can't mess up the rankings until they've proven themselves, at which time they can participate in Page Rank voting.
There are two ways to gain trust and escape the Trustbox:
* Acquire links from highly trusted sources (the "movie critic recommendation")
* Acquire links from somewhat trusted sources and let them age (the "friend recommendation")
Google Sandbox is a filter whose criterion is the age of a site. After, let's say, 4-6 months, or when the site acquires highly trusted links, the site is given credit for the backlinks it has established: its PageRank increases and it becomes more visible in the search results.
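To make the "only count votes from sites it trusts" idea concrete, here is a minimal sketch of trust propagation from a seed set. It is not Google's actual algorithm: the function name, the damping value and the example.com-style site names are all assumptions made for illustration. Trust starts on a hand-picked seed and flows along outgoing links, so a site that nobody trusted links to ends up with a score of zero.

```python
def trust_rank(links, seeds, damping=0.85, iterations=50):
    """Propagate trust from seed pages along outgoing links (illustrative only).

    links: dict mapping each page to the list of pages it links to.
    seeds: set of pages that are trusted a priori (hypothetical seed list).
    """
    pages = set(links) | {t for targets in links.values() for t in targets}
    # All initial trust sits on the seed pages.
    seed_score = {p: (1.0 / len(seeds) if p in seeds else 0.0) for p in pages}
    trust = dict(seed_score)
    for _ in range(iterations):
        # Each round, seeds get a fixed share back; everyone else starts at 0.
        nxt = {p: (1 - damping) * seed_score[p] for p in pages}
        for page, targets in links.items():
            if not targets:
                continue  # pages without links pass nothing on
            share = damping * trust[page] / len(targets)
            for t in targets:
                nxt[t] += share
        trust = nxt
    return trust


# Hypothetical link graph: only one chain starts at a trusted seed.
links = {
    "seed.example": ["critic.example"],
    "critic.example": ["newsite.example"],
    "stranger.example": ["spam.example"],
    "newsite.example": [],
    "spam.example": [],
}
scores = trust_rank(links, seeds={"seed.example"})
```

In this toy graph, newsite.example earns some trust through the "movie critic" chain, while spam.example, recommended only by a stranger, gets none. The aging delay is not modeled here.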
Related:
Expertrank: authoritative search
The future of search
Google Interactive TV
"You will identify key market trends that are shaping user behavior when watching Television. These include but are not limited to the intersection of internet and Television technologies, video-on-demand, personal video recorders and emergence of next generation set-top-boxes with IP connectivity. You will then identify areas where use of Google’s search and advertising technology can enhance this user experience and define appropriate products to deliver these user benefits." (Google Jobs)
Google aims to extend its AdSense advertising program offline, to print and television. The open question is how Google will deliver contextual ads: will they match the show's content or the viewer's profile?
Related: Google ads on TV
Burn a Windows Live CD
You will have a complete Win32 environment with network support, a graphical user interface (800x600) and FAT/NTFS/CDFS filesystem support. It's useful for testing systems with no OS, data recovery or virus scan.
What's great about Bart's PE Builder is that it has many plugins that extend the operating system and allow you to do many tasks:
* Access USB drives.
* Load the CD with SSH, Remote Desktop Client and VNC so you can use the boot CD as a workstation.
* Recover deleted files.
* Defragment the hard drive (much faster than defragmenting from a boot HD).
* Use Internet Explorer and Firefox from the boot CD to surf the web.
You can get Bart's PE Builder here. You should copy the XP setup files to your hard disk. Before building the Windows Live CD, visit PEBuilder Plugin Repository and Paraglider PE Plugins to get some plugins.
Related posts about data recovery:
System Rescue Live CD
TestDisk: Recover lost partitions
How to crack a Windows password
Will Google Buy Facebook?
Google may periodically sell up to 5.3 million shares of stock according to a regulatory filing on Wednesday. At its current share price of $394.98 a share, a sale of 5.3 million shares would raise nearly $2.1 billion.
Facebook, the Web site where students around the world socialize and swap information, has put itself on the block, BusinessWeek Online has learned. The owners of the privately held company have turned down a $750 million offer and hope to fetch as much as $2 billion in a sale, senior industry executives familiar with the matter say.
Well, can you see a connection between the two news items? BusinessWeek says there may be one. After all, Facebook might integrate with Google Scholar, Google Books and Google Groups. But $2.1 billion is a lot of money. Just consider that Rupert Murdoch bought Myspace.com for $580 million.
Joga: Share Your Passion For Football
The site will ask you a lot of questions about football: what teams you like, favorite players, most embarrassing football moment, favorite other sports. The good thing is that most of them are optional.
But what can you do on Joga? You can create a blog, upload videos, create albums, add bookmarks, add friends based on simple criteria like location, age and join communities. You can also create your own teams, find local fields, and play other Joga teams in your community.
"Whether we've succeeded, of course, will be up to all of you to determine. We look forward to seeing football-crazy people from around the world playing as beautifully as possible at Joga.com." (Google Blog)
The New Yahoo Mail Will Have Obtrusive Ads
Another usability problem concerns the ads in Yahoo Mail Beta. There are two groups of ads on most pages. One is in the right sidebar (animated Flash), next to the scrollbars that let you select a mail or read it. The other group sits at the bottom, right under the navigation, and includes the usual small "credit card" and "free loan consult" ads. The problem is that it's very easy to click on the ads by mistake. And if you click on an ad, it doesn't open in a new page or tab; the page just replaces the ad in the iframe, so you'll get this funny picture.
Because Yahoo chose to stick with the Flash ads, if you move from one page to another (from the inbox to the RSS reader, for example), you'll notice a big delay in page loading, especially if you have a slow connection.
And another thing: Yahoo didn't drop the welcome page that tells you how many emails you have, and whose only purpose is to make people click on the ad if they don't have any new messages.
BitTorrent Search Engine Defends Itself Against MPAA
The judges probably won't be too impressed to hear that searching for [photoshop bittorrent] on Google yields as a top result a site that links to Adobe Photoshop torrents. If you search for something unlawful, Google is not responsible for the sites you find.
The MPAA filed seven lawsuits against Torrentspy and other search companies that help visitors find torrents or instruct them how to download them.
"The MPAA is in essence trying to outlaw the .torrent file format." says Ira Rothken, Torrentspy lawyer.
BitTorrent has become a widely used online system to download very large files as the technology is very efficient at splitting up and sharing data.
Google Page Creator Review
"You can easily add for example many HTML, CSS, Java, and JavaScript features by simply creating your web page for example with an HTML editor (e.g. Mozilla Composer), a WYSIWYG editor (e.g. OpenOffice.org) or with a text editor (e.g. SuperEdi, Notepad, or emacs) and then you just upload the file to GPC. You can also use Google videos, Google maps and stat counters by uploading your pages, which isn't possible with the Google Page Creator's Page/HTML editor, because it removes objects that use features such as JavaScript, which usually are in elements like script or embed in the HTML source."
So if you want Flash, Google AdSense, Google Maps, Google Video, Java, JavaScript, music, stat counters, or videos, create your pages offline and upload them to Google's servers. And if you think the whole point of Google Page Creator was to create pages online, you're wrong. What other web host gives you a 100 MB quota, a 10 MB maximum file size, unlimited bandwidth (well, almost) and no restrictions on file types (mp3, avi, exe, zip), all for free and without putting its ads on your pages?
Google April Fool's Day 2006 - Take A Guess
Let's take a guess:
1) Google will take over the world. Google will produce a device that will scan your important information (documents, bills, pictures, favorite colors, what you say, SIM content, hard-drive content), organize it, upload it to your Google account and make it searchable. EGoogle device.
2) Google will drop its simple and clean homepage in favour of the Personalized Home, that will also contain ads. Furthermore, you won't be able to search Google unless you have a Google account and you are logged in.
3) All mails from Gmail will be deleted. The Gmail team realized it's so inefficient to type so many emails every day, so they will introduce AudioGmail, where you can record, search and organize voicemails. The new AudioGmail has another advantage: spam will be easier to recognize.
On a more serious note, everybody is waiting for Google Calendar, and some people out there still hope that Gmail will go public.
Related:
The real Google prank from this year
Google April Fool's Day 2000: Google MentalPlex
Google April Fool's Day 2002: PigeonRank
Google April Fool's Day 2004: Lunar Jobs
Google April Fool's Day 2005: Google Gulp
Googlebot Can Destroy Sites
"Josh Breckman worked for a company that landed a contract to develop a content management system for a fairly large government website. Much of the project involved developing a content management system so that employees would be able to build and maintain the ever-changing content for their site.
Things went pretty well for a few days after going live. But, on day six, things went not-so-well: all of the content on the website had completely vanished and all pages led to the default "please enter content" page. Whoops.
Josh was called in to investigate and noticed that one particularly troublesome external IP had gone in and deleted *all* of the content on the system. The IP didn't belong to some overseas hacker bent on destroying helpful government information. It resolved to googlebot.com, Google's very own web crawling spider. Whoops.
As it turns out, Google's spider doesn't use cookies, which means that it can easily bypass a check for the "isLoggedOn" cookie to be "false". It also doesn't pay attention to Javascript, which would normally prompt and redirect users who are not logged on. It does, however, follow every hyperlink on every page it finds, including those with "Delete Page" in the title. Whoops."
So next time, don't assume every visitor has JavaScript enabled, and validate actions on both the client side and the server side. If you want to validate your data for accuracy and security, you must use server-side code to check your form inputs and the user's login state.
And something else: according to the HTTP 1.1 specification, the GET method is defined as a Safe Method which "SHOULD NOT have the significance of taking an action other than retrieval." If you need to change state (delete content, replace data), you should use POST.
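Both lessons fit in a few lines. The sketch below is a hypothetical request handler, not the CMS from the story: the `/delete/` path, the `session` dict and the `pages` dict are assumptions standing in for a real framework and database. It refuses to delete anything on a GET request and checks the login on the server instead of trusting a cookie test or a JavaScript redirect in the browser.

```python
SAFE_METHODS = {"GET", "HEAD"}


def handle_request(method, path, session, pages):
    """Return an HTTP-style status code for a request to a tiny, made-up CMS.

    session: dict holding server-side login state (hypothetical).
    pages:   dict of page name -> content, standing in for the database.
    """
    name = path.rsplit("/", 1)[-1]
    if path.startswith("/delete/"):
        # Deleting changes state, so a safe method must never trigger it.
        if method in SAFE_METHODS:
            return 405  # Method Not Allowed
        # Check the login on the server; a crawler ignores client-side checks.
        if not session.get("logged_in"):
            return 403  # Forbidden
        if pages.pop(name, None) is None:
            return 404  # nothing to delete
        return 200
    if method in SAFE_METHODS:
        return 200 if name in pages else 404
    return 405


pages = {"home": "Welcome"}
crawler = {}  # a crawler sends no cookies, so it has no session
```

With this handler, a spider that follows a "Delete Page" link sends a GET and gets 405 back, and even a POST without a server-side session gets 403; only a logged-in POST removes the page.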
Related posts about security:
Google Deleted... Google Blog
GMail vulnerability: GMail runs javascript in body
Get sensitive information using Google
How To Crack Your Windows Password
Minimum amount of RAM required: 256MB.
Recommended amount of RAM: 512MB.
Download Ophcrack Live CD
Related:
Reset Windows Password
Rainbow Tables (Wikipedia)
Google's Internal Competition
George P. Alexander Jr. writes in his blog about a different perspective on life at Googleplex:
"Do you know that there is so much internal competition in Google? For instance, there are some situations when a project is set to start its life cycle, that there is, on a parallel other teams that work on the same project (and if I'm right, sometimes the identity of the opposing team members or even this aspect of parallel teams existing for a project is not revealed until a later time in the SDLC). The final product that gets released belongs to the team that comes with the best proof of concept AND / OR the best design AND / OR the best pilot AND / OR the best final product AND / OR the best something else based on various parameters."
It seems that Google uses both competition and collaboration to deliver great products, but it's weird to work at Google as an undercover programmer.
Related:
Inside Google Campus
Xooglers (ex-Googlers)
Google Office
How To Reset Windows Password
This tip works for NT-based operating systems: Windows NT4, 2000, XP and 2003.
Related posts about data recovery:
TestDisk: Recover Lost Partitions
System Rescue Live CD
Google Gains Market Share In The US
Google's share of the US search market increased to 42.3% from 36.3% a year earlier, according to a study by ComScore. Yahoo's share dropped to 27.6% and MSN's share dropped to 13.5%, while AOL's share increased slightly to 8.0% from 7.9% in January 2006. Ask's market share rose to 6.0% from 5.6% in January 2006.
Related:
Where Do You Want To Go Today?
Google Market Share in the U.K.
FlashGet - Best Download Manager For Windows
FlashGet features:
* clipboard monitoring, browser monitoring (Explorer, Firefox, Opera) - you just click or copy the download link you want and FlashGet will get it for you
* splits files into sections and downloads all the sections simultaneously
* supports RTSP and MMS protocols
* a very nice Site Explorer utility that lets you explore HTTP and FTP sites. You can crawl the Google site, starting from the index and following all the links recursively.
* you can create rules for download management: for example, all mp3 files should go into a specific folder
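Splitting a download into sections usually comes down to asking the server for different byte ranges of the same file. The sketch below only computes those ranges and the matching HTTP Range header values; whether FlashGet works exactly this way is an assumption, and the function names are made up for illustration.

```python
def split_ranges(total_size, parts):
    """Divide [0, total_size) into contiguous, inclusive byte ranges."""
    if total_size <= 0 or parts <= 0:
        raise ValueError("total_size and parts must be positive")
    parts = min(parts, total_size)  # never create empty sections
    base, extra = divmod(total_size, parts)
    ranges, start = [], 0
    for i in range(parts):
        # The first `extra` sections are one byte longer than the rest.
        length = base + (1 if i < extra else 0)
        ranges.append((start, start + length - 1))
        start += length
    return ranges


def range_header(start, end):
    """Format one section as an HTTP Range header value."""
    return f"bytes={start}-{end}"


sections = split_ranges(10, 3)
headers = [range_header(s, e) for s, e in sections]
```

Each section can then be requested on its own connection and written to the right offset in the output file, provided the server supports range requests.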
Download from Softpedia (FlashGet's homepage has slow mirrors for download)
Update: FlashGet is now free and it doesn't have ads.
Google Deleted... Google Blog
"We've determined the cause of tonight's outage. The blog was mistakenly deleted by us (d'oh!) which allowed the blog address to be temporarily claimed by another user. This was not a hack, and nobody guessed our password. Our bad." says Jason Goldman, Blogger Product Manager.
TestDisk: Recover Lost Partitions
- DOS
- Windows (NT4, 2000, XP, 2003)
- Linux
- FreeBSD, NetBSD, OpenBSD
- SunOS
- MacOS
Related:
System Rescue Live CD
Google Desktop As Data Recovery Software
Where Do You Want To Go Today?
google
google.com - 75%
images.google.com - 9%
mail.google.com - 7%
groups.google.com - 2%
video.google.com - 2%
news.google.com - 1%
froogle.google.com - 1%
yahoo
mail.yahoo.com - 53%
search.yahoo.com - 9%
news.yahoo.com - 3%
login.yahoo.com - 3%
bid.yahoo.com - 2%
yahoo.com - 2%
finance.yahoo.com - 2%
msn
hotmail.msn.com - 73%
search.msn.com - 8%
spaces.msn.com - 5%
msn.com - 3%
msnbc.msn.com - 2%
moneycentral.msn.com - 1%
arabia.msn.com - 1%
*The data is provided by alexa.com and it's not statistically accurate.
Check Multiple GMail Accounts
Gmail Manager displays account details including unread messages, saved drafts, spam messages, labels with new mail, space used, and new mail snippets. You'll also get a compose menu for your Gmail accounts.
Also read:
5 Fast Ways To Check Your Gmail
Gmail Skins: Change Gmail Interface
Gmail Horror Story: Gmail Account Deleted
Google Market Share in the U.K.
Here are the top 5 search engines in the U.K.:
Google | 74.67% |
Yahoo | 9.30% |
MSN | 5.46% |
AOL | 4.21% |
Ask Jeeves | 2.28% |
Read:
Google Traffic Breakdown (November 2005)
Gmail Skins: Change Gmail Appearance
Features:
* Change the colour scheme of your inbox
* Insert smileys/emoticons and images in to your emails
* Make the navigation (Inbox, Starred, Sent Mail) horizontal
* Zebra stripes on the inbox (alternating colors for rows)
* Change the attachment paperclip to an icon indicating the file type of attachment
* Hide invite panel, page footer
* Hide your email address at the top - for privacy issues
The skins don't look very pretty, but they break up the monotonous Gmail look a little.
Alternatives:
Skinning Gmail with user-defined CSS page
Highest Paying AdSense Keywords
$54.33 mesothelioma lawyers
$47.79 what is mesothelioma
$47.72 peritoneal mesothelioma
$47.25 consolidate loans
$47.16 refinancing mortgage
$45.55 tax attorney
$41.22 mesothelioma
$38.86 car accident lawyer
$38.68 ameriquest mortgage
$38.03 mortgage refinance
$37.55 refinancing
$35.99 auto accident attorney
$35.52 equity mortgage
$34.34 mesothelioma texas
$34.05 mortgages
$33.80 criminal defense attorney
$33.54 epocrates
$32.95 mesothelioma
$32.08 car accident attorney
$31.60 mortgage refinance rate
$31.38 loan refinance
$31.29 personal injury attorney
$31.24 best refinance
$30.14 register domain names
$29.86 medical malpractice lawyer
$29.68 incorporate
$29.68 malignant mesothelioma
$29.49 mortgage refinance
$29.45 freecreditreport
$29.41 fargo refinance
$28.53 mortgage loans
$28.15 125 refinance
$28.05 los angeles lawyer
$27.96 re mortgage
$27.38 how to register a domain name
$27.31 mortgage refinance rate
$26.86 personal injury
$26.48 refinance
$26.17 refinance
$25.43 mortgage loan
$25.35 texas refinance
$25.33 medical malpractice attorneys
$25.33 mortgage application
$24.46 mortgage companies
Detecting Near-duplicate Documents
"From the perspective of users, duplicate and near-duplicate documents raise problems. More specifically, when users submit a query to a search engine, most do not want links to (and descriptions of) Web pages which have largely redundant information. For example, search engines typically respond to search queries by providing groups of ten results. If pages with duplicate content were returned, many of the results in one group may include the same content. Thus, there is a need for a technique to avoid providing search results associated with (e.g., having links to) Web pages having duplicate content."
One idea might be indexing the keywords in the documents and comparing the percentage of terms shared by the two documents, but that is highly inefficient.
Or you can try to compute the edit distance (Damerau-Levenshtein distance) between the two documents. The edit distance between two input strings is the minimum cost of a sequence of edit operations (substitution of one symbol for another, insertion of an extraneous symbol, deletion of a symbol, and transposition of adjacent symbols) needed to change one string into the other.
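As a rough illustration of the edit distance described above, here is a small dynamic-programming sketch. It implements the restricted variant of Damerau-Levenshtein (often called optimal string alignment), where every operation costs 1 and a transposed pair is not edited again; the function name is an assumption. The table has one cell per pair of positions, so the cost grows with the product of the two lengths, which is why this gets expensive for whole documents.

```python
def edit_distance(a, b):
    """Restricted Damerau-Levenshtein distance with unit costs."""
    rows, cols = len(a) + 1, len(b) + 1
    # d[i][j] = distance between the first i symbols of a and first j of b.
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all i symbols
    for j in range(cols):
        d[0][j] = j  # insert all j symbols
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution (or match)
            )
            # Transposition of two adjacent symbols.
            if (i > 1 and j > 1
                    and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]):
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]
```

For example, "google" and "googel" are at distance 1 (one transposition), while "kitten" and "sitting" are at distance 3.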
A much better method for detecting duplicate and near-duplicate documents involves generating "fingerprints" (hashes) for elements (paragraphs, sentences, words, shingles) of documents. Two documents would be considered to be near-duplicates if they share more than a predetermined number of fingerprints.
A k-shingle is a sequence of k consecutive words from a document. If S(A) is the set of shingles contained in A, we can compute the resemblance of A and B like this: |S(A) ∩ S(B)| divided by |S(A) ∪ S(B)|. The problem is that the intersection is hard to compute, so it has to be estimated.
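Here is a minimal sketch of shingling and resemblance, plus one common way to estimate it. The exact version builds both shingle sets and divides the size of the intersection by the size of the union. The estimate keeps only a short sketch per document: for each of several hash functions it stores the smallest hash value over the document's shingles, and the fraction of positions where two sketches agree approximates the resemblance. The function names, the use of SHA-1 with a seed prefix, and the choice of 64 hash functions are assumptions for illustration.

```python
import hashlib


def shingles(text, k=3):
    """Return the set of k-word shingles of a text."""
    words = text.lower().split()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def resemblance(a, b, k=3):
    """Exact resemblance: |S(A) & S(B)| / |S(A) | S(B)|."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def _hash(shingle, seed):
    # One "hash function" per seed, derived from SHA-1 (an arbitrary choice).
    data = f"{seed}:{shingle}".encode("utf-8")
    return int.from_bytes(hashlib.sha1(data).digest()[:8], "big")


def sketch(text, k=3, num_hashes=64):
    """Keep the minimum hash value per hash function as a fixed-size sketch."""
    sh = shingles(text, k)
    if not sh:
        raise ValueError("text is shorter than k words")
    return [min(_hash(s, seed) for s in sh) for seed in range(num_hashes)]


def estimated_resemblance(a, b, k=3, num_hashes=64):
    """Fraction of sketch positions on which the two documents agree."""
    ska, skb = sketch(a, k, num_hashes), sketch(b, k, num_hashes)
    return sum(x == y for x, y in zip(ska, skb)) / num_hashes


doc1 = "the quick brown fox jumps over the lazy dog"
doc2 = "the quick brown fox jumps over the lazy cat"
exact = resemblance(doc1, doc2)
approx = estimated_resemblance(doc1, doc2)
```

The two sample sentences share 6 of their 8 distinct 3-shingles, so the exact resemblance is 0.75. The estimate will be close to that but not identical, and it only needs the two fixed-size sketches, not the full shingle sets.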
Learn more from Andrei Broder's course at Princeton University [PDF, html version].
"Search without a box" - A chat with Andrei Broder
Internet Censorship In China
WHO (helps China censor the Internet)
Western corporations provide much of the equipment and services for China's Internet system. Major players include Cisco Systems, Nortel Networks, Sun Microsystems, 3COM, Google, Yahoo!, Microsoft, IBM and others.
Cisco Systems has been integral to China's Internet development. Its router equipment, which reportedly provides no anonymity or encryption and was specifically designed for China, is in the core of the nation's surveillance of the Internet.
WHERE (is censorship exercised)
Called wangba, or Net bars, cybercafés are required to keep detailed logs of customers' online activity on file for 60 days. If a user tries to access forbidden Web sites, a café must disconnect the user and file a report with state agencies. Penalties for violations include fines and even imprisonment.
WHAT (is censored)
Chinese search engines monitor content by keyword and remove offending Web sites. When people request banned content (for example: Revolution, Democracy, STD, Human rights) through Chinese search engines like Baidu and Yisou, the filtering system disconnects them.
HOW (to bypass the censorship)
People use proxy relays to get around Internet filtering and monitoring.
Tunneling allows a user in a censored location to access information through a tunnel to a computer in an unfiltered location. All requests run through an encrypted tunnel to a non-filtered computer, which forwards requests and responses transparently. Both private and commercial tunneling services are available.
Related posts:
Google censors search results in China
Free proxy list (anonymous proxy servers)
Google April Fool's Day 2000: Google MentalPlex
Google MentalPlex gives some funny error messages:
* Error 01: Brainwaves received in analog. Please re-think in digital.
* Error 005: Searching on this topic is prohibited under international law.
* Error 008: Interference detected. Remove aluminum foil and remote control devices.
* Error 8P: Unclear on whether your search is about money or monkeys. Please try again.
* Error 666: Multiple transmitters detected. Silence voices in your head and try again.
* Error: MentalPlex(tm) has determined that this is not your final answer. Please try again.
More Google April Fool's Day:
Google April Fool's Day 2002: PigeonRank
Google April Fool's Day 2004: Lunar Jobs
Google April Fool's Day 2005: Google Gulp
Google April Fool's Day 2006 - Take A Guess
Google April Fool's Day 2002: PigeonRank
"PigeonRank's success relies primarily on the superior trainability of the domestic pigeon (Columba livia) and its unique capacity to recognize objects regardless of spatial orientation. The common gray pigeon can easily distinguish among items displaying only the minutest differences, an ability that enables it to select relevant web sites from among thousands of similar pages.
By collecting flocks of pigeons in dense clusters, Google is able to process search queries at speeds superior to traditional search engines, which typically rely on birds of prey, brooding hens or slow-moving waterfowl to do their relevance rankings."
It's PigeonRank, not PageRank.
More Google April Fool's Day:
Google April Fool's Day 2000: Google MentalPlex
Google April Fool's Day 2004: Lunar Jobs
Google April Fool's Day 2005: Google Gulp
Google April Fool's Day 2006 - Take A Guess
Horoscopes, Financial Advice, Chat
Well, two years later look what you can see on Google pages: Google Finance, Google Talk and horoscope on Google IG.
Also see:
Google is not a portal
Google Finance launched
Mike Grehan: Google to become a portal
Google Hires Bram Moolenaar, Author Of Vim
Other famous people hired by Google:
* Vint Cerf, internet "founding father"
* Guido van Rossum, creator of Python
* Ben Goodger, lead software engineer on Mozilla's Firefox development team
* Sean Egan, lead developer for Gaim
* Adam Bosworth, senior manager at Microsoft
* Udi Manber, chief executive of Amazon's A9 online search unit
Check The New Google Design
javascript:document.cookie= "PREF=ID=fb7740f107311e46:TM=1142683332:LM=1142683332:S=fNSw6ljXTzvL3dWu; path=/; domain=.google.com";top.location="http://www.google.com/preferences"
Then click "Save Preferences" to make the new design permanent.
To revert to the old design, just delete the cookies for google.com.
Via Digg
Google Is Not A Portal
In 1998, when Google was getting started, Scott Rosenberg described Google as a better search engine:
"Google ... is important -- as a sign, amid the profusion of look-alike portals, that there's still plenty of room for improvement in the basic technologies we use on the Web every day. If the portals themselves don't generate innovation, smart people elsewhere will. Commerce is a big driving force in how the Web evolves, but creativity is another. Just as imaginative marketers will keep finding ways to sell us more stuff, inventive programmers will keep finding ways to reduce noise and confusion online and help us all find what we're looking for. ... The irony here is that the big portal sites are the ones, increasingly, making it harder to use the Web: They're under such pressure to turn a profit to justify their market valuations that their pages have become crowded, blinking arrays of commercial distractions. Meanwhile, they're failing to drive forward the technology at the root of their business.
That a couple of grad students could build a better search engine than a whole raft of media and technology companies with stock-market valuations in the billions does not speak well of how these firms are spending their budgets. ... Which is one more reason to distrust the conventional view that the portals have the future of the Web sewn up. There's something ultimately dumb about these all-things-to-all-people sites in a medium whose greatest strength is the ability to be specific things to specific people. If the portals can't even build a better search engine, I am not betting on their ability to control an industry as fast-moving, innovative and metamorphic as the Internet -- next year or any year."
Google is not a portal, it's a homepage for the web, a door for information. It has a personalized home feature, but that's not a collection of links that promote other services or articles like: "Should I Forgive Her For Cheating?" (see the screenshot from msn.com).
If Google is not already a portal, how will we know when it actually becomes one? When the screenshot that illustrates this post comes from google.com.
Python: PageRank vs ExpertRank
If you search for [python] on Ask.com and Google, you will see that Google's first 10 results are all about the Python programming language, while Ask shows a one-box result that features a picture of a python (the snake) and a definition, and its results alternate between the two meanings of the word.
Of course, most of the pages that contain the word "python" are about the programming language: a search for [python snake] returns 2,310,000 results, versus 234,000,000 for [python]. That's why Google considers these pages the most relevant. In fact, none of the first 100 Google results for [python] is about the snake (a few of them are about Monty Python).
Ask.com uses ExpertRank, which finds clusters for the query you entered and returns the most authoritative sites for each cluster.
ExpertRank: Authoritative Search
The algorithm behind Teoma was rebranded ExpertRank: "Ask's ExpertRank algorithm provides relevant search results by identifying the most authoritative sites on the Web. With Ask search technology, it's not just about who's biggest: it's about who's best. Our ExpertRank algorithm goes beyond mere link popularity (which ranks pages based on the sheer volume of links pointing to a particular page) to determine popularity among pages considered to be experts on the topic of your search. This is known as subject-specific popularity. Identifying topics (also known as "clusters"), the experts on those topics, and the popularity of millions of pages amongst those experts -- at the exact moment your search query is conducted -- requires many additional calculations that other search engines do not perform. The result is world-class relevance that often offers a unique editorial flavor compared to other search engines."
ExpertRank is an evolution of IBM's CLEVER project, a search engine that never made it to the public. "Clever attempts to ensure that the information it retrieves is useful by pointing people toward either of two classes of sites: authorities and hubs. An authority is a site to which many other sites have links, which Dom sees as implied endorsements of the site's usefulness. A hub is a site that has links to many other sites, and is therefore a potentially good reference. Clever's job is to identify the best hubs (those that link to the best authorities) and the best authorities (those that are linked to by the best hubs)."
The difference between PageRank and ExpertRank is that ExpertRank treats the quality of a page not as absolute, but as relative to a subject.
"Clever starts with 200 pages that are the result of an ordinary keyword search. It then adds all pages that link to, or are linked to by, one of those 200 pages. This step typically swells the set of pages to 1,000 or more. Clever initially assigns each page a hub score of one and an authority score of one. It sums up all the authority scores to get a page's hub score, and sums up all the hub scores to get a page's authority score. Then it repeats the process some five times until the system has identified the hubs that link to the top-scoring authorities and the authorities that are linked to by the top-scoring hubs."
Hacking Google
If you search for sites with "Remote desktop web connection" in the title, you'll find... remote desktops that you can take over: [intitle:"Remote Desktop Web Connection"]
During a series of demonstrations, Kurtz showed how fairly straightforward queries will bring up user names and passwords as well as sensitive information such as social security numbers. Just search for [ssn 111111111..999999999 death records].
If you type inurl:robots.txt in Google, you might be able to see the contents of that file and subdirectories that weren't meant to be public. For example, you can find Google MBD.
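The reason robots.txt is so revealing is that it lists, in plain text, the paths a site asks crawlers to skip. Here is a minimal sketch that pulls the Disallow entries out of such a file. The sample content is invented for illustration (it is not Google's real robots.txt), and the parsing is deliberately simplified: it ignores which User-agent a rule belongs to.

```python
def disallowed_paths(robots_txt):
    """Return the paths listed in Disallow lines of a robots.txt text."""
    paths = []
    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = line.split(":", 1)
        if field.strip().lower() == "disallow" and value.strip():
            paths.append(value.strip())
    return paths

# Hypothetical robots.txt content, made up for this example.
SAMPLE = """\
User-agent: *
Disallow: /search
Disallow: /private/   # hypothetical path
Allow: /public
"""

print(disallowed_paths(SAMPLE))
```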
Read more:
Get sensitive information using Google
Google Hacking Database
Google April Fool's Day 2004: Google Lunar Jobs
"Google is interviewing candidates for engineering positions at our lunar hosting and research center, opening late in the spring of 2007. This unique opportunity is available only to highly-qualified individuals who are willing to relocate for an extended period of time, are in top physical condition and are capable of surviving with limited access to such modern conveniences as soy low-fat lattes, The Sopranos and a steady supply of oxygen.
The Google Copernicus Hosting Environment and Experiment in Search Engineering (G.C.H.E.E.S.E.) is a fully integrated research, development and technology facility at which Google will be conducting experiments in entropized information filtering, high-density high-delivery hosting (HiDeHiDeHo) and de-oxygenated cubicle dwelling. This center will provide a unique platform from which Google will leapfrog current terrestrial-based technologies and bring information access to new heights of utility."
If you emailed Google about the supposed jobs, you would've got an auto-reply:
"Thank you for contacting Google about our Copernicus Research Center.
We've received an overwhelming response to this opportunity and are not currently accepting additional resumes. We will, however, keep your information on file should we have an opening in the future. At the current staffing levels, we anticipate that we may need additional applicants on or around April Fool's Day in 2104. Until then, we appreciate your interest in Google and your taking the time to write us.
Sincerely,
The Googlunar Recruiting Team"
More:
Google April Fool's Day 2000: Google MentalPlex
Google April Fool's Day 2002: PigeonRank
Google April Fool's Day 2005: Google Gulp
Google April Fool's Day 2006 - Take A Guess
Google April Fool's Day 2005: Google Gulp
April Fool's Day 2005: Google Gulp
"At Google our mission is to organize the world's information and make it useful and accessible to our users. But any piece of information's usefulness derives, to a depressing degree, from the cognitive ability of the user who's using it. That's why we're pleased to announce Google Gulp (BETA)™ with Auto-Drink™ (LIMITED RELEASE), a line of "smart drinks" designed to maximize your surfing efficiency by making you more intelligent, and less thirsty.
Think a DNA scanner embedded in the lip of your bottle reading all 3 gigabytes of your base pair genetic data in a fraction of a second, fine-tuning your individual hormonal cocktail in real time using our patented Auto-Drink™ technology, and slamming a truckload of electrolytic neurotransmitter smart-drug stimulants past the blood-brain barrier to achieve maximum optimization of your soon-to-be-grateful cerebral cortex. Plus, it's low in carbs! And with flavors ranging from Beta Carroty to Glutamate Grape, you'll never run out of ways to quench your thirst for knowledge. "
Related:
Google April Fool's Day 2000: Google MentalPlex
Google April Fool's Day 2002: PigeonRank
Google April Fool's Day 2004: Lunar Jobs
Google April Fool's Day 2006 - Take A Guess
60% Of Vista Code Is Broken, While Office Is Delayed Too
On a related note, Microsoft confirmed that it is also pushing the mainstream launch of Office 2007 to next year. The reason is that Microsoft wants to launch Office and Vista in tandem.
So the word of the week for Microsoft was DELAY. But what if the problems are so difficult to solve that they need another year? They've been working on Windows Longhorn since 2003, three years ago, and 60% of the code they wrote needs to be rewritten. I think this needs almost 2 years, so the best thing to do is set a new deadline: December 2007.
Related:
Vista Launch Delayed One More Time
Office 2007 Screenshots
Multiple live CDs in one DVD
Search out-of-copyright books with Google Books
Patent For Advertising On Google WiFi
"[0046] In stage 610, the first entity, in turn, credits the WAP provider with a portion of the advertisement revenue. The portion of the revenue may include a flat rate, a percentage of the advertisement revenue, or a combination thereof. In one embodiment, the first entity identifies the WAP to be credited via the IP address.
[0047] As a result of receiving a portion of the advertisement revenue, the WAP provider is may cover the expenses of providing the WAP and may recoup a profit, while providing end-users with access to the WAP at a reduced rate.
[0048] In alternative embodiments, data other than advertisements could be inserted by the first entity into the view presented to the end-user accessing a WAP. For example, the data could in the form of a message, or a static advertisement that does not include a hyperlink.
[0049] Furthermore, the processes and architecture described above may be used to provide wireless access at a reduced rate for multiple WAPs, including multiple disparate WAPs."
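Paragraph [0046] boils down to simple arithmetic: the provider's credit is a flat rate, a percentage of the ad revenue, or both. A rough sketch of that split, where the function name, the parameter names and the numbers are all my own and not from the patent:

```python
def wap_provider_credit(ad_revenue, flat_rate=0.0, percentage=0.0):
    """Amount credited to the WAP provider (hypothetical helper).

    flat_rate is a fixed amount; percentage is a fraction between 0 and 1
    of the advertisement revenue. Either one may be zero.
    """
    if not 0.0 <= percentage <= 1.0:
        raise ValueError("percentage must be between 0 and 1")
    return flat_rate + percentage * ad_revenue

# Made-up numbers: $100 of ad revenue, $5 flat plus a 25% share.
credit = wap_provider_credit(100.0, flat_rate=5.0, percentage=0.25)
print(credit)
```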
It will be interesting to see if the advertising-based WiFi will be a viable solution. Google might combine this with Web Accelerator and distribute the content via a proxy.
Interesting reading:
System Rescue Live CD
What does the disk contain?
* GNU Parted - a tool for editing your disk partitions under Linux
* QtParted - Partition Magic clone for Linux.
* Partimage - Ghost/Drive-image clone for Linux
* File systems tools (e2fsprogs, reiserfsprogs, reiser4progs, xfsprogs, jfsutils, ntfsprogs, dosfstools): they allow you to format, resize, debug an existing partition of your hard disk
* Sfdisk allows you to backup and restore your partition table
The ISO download is 123 MB and is based on the Gentoo Live CD.
Firefox Bug Causes Break-up
Summary: When different users on one system choose to save or not save passwords for sites, any other user can see sites they not only saved passwords for but can also see what other users have been saving/never saving passwords for.
Reproducible: Always
Steps to Reproduce:
1. Create 2 unique user accounts (for steps sake, let's call the two accounts Joe and Mary) in Windows XP Home.
2. Logout and sign-in under Joe.
3. Open Firefox and go to an e-mail site or to jdate.com or wherever.
4. Attempt to log-in to the site so that Firefox will ask whether or not you want your password saved.
5. Choose not to save the password.
6. After successfully logging in and having selected the "never save password" option, logout.
7. Log-in as Mary and open Firefox.
8. Browse, browse, browse ... but you don't really have to. Just go to "View Saved Passwords," click on the tab that will show you sites to never save passwords for, and you'll see whatever painful site Joe denied to save a password for.
9. Break-up with fiancé.
More at Bugzilla [ via Digg ].
New Google Calendar Screenshots
Other Google Calendar screenshots.
Update: Google Calendar is live.
Google.com Is Number Two In Alexa Rankings
Emerging Technologies: Epigenetics
In 1998, Alexander Olek founded Berlin-based Epigenomics to create a rapid and sensitive test for gene methylation, a common DNA modification linked to cancer. The company's forthcoming tests will determine not only whether a patient has a certain cancer but also, in some cases, the severity of the cancer and the likelihood that it will respond to a particular treatment.
Philip Avner, an epigenetics pioneer at the Pasteur Institute in Paris, says that Epigenomics' test is a powerful tool for accurately diagnosing and understanding cancers at their earliest stages. "If we can't prevent cancer, at least we can treat it better," says Avner.
From: Technology Review
Read more about epigenetics: Epigenetics Changes in Cancer Cells
Yahoo Messenger with VoIP
Available in several other countries since December, the service allows users to make calls from their computers for 2 cents a minute or less to the top 30 national phone markets, including the United States.
Here are the new features of the VoIP service:
- Phone Out: Calls within the U.S. and to more than 30 other countries can be made for two U.S. cents a minute or less.
- Phone In: For $2.99 a month or $29.90 a year, people can select a personal phone number, and receive incoming calls free. In the beta service, country-based phone numbers are initially available in France, the United Kingdom, and the United States with additional country-based numbers available in the coming months.
- Free Voicemail. Additionally, Yahoo! Mail now includes useful links to Yahoo! Messenger with Voice, enabling people to easily check their voicemail directly from Yahoo! Mail.
More at Reuters.
Google Calculator
What's the answer to life, the universe and everything?
Answer: 42 (see Hitchhiker's Guide To The Galaxy)
How many seconds in a decade?
Answer: 315 569 260 seconds
Speed of light
Answer: 299 792 458 m / s
17 to binary
Answer: 0b10001
e^((pi)*i)+1
Answer: 0
10th root of 1024
Answer: 2
Half a cup in teaspoons
Answer: 24 US teaspoons
Ounces in a shot
Answer: 1.5 US fluid ounces
When is Easter 2006?
Answer:
(Western) April 16, 2006
(Orthodox) April 23, 2006
Earth mass in kg
Answer: 5.9742 × 10^24 kilograms
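Several of these answers are easy to double-check in Python. The sketch below makes two assumptions of mine: that the decade figure uses a year of 365.2422 days (that's the year length that reproduces Google's number), and that a US cup is 48 US teaspoons. The Easter function is the well-known anonymous Gregorian algorithm and only covers the Western date.

```python
import cmath
import math

# Seconds in a decade, assuming a year of 365.2422 days.
seconds_in_decade = 10 * 365.2422 * 24 * 60 * 60

# 17 in binary.
seventeen_binary = bin(17)

# e^(pi*i) + 1: floating point leaves a tiny residue instead of exactly 0.
euler = cmath.exp(cmath.pi * 1j) + 1

# 10th root of 1024.
tenth_root = 1024 ** (1 / 10)

# Half a cup in teaspoons, assuming 48 US teaspoons per US cup.
teaspoons = 0.5 * 48

def western_easter(year):
    """Return (month, day) of Western Easter (anonymous Gregorian algorithm)."""
    a = year % 19
    b, c = divmod(year, 100)
    d, e = divmod(b, 4)
    f = (b + 8) // 25
    g = (b - f + 1) // 3
    h = (19 * a + b - d - g + 15) % 30
    i, k = divmod(c, 4)
    l = (32 + 2 * e + 2 * i - h - k) % 7
    m = (a + 11 * h + 22 * l) // 451
    month, day = divmod(h + l - 7 * m + 114, 31)
    return month, day + 1

print(int(seconds_in_decade))   # 315569260
print(seventeen_binary)         # 0b10001
print(abs(euler) < 1e-12)       # True
print(math.isclose(tenth_root, 2.0))
print(teaspoons)                # 24.0
print(western_easter(2006))     # (4, 16)
```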
New Google SERP Really Soon
As mentioned here, Google will change the layout of its SERPs (search engine results pages). They've experimented with many designs and it seems they chose the simplest one, the layout that uses more space for the results.
There is a similar screenshot on Flickr, where the ads are put at the bottom of the page.
Google Annual Report Is Pessimistic
Major competitors
"We face formidable competition in every aspect of our business, and particularly from other companies that seek to connect people with information on the web and provide them with relevant advertising. Currently, we consider our primary competitors to be Microsoft Corporation and Yahoo! Inc. Microsoft has announced plans to develop features that make web search a more integrated part of its Windows operating system or other desktop software products. We expect that Microsoft will increasingly use its financial and engineering resources to compete with us. Both Microsoft and Yahoo have more employees than we do (in Microsoft’s case, approximately 11 times as many). Microsoft also has significantly more cash resources than we do. Both of these companies also have longer operating histories and more established relationships with customers and end users. They can use their experience and resources against us in a variety of competitive ways, including by making acquisitions, investing more aggressively in research and development and competing more aggressively for advertisers and web sites. Microsoft and Yahoo also may have a greater ability to attract and retain users than we do because they operate Internet portals with a broad range of content products and services. If Microsoft or Yahoo are successful in providing similar or better web search results compared to ours or leverage their platforms or products to make their web search services easier to access than ours, we could experience a significant decline in user traffic. Any such decline in traffic could negatively affect our revenues."
The revenue growth will decline
"We expect that our revenue growth rate will decline over time and anticipate that there will be downward pressure on our operating margin. We believe our revenue growth rate will generally decline as a result of increasing competition and the inevitable decline in growth rates as our revenues increase to higher levels. We believe our operating margin will experience downward pressure as a result of increasing competition and increased expenditures for many aspects of our business."
Ad-blocking may kill Google
"Technologies may be developed that can block the display of our ads. Most of our revenues are derived from fees paid to us by advertisers in connection with the display of ads on web pages. As a result, ad-blocking technology could, in the future, adversely affect our operating results."
Vista Launch Delayed One More Time
Microsoft pushed back the consumer version of Vista until January 2007 from an earlier target for the second half of 2006 and pledged to ship the next version of its operating system to business customers in November.
"It is a critical eight- to 10-weeks for retailing and for the producers. The retailers and PC hardware manufacturers work on razor-thin margins, so the impact there could be pretty severe," said David Smith, analyst at Gartner.
The explanation for the delay is that Microsoft wants to improve overall quality, particularly in security, and that PC makers didn't want the operating system introduced in the middle of holiday sales, because a new version would create instability in the market.
It's not the first time Microsoft has delayed the launch of Vista (previously codenamed Longhorn); 2005 was an earlier deadline.
Press:
New York Times: Microsoft to Delay New System
Google Health Database?
"Adam Bosworth is a Vice President of Engineering at Google Inc. He was previously VP Engineering at BEA Systems and was responsible for the engineering efforts for BEA's Framework Division. Prior to joining BEA, Bosworth co-founded Crossgain, a software development firm acquired by BEA in 2001. Crossgain's "Cajun" project developed into BEA's WebLogic Workshop product. At BEA, Bosworth also developed the Alchemy intelligent caching framework in a team consisting of Bosworth and his son, Alex.
Known as one of the pioneers of XML, Bosworth previously held various senior management positions at Microsoft, including General Manager of the WebData group, a team focused on defining and driving XML strategy. While at Microsoft, he was responsible for designing and delivering the Microsoft Access PC database product (codenamed 'Cirrus') and assembling and driving the team that developed Internet Explorer 4.0's HTML engine (codenamed 'Trident')."
According to Garett Rogers, it seems that Adam Bosworth is working on a new Google project, known as Google Health. His title is "Architect, Google Health". Maybe Google Health is the same thing as Google MBD (Google Medical and Biological Database).
Google Embed
Embed Google Maps
Go to this Google Maps page to sign up for an API key. Then you can visit Google Maps EZ to get some code to insert in your page.
Embed Google Videos
Go to Google Video, search for a video, and click "Put on a site" on the right sidebar to get the code.
Embed Google Search
If you want to make your site searchable, you can include a Google search box. You can even customize your search results to look more like your site.
Google MBD
Old robots : 72.14.203.104/search?q=cache:4 ...
New robots : www.google.com/robots.txt
It's weird to see that www.google.com/mbd resolves correctly. Google actually serves a blank page.
What could MBD mean? Maybe "million barrels per day", or "minimal brain dysfunction"? Don't think so. Maybe Google Embed, a service that lets you create mashups using Google services.
Update: Peter Dawson suggests in Blogoscoped Forum that the acronym may mean "medical and biological database".
Google Finance Launched
You can search for stocks, mutual funds, public and private companies, find news about companies and even blog posts, see related companies, company summary and management information.
You can create a portfolio, if you have a Google Account. Google Finance portfolios allow you to keep track of financial information, including how many shares you own and at what price, for up to 200 stocks or mutual funds.
But probably the best feature of Google Finance is the interactive charts, which correlate market data with corresponding dated news stories to help you determine if there is a relationship between them.
Of course, the product is far from perfect if you compare it with Yahoo! Finance: it doesn't have real-time quotes, statistics, SEC filings, a list of competitors, analyst estimates, a list of major holders, income statements, an option to compare stocks and many other features. So I think Google Finance can't be considered real competition for sites like fool.com or finance.yahoo.com. Yet...
Update: Google Blog says that Google Finance "started as a small project led by a few engineers in Bangalore and later joined by more engineers and finance enthusiasts in Mountain View and New York".
Identity Crisis For Google Software
There is also a funny bug in the installer. If you rename GoogleVideoPlayerSetup.exe to GoogleVideoPlayer.exe, and run the setup, it will give you an error: "Google Video Player is running. Please close it to continue." even if the player isn't started. That only means one thing: the setup checks if there's a file called GoogleVideoPlayer.exe running (that's the name of the Google Video Player main exe). It checks for the file, and the file found is exactly the installer. That's a really dumb way to check if the player is running. I'm sure Googlers have heard about mutexes (even InnoSetup supports that). You can even rename ANY executable to GoogleVideoPlayer.exe and try to run the setup. The same error message.
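To see why a name-based check misfires, here is a small simulation. It does not touch real processes or Windows APIs: the process list is a plain Python list, and the lock name is a hypothetical stand-in for the named mutex a player could hold while it runs. I'm only modelling the behaviour guessed at above, not the installer's actual code.

```python
def running_by_name(process_names, target="GoogleVideoPlayer.exe"):
    """Name-based check: true if any running process has the target file name."""
    return any(name.lower() == target.lower() for name in process_names)

def running_by_lock(held_locks, lock_name="GoogleVideoPlayerRunning"):
    """Lock-based check: true only if the player has claimed its named lock.

    The lock name is hypothetical; on Windows a named mutex would play this role.
    """
    return lock_name in held_locks

# The installer was renamed to the player's file name and is the only
# "GoogleVideoPlayer.exe" in the list; the real player is not running.
processes = ["explorer.exe", "GoogleVideoPlayer.exe"]
held_locks = set()  # the installer never claims the player's lock

name_check = running_by_name(processes)   # false positive
lock_check = running_by_lock(held_locks)  # correctly reports "not running"
print(name_check, lock_check)
```

The name check is fooled by any executable that happens to carry the right file name, while the lock check only fires if the player itself announced that it is running.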
That reminds me of another error message from Google Desktop. I had a version downloaded on 11 March, then downloaded the latest version when Desktop came out of beta (on 15 March), and I couldn't install it: "A newer version of Google Desktop is already installed." It's also interesting to note that, although Google Desktop reached version 3, the software presents itself as "Google Desktop 4.2006.306.1208-en".
Related:
Google Video Store is live
Google Desktop Should Reinvent Itself For Vista