[Linux-sohbet] reklamsiz google ??

---------

New Message Reply About this list Date view Thread view Subject view Author view Attachment view

From: Mustafa Akgul (akgul@Bilkent.EDU.TR)
Date: Wed 12 Jan 2005 - 20:06:13 EET


UNDP Biliism sayflarinda buldugum bir yazi.
UNDP@nin "gozlemei'nde ilginc baglantilar/dokumanlar var:
http://sdnhq.undp.org/observatory/

Saygilar
Mustafa Akgul
%%%%%%
The Register » Internet and Law » Digital Rights/Digital Wrongs »

Original URL: http://www.theregister.co.uk/2005/01/11/open_source_google_scraper/
An open source Google - without the ads
By Andrew Orlowski in San Francisco (andrew.orlowski at theregister.co.uk)
Published Tuesday 11th January 2005 09:44 GMT

With the hope of returning at least one corner of the web to its non-commercial roots, Google
watcher Daniel Brandt, who curates the NameBase archive, has released the source code to a
Google scraper. Brandt has been making an ad-free proxy available for two years using Google's
little known minimal "ie (http://www.google.com/ie)" interface. By using this proxy, users
bypass both Google's notorious "2038" cookie (that's when it expires
(http://www.google-watch.org/cgi-bin/cookie.htm)) and the text ads.

Brandt fully expects Google to throw legal and technical resources at him, but says he welcomes
the challenge if only to clarify copyright issues. Google took people's free stuff and made a
$50 billion business from it, he argues.
Click Here

"The commercialization of the web became possible only because tens of thousands of
noncommercial sites made the web interesting in the first place," he writes. "All search engines
should make a stable, bare-bones, ad-free, easy-to-scrape version of their results available for
those who want to set up nonprofit repeaters. Even if it cuts into their ad profits slightly,
there's no easier way to give back some of what they stole from us."

He explains in more detail in the source code: "Legally, Google probably has the right to block
anyone they want. And legally, we believe that as a tiny nonprofit with an interest in Google's
violations of privacy, we have the right to access Google's publicly-available data any way we
want. If you want to argue about copyright, then let's start with the fact that Google scrapes
billions of web pages and doesn't ask permission before making the cache copies available. Thiss
craping is used as a carrier for the ads that make Google stinkin' rich.

"Now that, in our opinion, is an interesting copyright issue. As this is written, Google has a
market cap of $55bn. This exceeds the market cap of General Motors and Ford combined. Google is
probably the single largest information resource on the planet, and they're getting rich off of
us. It's time for Google to give something back to the public sector."

The source code, which runs on Linux, asks the users only to use the program for non-commercial
purposes.

"We think it would be splendid if scraping Google for nonprofit purposes, and stripping out
their wretched advertising, was established someday as an acceptable, legal practice."

In the week since it launched, the source code has been downloaded about a hundred times a day
says Brandt.

Google would rather you licensed its beta Web API (http://www.google.com/apis/). However, as
Charles Ferguson writing in MIT Technology Review noted recently, the service is "laughably
limited" to 1,000 queries a day, and offers little functionality; Google has let the offering
languish.

You can find the code here (http://www.scroogle.org/zipdir/nbbw.zip) [ZIP archive, 16kb], an
explanation here (http://www.scroogle.org/gscrape.html) and try out the proxy here
(http://www.scroogle.org/cgi-bin/scraper.htm). ®
Related stories

Google exposes web surveillance cams
(http://www.theregister.co.uk/2005/01/08/web_surveillance_cams_open_to_all/)
Major flaw found in Google Desktop
(http://www.theregister.co.uk/2004/12/20/google_desktop_flaw/)
Google News' chief robot speaks out
(http://www.theregister.co.uk/2004/12/08/bharat_turing_test/)
Gates: PC will replace TV, TV will become a giant Google
(http://www.theregister.co.uk/2004/10/20/gates_interactive_tv_obsession/)
Google Desktop privacy branded 'unacceptable'
(http://www.theregister.co.uk/2004/10/15/google_desktop_privacy/)

© Copyright 2005
_______________________________________________
Linux-sohbet mailing list
Linux-sohbet@liste.linux.org.tr
http://liste.linux.org.tr/mailman/listinfo/linux-sohbet


New Message Reply About this list Date view Thread view Subject view Author view Attachment view

---------

Bu arsiv hypermail 2.1.2 tarafindan uretilmistir.