Friday, May 27, 2005

Peter Givler: Stupidity about copyright on a massive scale

So, Google wants to make every book in the world searchable, by digitizing them; and some publishers are mad:

At issue is whether Google Print for Libraries, the company's plan to digitize the collections of some of the country's major university libraries, infringes the copyrights of the authors of many books in those collections. . . .

In a letter to Google dated Friday, the details of which were first reported by BusinessWeek on Monday, Peter Givler, executive director of the press association, said that Google Print for Libraries "appears to involve systematic infringement of copyright on a massive scale."

I have one question for Peter Givler: How, exactly, do you think Google makes the web searchable?

Here's a hint: Google does not, contrary to what Givler may believe, have an infinite army of monkeys who click around in their web browsers looking for your phrase when you hit "Google Search". That would be pretty cool, but alas, infinite monkey armies are in short supply, and also monkeys can't read. Instead, Google has a copy of the entire web on its servers, indexed for fast retrieval based on content and link structure. Actually, Google keeps many, many copies of the entire web on its servers, just as every web search engine has done since hoary old Altavista. And almost all of this material is technically under copyright --- so does Peter Givler believe that Google's web index constitutes "systematic infringement of copyright on a massive scale"? If so, is he willing to urge all members of the Association of American University Presses to boycott all Web search engines, since, after all, it would be immoral and hypocritical for them to patronize a service that so flagrantly disrespects copyright?

I mean, if you use arithmetic similar to that employed by the RIAA/MPAA in similar circumstances, the aggregate value of all the infringing copies of the web on web search engines must be staggering --- hundreds of billions of dollars at least, per copy. In fact, if you added up the value of all the "infringing" copies made by web search providers, then it would probably exceed the total monetary value of everything ever produced by every member of the AAUP by several orders of magnitude.

But, of course, the RIAA/MPAA's arithmetic is nonsense, and so is Peter Givler's allegation. Sorry, but building a digital, searchable, full-text index of every book is no more illegal than building a web search engine; or, for that matter, building a library card catalog.

Wednesday, May 25, 2005

Spam and the Department of Justice

A recent Politech post, about the prospects of new spam legislation, notes that most existing spam already breaks at least one U.S. law. What's really needed is better enforcement of existing laws.

This past winter quarter, I helped organize a cross-disciplinary seminar (in the UW-CSE dept.) on technical and legal perspectives on software security. We got, among other things, a UW law prof and a US Attorney from the DOJ to come and give talks. (I'd tell you their names, but in light of the other crazy stuff I write on this blog, I suspect they'd rather not be associated with me on random Google searches.) One of the things that we learned was that state and local authorities don't have the jurisdictional rights to investigate and prosecute most computer crimes, and that federal law enforcement on all levels is incredibly underfunded given the amount of stuff that they need to enforce. Hence, enforcement of federal laws gets heavily triaged.

So, if you're an FBI agent or a US Attorney, you're basically going to go after bank robbers, drug dealers, child molesters, and terrorists... and, er, copyright infringers. Everybody else can wait in line. Whatever its aggregate cost to society, spamming is a "mere" crime of annoyance, and investigating it requires technical expertise that's in short supply in public sector law enforcement. Unless the Department of Justice gets lots more funding, and a real mandate comes down from on high to go after spammers, Congress can pass all the anti-spam laws it wants --- spammers aren't going to jail.*

From my own meager understanding of the current budgetary climate in Washington, that kind of budgetary boost for the DOJ, for anything besides anti-terrorism, isn't in the cards --- we're already deep in deficit, and tax cuts for the rich are more important than services for all.

* For strange historical reasons, the Secret Service is in charge of investigating certain kinds of computer crime, and they're now part of the Department of Homeland Security, but similar problems most likely apply.

Tuesday, May 24, 2005

Word of the Month: Catospheric!

For several years, aerospace consultant Joanne McNeil has had a harmless and rather quiet blog; I don't remember how or when it got onto my roll, but there it is. One of the minor features of her blog is a list of links to people who are, or once were, associated with the Cato Institute; this link list was humbly titled the "Catosphere".

Today my RSS reader delivered to me news that some clueless consultant sent her a nastygram because her use of "Catosphere" allegedly infringes on the service mark of some other service with the same name. Being more clueful than said consultant, and hating obnoxious intellectual property land grabs as much as I do, I did a bit of homework and responded. McNeil's usage appears to predate this consultant's use of the word "Catosphere" by at least six months.

In the words of Jay-Z, "I ain't passed the bar, but I know a little bit/Enough that you won't illegally search my shit", and one funny thing about U.S. service mark law is that the legal theory is based on protecting the market from confusion. Therefore, the strength of one's case w.r.t. protecting a right in a service mark is based, in part, on whether the target market understands the mark to refer to your product and not your competitor's. Now, "Catosphere" is a sufficiently rare string that, in all probability, summer 2004 would have turned up J. McNeil's blog as the top hit on search engines. So McNeil has at least one defense, on the grounds that prior to this consultant's appropriation of her mark, "Catosphere" arguably referred to the think tank and/or McNeil's link list, not the consultant's service, and it is the consultant who's sowing confusion and not McNeil. Indeed, even after significant self-promotion by this consultant, Google's rank for "Catosphere" still places McNeil in the top 20.

But we're not done yet. We can weaken the consultant's case even further by reinforcing the rightful and original usage of the word "Catosphere". The first step, of course, is to link to McNeil's blog using the word Catosphere as the link text.

The second step is to use "Catosphere" and variants thereof in a sentence; for example:

Third, we can sign up for, and use, del.ico.us to categorize a whole mess of links under catosphere.

Fourth, along the same lines, we can categorize all Cato or Cato-related links on our blogs under "Catosphere", using Technorati tags; for example:


UPDATE 28 July 2005: We interrupt this old post to bring you the following announcement: my original understanding of Technorati tags was totally incorrect. I should have read the spec more carefully. This post has been updated to reflect my new understanding.

The following code generates the above link:

<a href="http://technorati.com/tag/catosphere" rel="tag">Catosphere</a>

The rel="tag" part indicates that this link is a tag, and the href="http://technorati.com/tag/catosphere" part indicates that we're using the Technorati tag namespace with the tag name "catosphere".

Here's a few more random, non-tagged links (I'd do more, but I don't want to trip Google's spam detection):

[ Catosphere Catosphere Catosphere Catosphere ]

Lastly, we should point out that the proprietor of Catosphere.com is guilty of employing a link farmer, i.e. "search engine optimization" service, as this screen cap demonstrates. Normal web pages simply do not generate links like that. Well, Google hates automated link farming with a passion --- they view it as a form of spam, which it is --- so if anybody at Google is out there reading...

(As far as this post goes, I plead innocent of link farming, as I am creating a collection of links that add informative value to the Internet's link graph, as opposed to just promoting an individual entity.)

Expect to see a relatively large number of Cato-related posts in the next month.

Moral of Today's Story: unless you've got mountains of money to burn on lawyers, playing the whuffie economy will serve you better than cracking the whip of IP law.

Saturday, May 21, 2005

Coming soon: Remote inspection of your wallet

A few recent posts to the Politech email list comment on JP Morgan Chase's plan to embed RFID technology on credit cards, so that they can be used with contactless readers.

For those who don't know, the key elements of RFID technology are the chip and the reader. The chip or "tag" is a small electronic device containing a radio antenna and a small amount of circuitry, which can store some data (typically a unique ID number) and perform a tiny amount of computation. The reader is a combination of directional radio transmitter and antenna that fires radio waves and listens for responses from RFID tags. Passive RFID tags don't need a battery; they get power via induction in their antennae from the reader's radio transmission. RFID tags are durable and cheap (last I heard, less than 50 cents each), and, like all electronics, getting cheaper and more powerful all the time. Most consumers have probably seen RFID tags used as anti-theft tags, though they have many other uses.

RFID tags can be read without physical contact, although they often require that the tag be held in the proper orientation relative to the reader. The range at which they can be read by current technology varies from centimeters to several meters. You can increase read range by using tags with bigger antennas, and by increasing the reader's power output; but the physicists tell us that, for passive tags, the strength of the tag's reply drops as the fourth power of the distance between reader and tag, so it seems unlikely that you'll get much beyond tens of meters in the foreseeable future.

So, anyway, as I was saying, if many companies follow JP Morgan Chase's lead, and decide to embed RFID tags in their cards --- and many of them will --- then your wallet will be packed with RFID tags containing all kinds of interesting information about you. Anybody with a cleverly located reader (one Politech commenter suggests planting one in the seat of a mall bench) will be able to snag the entire contents of people's wallets, without the victims' ever noticing.

Fortunately, RFID readers can be foiled (ha, ha) by all the usual countermeasures that work against radio signals. If you line your wallet with aluminum foil, it will become impossible to read your cards until you take them out. Of course, that countermeasure has the downside that you will have become the kind of person who lines their wallet with aluminum foil.

Saturday, May 14, 2005

Gelernter's shameful editorial on public schools

Yglesias points to David Gelernter's shameful editorial on abolishing public education. This editorial's shameful in at least two ways.

First, he opens by citing statistics on how badly our public schools are doing --- "12% of graduating seniors were 'proficient'", and "[g]lobal rankings place our seniors 19th among 21 surveyed countries" --- but neglects to consider a glaringly obvious fact: all the countries that do better rely on public education systems. I can't access Pamela Winnick's Weekly Standard article, which Gelernter cites, but the US Dept. of Education's Trends in International Mathematics and Science Study results for 2003 may be revealing. Students in Singapore, Taiwan, and South Korea aren't going to charter schools.* But you shouldn't need recent statistics to realize this; a basic familiarity with history suffices to recognize that public education is one of the cornerstones of the modern nation-state, and common to all advanced industrialized economies. If some nations are doing better than America, then one should study their public education systems for features that could be adopted in our own, not scrap the entire system.

Second, and equally shamefully, the editorial pulls a huge rhetorical bait and switch. Gelernter opens the editorial complaining about the quality of public school education, but promptly switches to a claim that schools no longer "speak for the broad middle ground of American life" --- which, if you translate from bland, homiletic conserva-speak into the sole example that Gelernter gives, means that public schools teach tolerance towards gays.

Let's leave aside Gelernter's regrettable intolerance, and consider the bait-and-switch. If, as Gelernter claims, "public schools have a right to exist insofar as they express a shared public view of education", then educational quality does not matter. A public school that produces excellent educational outcomes but fails to "express a shared public view of education" still has no right, by Gelernter's argument, to exist. So complaining about the quality of public schools is just misdirection. But Gelernter's editorial would strike the average reader as far less convincing if he just stated this outright, so he must first engage in some rhetorical prestidigitation about educational quality.

So, then, what of Gelernter's argument? Well, basically, I think the language and logic of "rights" that Gelernter uses is pretty unhelpful when considering whether public schools should exist. "Rights" can be a useful framework for thinking about entities that, like human beings, are ends-in-themselves, to be protected from certain harms or guaranteed certain goods simply by virtue of what they are. But things like schools, armies, or corporations are not ends-in-themselves; they're instrumental, existing for some external purpose, and the proper way of justifying their existence is to consider their effectiveness at their mission relative to their costs.

Therefore, public schools should exist insofar as they produce better educational outcomes, on average and in the long run, than if they did not exist. All available evidence, across all known space and time (i.e., between nations, and through history), suggests that some form of universal education greatly benefits everybody, and that public education can provide universal education. Call this the public education theory, which merits that name because it has been thoroughly validated. Some have also suggested a hypothesis --- call this the private school choice hypothesis --- that private schools with vouchers or some other parental-choice mechanism would also lead to adequate universal education, and possibly some improvements over public schools. This hypothesis hasn't been evaluated, but it's probably worth experimenting with.

For what it's worth, in my opinion K-12 education in America is clearly a huge mess, but its problems stem from two fundamental facts that have nothing to do with the fact that it's government-run. First, unlike private schools, public schools attempt to educate everybody, including the students who don't care and the students whose parents don't care --- and in America, those two categories include a lot of students. Second, unlike private schools, public schools largely lack the funds to attract first-class people into the teaching profession. Any privatized school system that had to educate everybody, using the amount of money that K-12 education currently receives, would suffer from similar problems. These are, of course, only hypotheses, which is why it's probably worth experimenting with some form of school choice on a small scale.

p.s. One other, lesser quibble with Gelernter's piece: Gelernter cites the "1910" Encyclopedia Brittanica's claim that "The great mass of the American people are in entire agreement as to the principles which should control public education", and expects us to believe it. First of all, this is the kind of vague and sweeping generalization that anyone with an ounce of critical sense should regard skeptically. Furthermore, as any Wikipedian knows, 11th ed. of Brittanica, like every edition of Brittanica including the present one, was full of errors. Moreover, in the absence of modern polling techniques (Gallup's scientific polling revolution didn't happen until mid-century), how exactly was the author even to know what the "great mass of the American people" were in "entire agreement" about? Lastly, does Gelernter really believe that in 1911, when there were still living Civil War veterans and women were still struggling for suffrage, that there were no deeply divisive social issues? Or does he simply believe that no public school taught about these issues in a fashion that was offensive to somebody?


* Actually, IMO, based on stories I hear from my family, the "success" of students in East Asian countries can be attributed to cultural factors that would be incredibly hard to replicate here. And I put "success" in scare quotes because I actually don't think that the educational system of South Korea, for example, ultimately shapes minds properly.