July 29, 2010
FTC in Charge of Net Ads? -- and Opt-In vs. Opt Out
Greetings. Word is that the U.S. FTC (Federal Trade Commission) is seriously toying with the concept of establishing some sort of Internet "do-not-track" list to ostensibly control Web ads that involve behavioral targeting and/or user tracking.
Outside of the fact that it's not entirely clear that the FTC has this authority per se, and in the maximal case would only have purview over U.S. sites (the Internet is international the last time I checked), the concept of trying to create such a list strikes me basically as undesirable and impractical -- for both policy and technical reasons.
I'll have much more to say about this later, but as a starting point let's consider these issues:
- How effective has the phone solicitation "do-not-call" list really been? In my experience and based on anecdotal evidence -- despite claims from some quarters of vastly reduced solicitations -- the reality is that the calls just keep on comin'. And the phone-based list deals with the comparatively simple target of phone numbers, not the complexities of Web sites.
- How do we actually define "tracking" and "ad targeting" in a rapidly evolving Internet environment? This is actually a very complicated matter.
- Are ads that are generally less targeted and more "scattershot" a net plus or minus for consumers?
- If ads lose significant value, what are the ramifications for the largely "free services" model that most Internet users have come to expect on the Web?
- How could a broad inter-site list of this sort be implemented without creating unacceptable privacy and security challenges carrying the potential for unintended negative consequences?
And so on.
My sense is that the concept of an Internet "do-not-track" list of the type under discussion represents largely the same sort of mostly (though not entirely) political posturing that was behind the telephone-based "do not call" concept, and that the practical issues and problems with such a plan for the Internet are vast.
At this juncture it might also be useful to mention again an important paper I first noted some months ago -- Opt-In Dystopias -- which explores in depth how seemingly obvious issues of "opt-in" vs. "opt-out" in reality can be far more complex and subtle than they might appear to be initially. This paper should be required reading for anyone interested in or involved with these issues.
More to come.
--Lauren--
July 28, 2010
ICANN Touts DNSSEC As Tool to Fight "Internet Criminals"
"ICANN said the DNSSEC would eventually allow Internet users to know 'with certainty' that they have been directed to the Web site they sought. 'This upgrade will help disrupt the plans of criminals around the world who hope to exploit this crucial part of the Internet infrastructure to steal from unsuspecting people,' ICANN President and CEO Rod Beckstrom said in a statement."
Greetings. While the implementation of DNSSEC is certainly important, and the avoidance of DNS cache poisoning attacks is clearly very useful, ICANN's "Dragnet-esque" pronouncements about fighting crime strike me as highly ironic.
The simple fact is that "Internet criminals" have a vast array of tools in their arsenal to misdirect users, and few of these depend on cache poisoning or DNS manipulation.
Much of the crime is enabled by the fundamental design of the domain name registry/registrars ecosystem, which enables crooks to easily create and abandon completely valid "disposable" domains that are only used for short periods of time and cannot be reasonable tracked to their owners.
In fact, through their plans to unleash vast numbers of new Top Level Domains (TLDs) on the Internet -- perhaps hundreds in the first year -- ICANN will only be increasing the confusion of consumers and providing fresh juice for criminal operations. Most Internet users aren't calling for new TLDs -- they mainly think in terms of dot-com and that's unlikely to change any time soon. The main push for new TLDs is from would-be registry operators and their registrar cohorts,
who see the promise of big bucks from the rush of purely defensive domain registrations that occur when every new TLD opens.
So as far as I'm concerned, ICANN isn't winning the "Joe Friday" crime-fighter award any time soon.
--Lauren--
July 26, 2010
Why I'm Using Google Buzz More and Twitter Less
Greetings. Without any conscious effort to change my patterns of applications usage, I've noticed of late that I'm using Google Buzz much more and Twitter considerably less -- this despite the fact that I'm (for now, anyway) following fewer people (and have far fewer followers) through Buzz as opposed to Twitter.
A bit of reflection reveals why Buzz seems increasingly useful, despite the perceived smaller user base.
In a word: Quality. This applies across a number of vectors.
First, and perhaps most obvious -- the 140 character Twitter message limit, supposedly related to SMS text message considerations in Twitter's original design, represents an increasing frustration. Perhaps there is good karma in learning how to become a better headline writer -- a skill that Twitter certainly tends to foster. However, getting beyond the fun of shouting headlines, and instead having some sort of intelligent discussion is extremely difficult within Twitter constraints.
By not imposing message length limits, Buzz avoids this class of problems. And its threaded structure, ability to directly display linked materials, granular privacy controls, and other features contribute to a far more "intelligent" user experience overall, capable of supporting genuine discussions in depth.
Buzz has been viewed by the public largely as a direct competitor to Twitter, but in reality they are significantly different types of applications. I would place Google Buzz somewhere between Twitter and a full-blown discussion forum message system -- but without the user interface baggage that often unnecessarily accompanies the latter -- and with a much cleaner e-mail notification system than either Twitter or most forums can currently offer.
The Google Buzz launch was famously embroiled in controversy over its initial default privacy settings related to contacts discovery (apparently the result of insufficient external testing pre-launch, in contrast to Google's usually robust external testing regimes). The related Buzz defaults were indeed somewhat problematic.
However, that being said, the possibility of associated potential problems related to those defaults were blown way out of proportion by some observers and the media at large, and to its credit Google moved within hours to alter the default behaviors in manners that completely mitigated any realistic concerns, however minor.
Unfortunately, many persons may have been scared off by the exaggerated reports of Buzz problems, and haven't yet come back to take a second look.
But it would be well worth their time to do so, especially in light of the continuing series of incremental fine tuning, new features, and other aspects of the evolving Buzz service that really do provide Buzz with far more usefulness overall than Twitter for day-to-day use.
It might be argued that the learning curve for Google Buzz is a bit steeper than for Twitter, but this is to be expected given Buzz's power and flexibility compared with Twitter. The intrinsic relationship between Buzz and Google Profiles -- judging from some e-mail queries that I receive -- may confuse some users initially, but this really should not be a significant ongoing problem for most persons, since Profiles are easy to create and can contain essentially as much -- or as little -- information as you wish.
No morals or dramatic wrap-ups today. Just a suggestion. If you've never used Google Buzz, give it a try. If you tried Buzz early on and stopped due to concerns about the launch or other issues, consider trying it again now.
My profile and current Buzz activity is open for public access.
Hope to see you there.
--Lauren--
July 21, 2010
Our Responsibilities on the Web That Never Forgets
Greetings. The New York Times has published an important article on the subject of the Net's long memory, and the impacts on reputations and other aspects of people's lives when previously posted materials exist essentially forever online.
Regular readers know (all too well!) my frequent comment that we must always assume that anything publicly posted on the Net may be permanent, despite attempts to expunge or ignore any particular data later on.
The Times article makes a number of good points, but also in my opinion downplays some key realities and confuses some important aspects of the issues in some cases.
The concept of passing laws to prevent employers from using publicly posted information (e.g. Facebook pages) in their employment decisions seems effectively meaningless. One way or another, those searches will be done and those pages seen, and if necessary some other reason will be cited for the decline of particular applicants.
The idea of self-destructing data implicitly assumes that all public copies of the public data in question are also deleted, and that all involved entities (in various countries) would even respond to data deletion requests or demands. While deleting data from the most widely seen repositories might reduce its impact for a time, the odds are that it could still be found on other servers and would eventually find its way back into primary search engines again at some point as Net site crawling proceeds.
The article seems to confuse the concept of deleting public data with (their example) Google's anonymizing of log data after a specified period of time. The former is explicitly public data, the latter explicitly private without copies in public view. The two cases are entirely dissimilar. That log data can be anonymized says nothing about the ability to effectively delete publicly available data.
It is disturbing that people are paying significant -- sometimes very large -- sums of money to third parties in an attempt to "game" Google search results to push "negative" links toward the back of results listings. I do continue to feel that some mechanism to help in situations of egregiously false information would be useful. Several years ago I discussed this -- in terms of concepts to think about, not a specific proposal -- in Extending Google Blacklists for Dispute Resolutions.
Decades ago, when the Internet (then ARPANET) was new, I started participating in and helping to develop the then novel concept of public discussion mailing lists, many of which were archived in one form or another even way back then, and the discussions from which in many cases remain online today. I still receive new comments responding to what I wrote on those lists so long ago that new generations of Internet users have discovered.
Yet I remember being conscious even at the time of the likely "permanence" of what I was writing. I distinctly recall saying to a colleague who inquired about this "new-fangled" mailing list stuff that it was useful and fun, but that he'd better assume that anything he sent to those lists might be around forever. He sort of chuckled at my suggestion.
And therein may be a key to these dilemmas. On one hand, individuals need to understand -- from a very young age indeed -- that (just like how you don't want to stick your hand into an open flame) what you post publicly (or even just to your "friends") may well be permanent, and that discretion is indeed the better part of valor in some situations on the Net.
This doesn't directly help in those cases where someone else posts damaging false information -- some sort of dispute resolution mechanisms as mentioned above may ultimately have some role to play in that respect.
But fundamentally, nobody puts a gun to your head and forces you to post personal goodies to Facebook or anywhere else. Peer pressure has always existed and has ruined many lives over time, but as adults the ultimate responsibility has to be our own, not just for ourselves but also for our children who are too young to understand the potentially lifetime ramifications of what they do and say online.
--Lauren--
July 20, 2010
Trusting Your Friends -- and Trusting the Cloud
Greetings. Internet "cloud"-based services, both for data storage and as computing resources, are expanding rapidly, and have become a flash point of controversy among some persons in the computer science and privacy fraternities.
On various discussion lists and forums, dialogues about the value and risks of "cloud computing" have devolved into name-calling and impassioned arguments about whether the term "cloud computing" itself is somehow misleading -- with suggestions that data storage services (where encryption is more easily applied by users) should be considered separately from remote computing services -- sometimes called "SaaS" (Software as a Service).
I'm more interested in issues than word wars, so for now (despite the related complaints that I'll receive) I will continue to refer to this entire area as "cloud computing" -- "the cloud" for short.
Some other time we can have a technical discussion of cloud computing's benefits and risks. But there are a couple of truths about the cloud that are in my opinion undeniable, and are too often lost amidst the forest of technical details.
Realize this: The future of computing and communications will increasingly be Internet cloud-based. There is no escaping this truth. The complexity of the services that will be demanded by persons around the world will increasingly be impractical to provide wholly through traditional locally-based resources.
Despite ever more encompassing attempts at automatic software updating regimes, many or most users' computers are in states of relatively poor (or even awful) security, and sport feeble or non-existent data backups, putting immense amounts of personal and business data at risk on users' local disks at any given time.
And to expect non-technical users to somehow manage these ever more complicated computing devices, even with the help of increasingly complex updating environments, is becoming about as nonsensical as requiring that everyone be their own auto mechanic.
That there are privacy and security challenges in the cloud is undeniable -- but research in these areas is proceeding rapidly and holds great promise. Laws that in some cases treat cloud-based user data as having fewer legal privacy protections than locally-based data are no longer tolerable and need to be harmonized so that user data gets the highest practicable level of legal privacy safeguards regardless of where that data resides at any given time.
But for some who dislike the cloud, no amount of technical and legal assurances will ever suffice, simply because they have a fundamental distrust of remote services -- "We never really know what's going on in the cloud!" they say.
And yet, do we really know everything going on in our local computers, even those of us who have spent our professional lives building these technologies?
In most cases, the answer is no. Unless we've written every line of code ourselves, or have compiled every program personally from source code that we've inspected (and presumably understood!) line by line, there is a leap of faith involved in everything we do on these machines.
For that matter, if you're of a conspiratorial bent, do you really know for sure what's going on in those CPU cores that run your computer? Have you inspected every line of microcode? Are you positive that something nefarious isn't going on deep within those busy chips??
More realistically, Ken Thompson -- co-creator of the UNIX Operating System itself -- noted in his 1984 paper Reflections on Trusting Trust, that you can't necessarily even depend on the compilers that you use being free of self-compiling malware and other subterfuge.
What this all boils down to in the end is -- to paraphrase Bob Dylan -- You Gotta Trust Somebody.
And in our modern world, you have to trust lots of somebodies at various levels or our entire technological civilization would simply grind to a halt.
We certainly depend on trust in our personal lives. Even though that trust may turn out to be misplaced in particular instances, this doesn't change the fact that trust is fundamental to getting virtually anything done in our modern world.
And trust isn't only a concept for individuals. Just as we trust our friends and lovers -- whose inner thoughts we can never truly know for sure -- we need to make decisions about trust related to technology as well.
The fact that we can't know everything about every aspect of cloud computing services is ultimately just another nuance of the same sort of necessarily incomplete information with which we make every other trust decision in our lives.
Ultimately, if you trust that a provider of cloud computing services is of good ethical standing, will defend your privacy rights against unreasonable intrusions, and provides services with a degree of security and reliability that you consider to be acceptable -- especially in contrast to what you can and do provide locally on your own machines, then an inability to personally inspect every aspect of operations in the cloud should not be an automatic deterrent to its use.
Technical and standards advances are making the cloud even more attractive. For example, Open Source cloud standards and efforts such as Google's Data Liberation Front provide increasing levels of transparency and data portability.
There are many factors to take into account when choosing cloud services -- just as there are in the process of making bosom buddies. There are no absolute guarantees -- there always risks in life, both today and tomorrow. But the various aspects of trust are key in both cases, and trust is possible without total knowledge of and control over the other parties involved.
Like love, trust makes the world go 'round.
--Lauren--
July 18, 2010
The Google Search "Secrets" in Plain View
Greetings. As I've mentioned previously, I tend to receive several hundred e-mails daily that relate to Google one way or another, many of which contain requests for advice regarding perceived or real Google-related issues. I try to help when I can.
And my concerns about what I consider to be significant shortcomings in Google's user communications structure -- especially when dealing with relatively unusual or serious problems -- are fairly well known.
But recent calls for regulatory oversight of Google Search are way off-base, and -- beyond the obvious First Amendment concerns -- threaten to undermine Google's efforts to provide the best possible natural (organic, algorithmic) search results via Google's continuing work to avoid distortions in or gaming of those results.
The fact that Google permits highly controversial search results to maintain algorithmically determined high rankings, even when it would be much easier from a public relations standpoint for Google to suppress those results, are another indication of Google's laudable efforts not to disrupt natural results rankings with manual alterations.
It's notable that in all of the countless cases where people have come to me (sometimes utterly convinced that Google has a vendetta against them) with complaints about their Web site "vanishing" from Google listings or not achieving the kind of result rankings that they felt were deserved, I've never once seen a case where any unfair or unreasonable actions by Google were actually in play in the situation.
In fact, in virtually all of these cases the problems have boiled down to one of two issues.
The first is that the sites have become contaminated with malware, often without the site owners' knowledge. This can result in Google quite reasonably flagging the sites as potentially dangerous to users, with resulting undesirable (but completely appropriate under the circumstances) effects. Even when site owners protest that their sites are clean, on closer inspection it turns out (in my experience) that their sites have been compromised in some manner.
An even more common case is that sites have not been organized in ways that make it possible for Google to effectively crawl their contents, or the sites include elements (often at the urging of unscrupulous SEO -- "Search Engine Optimization" -- firms), that violate the site guidelines Google has established to help avoid gaming of results to the detriment of Google's users overall.
There isn't anything intrinsically evil about SEO per se. In fact -- and here come those "secrets" in plain view that I promised -- Google has put major efforts into making available absolutely comprehensive resources and tools for webmasters, yet it appears that many or most Web site owners don't even realize that these exist.
Google's Webmaster Central is a universe of information, tools, video tutorials, and all manner of other resources that webmasters can use to better understand how Google crawls their sites, potential problems in sites; mechanisms to inform Google how sites are organized to enable efficient and complete crawling of text, video, and other formats; ways to gather metrics on how people discover sites; and so much more.
Over on YouTube, the Google Webmaster Central Channel contains hundreds of videos on related topics that should be of interest to anyone running a Web site, including many Q&A videos from Matt Cutts who heads up Google's "Webspam" team -- which is directly involved in these sorts of search quality issues (he's also a good guy -- I recommend paying attention to his suggestions!)
All of this isn't to assert that every problem anyone may have with Google will be solved via these Google resources -- nor to say that effective means to solving every other possible sort of Google-related problem necessarily even currently exists.
But for many common situations -- the kind where people may feel that Google is unfairly ranking their site -- or similar scenarios -- I believe that a reasoned analysis of the circumstances, especially in conjunction with the Google Webmaster Resources discussed above, will demonstrate that Google bends over backwards not only to keep their natural search results rankings as useful and honest as possible, but also that Google has worked very hard to explain how to optimize sites for best results -- and has provided tools to help make this as straightforward and painless as possible for webmasters.
--Lauren--
July 15, 2010
Truth via Captions: "What's Under Their Kilts?"
Greetings. A few weeks ago in Why Web Video Captioning Is So Important, I mused on the importance of captioning to Web videos, and emphasized why YouTube users should take advantage of various YouTube captioning tools (automated and manual) to create the best possible experience for their viewers.
What I didn't discuss then was a more basic issue -- how the ways that videos are captioned (or dubbed) can fundamentally alter how they are interpreted, even to the extent of completely changing intended meanings.
This was brought home to me very recently when I watched a broadcast copy of a film I had not viewed for many years, the delightful 1966 movie King of Hearts (Le Roi de Cœur). [Trailer]
Set near the end of World War I, the characters in the three involved armies each speak in their native languages. This creates a complicated dubbing/captioning scenario, since typically any audience would want at least two of the three languages translated.
Language translations are less than an exact science of course, especially when idiomatic phrases are involved. (That said, automated translation techniques, such as Google Translate, have become extremely useful indeed, and will only get better with time.)
When I first saw King of Hearts decades ago it was in a fully-dubbed form without subtitles, and used fake accents to try indicate which language the characters were speaking at any given moment. This led to specific plot elements that never quite made sense to me at the time.
Captioned versions of videos and films have -- in my opinion -- generally done a better job, though timing requirements in manually-captioned cases can sometimes result in "text simplifications" that might leave out words or entire phrases.
In the YouTube context, captions also open the ability to perform automated translations based on the captions themselves -- obviously of immense benefit.
But here's an interesting question -- what happens when captions (or dubbing) are used to fundamentally alter dialogue in a film, perhaps as a form of subtle censorship or worse?
Both dubbing and captions carry this risk, but the risk with dubbing seems far higher, since the underlying original dialogue tracks will not be heard for comparison by native speakers of the language.
Automated captions will virtually always be trustworthy in this sense. While there may be a significant error rate in automatic captions (especially in the presence of background noise or music), deliberate alteration of meaning is highly unlikely, and the original audio is still immediately available for comparison.
I haven't revealed "what's under their kilts" yet. The reference -- and the relationship to this entire discussion -- comes from a short segment of dialogue in King of Hearts itself that was the trigger for my pondering this topic today.
In a particular captioned scene (the actors are speaking French at this point), a number of Scottish soldiers are dancing a jig. A character in the film asks her companion, "What's under their kilts?" To which the companion -- after taking a quick peek -- replies, "Nothing!" Leading to the response from another character, "You mean everything!" A rather cute turn of phrase.
But when I saw this scene a couple of evenings ago, I couldn't remember ever having heard the "Everything!" response before. In fact, I recalled -- from many years ago -- an entirely different and rather odd line of dialogue entirely.
I dug out an old tape and discovered that I was correct. The ancient dubbed King of Hearts version in my collection had the character replying to the question, "What's under their kilts?" with the response I remembered: "Petticoats!" (making the following line, "You mean everything!" completely nonsensical).
Since this was a dub job, I couldn't hear the original French dialogue, and I can't effectively lip-read French (or any other language, for that matter).
I'll admit that a rather ham-fisted attempt at film sanitizing may not be a big deal in the scheme of things -- but it had me fooled for many years.
Still, these issues -- particularly the key potential to verify captions by inspection of the original audio -- may be particularly important (for example) in the context of sound bites with political ramifications, where unscrupulous parties might try to post materials with falsified translations aimed at particular target audiences.
While one might hope that Internet access to underlying source materials and references would tend to reduce such risks, the plain truth is that many persons will simply accept what they see or hear the first time around and never think to go digging on the Net for verification.
Just a little something to consider, especially if you're ever inclined to doubt the ever-growing importance of captions in our increasingly video-centric world.
--Lauren--
July 02, 2010
ICM Registry and Deceptive Dot-Ex-Ex-Ex Polls
Greetings. One of the techniques that ICM Registry has been using to try demonstrate public demand for a dot-ex-ex-ex top level domain (TLD) has been touting various "poll" results. Right now they're pushing a new CNN "poll" that seems to show an amazing 83% approval rate.
But wait a second. What are we actually talking about here? Turns out that the CNN "poll" wasn't a scientific poll at all -- merely a scientifically worthless "self-selected" online poll. Statistical value and meaning: virtually nil.
ICM also promotes other magazine and newspaper polls over the years that gave similar lopsided numbers. It's unclear from their statements whether any or all of those were also self-selected polls, but it appears quite possible. Way back in 2004, ICM hired Lombardo Consulting to poll 1000 people on the topic (1K is indeed a typical national scientific poll size), and got similar results.
But even aside from issues of self-selection, the key to polling is of course the nature of the questions. Remember How to Lie with Statistics? Still a classic ... [As always, I've substituted "dot-ex-ex-ex" below to avoid e-mail blocking problems]:
CNN: Do you think pornographic websites should have their own "dot-ex-ex-ex" domain?
Business Week: Should purveyors of porn get their own domain?
Huntington Herald Dispatch: Would creating dot-ex-ex-ex keep Internet users from accidentally stumbling upon porn sites?
And finally, Lombardo: If those who run the Internet could assist in preventing child pornography and make the Internet safer for children and families by creating a dot-ex-ex-ex Internet address, would you support this?
It's difficult to imagine a more intellectually dishonest set of questions. Leaving aside loaded words like "purveyors" -- the questions appear to obviously suggest that all targeted sites (Only professional? Also amateur? Just U.S. or worldwide?) would be somehow limited exclusively to the dot-ex-ex-ex domain.
And it appears (from what I can determine so far at least) that no significant mention was made of the fact that the proposal includes no mechanism to force such sites to only reside in dot-ex-ex-ex (via oppressive "domain ghettoization" legislation or the like) -- which would certainly be quite appropriately subject to immense litigation battles. I wonder how these poll participants (self-selected or not) would have responded if it was made very clear that dot-ex-ex-ex was in addition to existing (e.g. dot-com) domain names?
The Lombardo question seems the most egregious, making completely unsupported claims about making the Internet safer and assisting in the prevention of c-porn. The latter is particularly ludicrous because c-porn is already illegal and no legitimate sources for such materials exist on any site or in any TLD.
In other words, the polling data being promoted by ICM Registry is misleading and biased, therefore statistically worthless -- and the quintessential essence of unmitigated bull.
--Lauren--
Law Enforcement "Controls Access" to New Minneapolis Public Wi-Fi Network
Greetings. With much fanfare in mid-June, Minneapolis announced the activation of a free public Wi-Fi network, with 117 outdoor hotspots, "for use by residents and visitors alike."
Just one problem. At the apparently explicit "request" of law enforcement, you can't access the current system without first creating an account using a credit card!
We're told that, "[The] log-in process was requested by law enforcement officials because being able to log on to the Web anonymously presents security concerns."
One long-time expert on municipal wireless noted to me that the CEO of U.S. Internet (the firm operating Minneapolis' Wi-Fi system) claimed that federal law requires such a procedure. Say what? I've heard of no such law. Various public Wi-Fi systems require no log-in at all, and use of credit cards is normally restricted to systems that actually charge for access.
However, in Minneapolis, it appears to be "no credit card, no Wi-Fi" -- but if someone establishes an account in your name using a stolen credit card and then proceeds to do something nasty -- your hassle (or worse).
What's apparently going on in Minneapolis is a combination of the desire to enable the tracking of as much Internet activity as possible, and the time-honored tactic of CYA.
As I've noted in Why the New Federal "Trusted Internet Identity" Proposal is Such a Very Bad Idea and related essays, open access to the Internet is now under fire from a variety of government entities who want to be able to find out as much as possible about everything you do on the Internet, all of the time.
Efforts to predicate public Internet access on verifiable and easily trackable identification as a matter of course, should be strenuously resisted by all Internet users who care about their ability to routinely communicate as they choose without the threat of real-time or retrospective surveillance of their activities -- in an ever expanding circle of dubiously justified circumstances.
--Lauren--
June 30, 2010
Major Published Article on Microsoft vs. Google Privacy Issues
Greetings. Fairly recently, I was interviewed for an article to appear in Redmond magazine (The Independent Voice of the Microsoft IT Community). The topic of the article was presented to me essentially as "Who has bigger and scarier privacy problems, Microsoft or Google?"
Obviously, given the stated orientation of the publication, I couldn't help but anticipate that the finished piece might tend to skew toward the Microsoft-friendly side of the issues.
The resulting lengthy article, What Does Microsoft Know About You? has just appeared, and despite what I would characterize as an obvious and significant anti-Google bias in the overall piece, the article is still an interesting and worthwhile read, even if only to illustrate how these sometimes emotional issues are being played out in such publications and among such affinity groups.
But that's just my opinion. What's your take?
--Lauren--
Update (7:30 PM): OK gang, yeah, I did notice that the article (at least as currently displayed in the online version) includes a rather amusing typo given the topic of the piece. Out of deference to the author, I hadn't planned to mention the error, but with so many people sending me notes about it, that doesn't seem as practical now. Let's just assume that it was a genuine typo, not a Freudian slip!
June 29, 2010
Blaming Google and Android for Calling ID Spoofing
Greetings. An article on Slashdot today seems to blame Google and Android for the ease with which two Caller ID spoofing programs can manipulate Caller ID and gain illicit access to AT&T (and other) voicemail systems. It even attempts to draw in the (to my mind irrational) complaining about Google's accidental Wi-Fi payload data collection.
I've talked about CNID (Calling Number ID) spoofing various times before, but let's be really clear about this.
CNID spoofing is not the fault of Android or Google, any more than it's the fault of Time Warner or Comcast when users access Web-based CNID spoofing services. The fundamental problem is that the CNID system was never designed for an environment where, to use the vernacular, every Tom, Dick, and Harry has access to the underlying subsystems, a problem that has become much more serious with the rise of VoIP/SIP-based access mechanisms.
A rather comprehensive history of CNID spoofing [calleridspoofing.info] and related areas makes for useful reading. (This falls into the "it takes one to know one" category of Web sites, apparently.)
Google Voice, as an example of the correct approach, makes users explicitly aware of spoofing risks, and requires additional confirmation steps, if attempts are made to set up accounts without passcodes.
There are legitimate situations where manipulation of CNID data is completely reasonable. Services (like Google Voice, for example) may want to pass through calling number data so that called parties have accurate information regarding the origin numbers of callers. Businesses may want to send their main number as the CNID reference, not extension numbers, which may not even take incoming calls.
There are concerns that currently pending U.S. legislation to outlaw nefarious CNID manipulation might adversely affect legitimate uses. My belief is that it should be possible to craft wording in the final legislation that would protect such honest applications -- this is indeed important.
I do feel though that it is also important that U.S. federal law be on record that use of Caller ID spoofing for the purpose of intentionally falsifying the identity of a caller is generally unacceptable and so would normally be subject to appropriate legal sanctions.
--Lauren--
June 27, 2010
Why the New Federal "Trusted Internet Identity" Proposal is Such a Very Bad Idea
Greetings. Last Friday, in White House Proposes Vast Federal Internet Identity Scheme, I posted a brief thumbnail expressing my major concerns regarding the expansive federal Internet Trusted Identity proposal.
Here are a few details explaining why I'm taking such a negative view of this plan.
It's important to note that this entire proposal under discussion, at this stage, is of course nothing but smoke. It has no functional reality, other than as a (useful) starting point for further discussion. But when viewed in the context of other government-related efforts, trends, and statements, it is quite alarming nonetheless, and it's very difficult to overstate its potential for serious negative consequences. Though indeed, like the vision of Christmas Future provided to Ebenezer Scrooge, it's currently only a shadow of what might be, not of what must or necessarily will be.
Let's look at one of the "Envision It!" boxes in the plan as posted at the Department of Homeland Security:
An individual voluntarily requests a smart identity card from her home state. The individual chooses to use the card to authenticate herself for a variety of online services, including:
Credit card purchases,
Online banking,
Accessing electronic health care records,
Securely accessing her personal laptop computer,
Anonymously posting blog entries, and
Logging onto Internet email services using a pseudonym.
This is, by definition, a government-issued identity card. The plan appears to envision a user authenticating themselves for the purposes even of pseudonym-based or "anonymous" activities. We can call such a posting "anonymous" if we wish -- but if the user has already authenticated, we're then dependent on the "proper" behavior of all players to actually treat the following transactions in a truly anonymous manner.
And anonymous to what extent? Perhaps a blog comment would appear on the Web anonymously, but when the lawyers show up demanding to know who posted that critical comment -- something that's happening with increasing frequency even now -- I'll bet you dollars to donuts that the initial authentication records will be available through some means to unmask the poster, or to correlate pseudo-identities that users may prefer to use for different purposes and "roles" on the Net.
The goals behind such an all-encompassing identity regime seem clear. While it could indeed provide some improvements over existing authentication methods in financial transactions and the like, the cost to civil liberties could be very high indeed, because -- as I read the plan -- the end result would be a detailed record -- likely captured by upcoming government proposals for expansive Internet service data retention requirements -- that could be used to "unwind" (unmask) anonymity on demand.
As I noted in Saving Internet Anonymity -- The Struggle is Joined, the increasingly shrill calls to put every possible Internet transaction into government-accessible databases has become an ever louder drumbeat.
And I believe we can easily dismiss the term "voluntary" used in the proposal -- since there's every reason to believe that such authentication regimes would quickly become effectively mandatory -- due to various pressures and liability concerns that don't take a lot of imagination to understand. Identity "mission creep" is virtually a certainty, though the conflicts that this is likely to create in an international environment like the Internet are certainly interesting to contemplate.
History, both long past and recent, shows us very clearly that -- human nature being what it is -- governments on the whole can't be trusted to not abuse data about their citizens' activities. Such abuse will almost always evolve from what initially appears to be laudable motives of law enforcement and the public welfare, but could rapidly degenerate into totalitarian nightmares.
Even if you (appropriately) view our current and recent federal governments as essentially relatively benign, we've still seen many instances of unjustifiable and even illegal surveillance and Internet data abuse -- even in the absence of long-term data retention requirements of the sort now being contemplated.
And even with the best of intentions, firms who are the custodians of user data and identity info are at the mercy of the civil legal system, above-board government demands for data, and -- as we've seen -- "secret" government data demands as well.
What of future governments, who might not be as benign, but would have at their fingertips the vast Internet identity infrastructure being contemplated -- what will they do with that shiny bauble?
I'm all in favor of discussions about how the Internet industry can improve the security and validity of transactions that need strong authentication -- such as in the financial sector or when dealing with medical health records. But the sort of government-entangled identity structure being proposed by the White House in the current document is -- perhaps even to a very significant degree unintentionally and with genuinely good intentions -- a wolf in sheep's clothing with the potential to decimate civil liberties on and off the Net for generations to come.
--Lauren--
June 25, 2010
White House Proposes Vast Federal Internet Identity Scheme
Greetings. The White House has just released the draft of a rather chilling document -- tellingly hosted on Department of Homeland Security servers -- that proposes the creation of a vast, federally-led "Trusted identities in Cyberspace" infrastructure that would potentially reach into nearly every aspect of Internet use, from financial transactions to comments on blogs. The White House is seeking public comments on the proposal.
While touted as a voluntary public/private partnership toward universal Internet identities, it seems clear from an initial reading that such a scheme is a preemptive push toward what would eventually be a mandated Internet "driver's license" mentality of the sort I've been warning against (e.g. Saving Internet Anonymity -- The Struggle is Joined -- April/2010).
It is certainly true that there are some specific situations while using the Internet during which strong identity credentials are very useful, and various of the problem scenarios outlined by the White House draft are real to one degree or another. But Internet industries have been working effectively to develop systems, such as OpenID, that can address such concerns in a truly voluntary manner without government involvement or interference, and without requiring or coercing individuals into sharing identities across multiple sites against their wills.
Let me put it this way in brief for now. Attempts by the federal government -- or other governmental entities for that matter -- to usurp leadership roles in any aspect of Internet identity ecosystems should be politely but strongly rejected.
I will have much more to say about this in the future, but since many people were already asking me about the White House draft, I wanted to get this initial thumbnail analysis out the door as quickly as possible.
Frankly, the concept of the federal government taking their proposed role in this area, especially in today's political climate -- is so obviously unwise -- and perhaps potentially dangerous -- that it's not even a close call. This is especially true given the increasing calls from some in government for massive Internet data retention regimes that could easily be linked with such federally-coordinated Internet ID systems.
I am hosting a local PDF copy of the White House draft here: White House Identity Draft
You can also download the document from the Department of Homeland Security.
More to come.
--Lauren--
Blog Update (27 June 2010): Why the New Federal "Trusted Internet Identity" Proposal is Such a Very Bad Idea
ICANN Moves Forward with Dot-Ex-Ex-Ex, while ICM CEO Plans for Big Bucks and Censorship
Greetings. As predicted in ICANN Likely to Approve "Dot-Ex-Ex-Ex" Domain for Chumps!, it is now reported by AP that ICANN is moving ahead towards final approval of the dot-ex-ex-ex TLD (Top Level Domain).
Note in the AP piece how the fellow behind ICM Registry -- the proposed operator of dot-ex-ex-ex -- is already crowing about the bundles of money he believes he'll make, and claiming that he'll require registrants to include meta labeling on their Web pages, presumably so that they can be widely blocked. Wow, such an attractive deal for adult sites! Why not just pour nitric acid into your servers? The ultimate effect would likely be very similar.
I believe that if dot-ex-ex-ex is "finally" approved, we can expect that:
(a) Dot-ex-ex-ex blocking will become essentially a default condition for much of the Internet, enforced by governments, organizations, and many ISPs in various ways.
(b) Various governments will attempt to mandate that "adult entertainment" sites sign up in and/or move totally to dot-ex-ex-ex (likely what ICM is actually hoping for).
(c) A flurry of lawsuits on all sides will be forthcoming.
(d) Dot-ex-ex-ex will embolden legislative efforts to force other categories of inconvenient or "undesirable" Internet speech and materials into their own more easily blocked TLDs and labeling regimes.
The DNS (Domain Name System) has become a means to extort protective name registrations from sites who really have no desire to be involved in new TLDs, a mechanism for further confusing consumers (and incidentally, enhancing the value of dot-com), and now, an enabling mechanism for Internet censorship and "thought control."
By the way, why do I keep referring to the TLD under discussion as "dot-ex-ex-ex"? Because if I use its real label, e-mail filters will automatically reject my associated mailings at many sites -- which helps to demonstrate how utterly insane this entire situation has become.
--Lauren--
June 24, 2010
ICANN Likely to Approve "Dot-Ex-Ex-Ex" Domain for Chumps!
Greetings. Word is that tomorrow ICANN is likely to reverse itself yet again, and (under continuing lawsuit threats from the would-be TLD-operator who desperately wants to cash-in on this fiasco) unwisely approve a "dot-ex-ex-ex" top-level domain.
Every word that I wrote back in 2005 about this topic, in Open Letter: Why "Dot-Ex-Ex-Ex" is for Chumps still holds true, perhaps even more now half a decade later.
If ICANN moves as reported on this, it is bad news for everyone concerned about free speech and civil liberties on and off the Internet, regardless of how you feel about the sorts of enterprises being targeted by this new TLD.
--Lauren--
Blog Update (25 June 2010): ICANN Moves Forward with Dot-Ex-Ex-Ex, while ICM CEO Plans for Big Bucks and Censorship
"VidMe" Announces Private Video Sharing -- But Fails Big Time
Greetings. I've said it often -- once data is on the Internet, never assume that it can ever really be completely controlled or removed.
The latest example of this axiom in action is a new video service announced with much fanfare yesterday called VidMe.
Promoted as a sort of "privacy-enhanced" version of YouTube, the VidMe spiel is that they provide a video hosting service where you can control exactly who has access to your videos at any given time, revoke video playback access whenever you want, prevent downloading and forwarding of videos without your permission, and so on.
VidMe is attempting to tap into concerns regarding videos (potentially embarrassing, or otherwise where public viewing is not desired) that fall into the wrong hands or go unexpectedly and undesirably viral.
YouTube already provides three types of privacy control tiers (other than the default of public access) -- private videos, group shared videos, and the new (and very useful) "unlisted" video feature. VidMe takes this a step farther with per-user granularity in access controls, and reportedly implements some additional mechanisms to try make it harder for persons to access or save copies of videos without the owners' permissions.
Since VidMe is basically selling a promise of privacy, one would hope that it could actually provide the advertised abilities for owners to prevent unauthorized viewing or distribution of videos. This is especially important since VidMe apparently plans to eventually charge users to upload videos to the service, beyond a few free videos per account.
But VidMe has some significant problems, not the least of which being that they cannot deliver the level of video privacy and control that they seem to be promising -- not due to any technical limitations in their service per se, but rather because that kind of privacy control is essentially unattainable in the current public Internet environment.
The VidMe flash player seemed very slow to buffer and play with all browsers that I tested. It hung, crashed, and burned whenever I tried to play test VidMe videos under Google Chrome.
OK, that stuff almost certainly can be fixed. But a much bigger problem for the VidMe "control your videos' distribution" business model is that every single technique I tried to locally capture displayed VidMe videos was fully successful without any difficulty whatsoever.
Every video stream grabber utility that I executed was able to capture and locally store both video and audio from VidMe playback streams. There are some video sites that at least make this sort of stream capture more difficult -- VidMe isn't one of them.
And just for chuckles (since the results seemed preordained) I also easily captured VidMe playbacks using the freeware, CamStudio Open Source package, which quickly and neatly enables high frame rate, high-quality screen and audio grabs directly from display buffers -- no need to capture the actual data streams themselves.
In every case, in every test, I ended up with fine looking video copies, complete with audio tracks, that I could -- if I had wished -- post anywhere or forward to anyone without restrictions.
My real gripe here is with how VidMe is promoting their service, and the extent to which unsuspecting users who might not understand the technical realities of Internet video could be painfully surprised if they took VidMe's pitch at face value.
Common sense alone should remind us that if nothing else, anyone could aim an inexpensive digital camcorder at a computer display and capture a low-quality copy for distribution. And if VidMe wishes to assert that most people don't have the stream or display capture software that I used for testing, or wouldn't bother to use them, that's OK -- but at least dial back the promotional language that could easily mislead many persons into believing VidMe provided a level of video privacy and control that is simply impossible in the existing Internet ecosystem.
VidMe's fine-grained site playback access controls do have value in and of themselves, though I frankly have my doubts that their ultimate pay-to-upload plan is viable from a business standpoint.
But make no mistake about it -- videos played via VidMe, just like from every other video site on the Net, can be captured and redistributed without permission -- one way or another.
Love it or hate it -- that's just the way it is.
--Lauren--
June 21, 2010
Why Web Video Captioning Is So Important
Greetings. The New York Times has published a good article about Web video captioning, which provides me with a convenient hook to briefly discuss this very important topic.
I can't emphasize enough why captioning is so vital for the entire Web. It provides the critical link between text and the audio layer of video presentations, not just for the crucial purpose of serving the hearing-impaired community, but also to enable video content search for all users (e.g., to find videos in the main Google index based on narration or dialogue), and to enable associated automated language translations for everyone.
Google's work in the area of YouTube videos "auto-captioning" is particularly fascinating, in far more ways than I can discuss here right now.
If you have videos on YouTube, I urge you to explore the various captioning control and enhancement options that are now present in your YouTube account video controls. In particular, the Google-provided auto-captioned transcripts can be used as the basis to manually "clean up" errors in the automated captions for videos, much more rapidly than videos could normally be captioned manually from scratch.
As I understand the current situation, the caption texts from purely auto-captioned videos are not currently included in YouTube/Google search results due to the perceived auto-captioning error rate (though in many, even most cases, that rate seems to be quite low). So it's important at this stage to "clean up" the auto-captions on your videos yourself whenever possible, so that your video captions can be integrated into the search databases.
Please let me know if you're interested in more information regarding this area.
--Lauren--
June 18, 2010
Google's Wi-Fi Crucifixion, an Open Mike, and Public Is As Public Does
Greetings. As I noted recently in "Highly Illogical": The Hysteria Over Google's Wi-Fi Scanning, the unseemly and opportunistic attacks, lawsuits, and now perhaps even criminal prosecutions of Google over their accidental recording of unencrypted Wi-Fi payload data seem to call into question the overall rationality of our species.
After all, these were unencrypted transmissions being broadcast on public airwaves, and Google's accidental capturing of data snippets can hardly compare with the risks to those Wi-Fi owners of bad guys purposely collecting that data to actually use for evil purposes (but even then, we're only talking about data that wasn't protected above the Wi-Fi layer by mechanisms such as SSL/TLS).
On the other hand, it's obviously to be expected that Google's adversaries (including some governments with somewhat irrationally conflicted views over public vs. private data, imagery, etc.) would seize on any slip to try stake Google out for the wolves.
But ultimately, public is public. Information that is disseminated in unencrypted forms is always going to be vulnerable to purposeful or accidental interception, and the solution to this situation is encryption, not legislation.
I had an interesting personal incident occur recently that may be at least a bit illuminating in this "what is public?" discussion.
For several years, I've sometimes dictated the initial drafts of particularly long papers or reports into an inexpensive hand-held digital recorder. I blab my thoughts into this thing wherever I am, later dump the audio data files via USB, then run them through speech-to-text software (usually NaturallySpeaking) -- typically with highly satisfactory results.
Brief Aside: Speech recognition systems have long been one of my areas of interest. The availability of speech-to-text systems today always strikes me as a true science fiction concept brought to fruition. (Here's a one-minute video clip I threw together featuring two 20th century science fiction TV show concepts of "futuristic voice dictation" - from 1979's original Battlestar Galactica, and -- a bit more tongue-in-cheek -- from Star Trek in 1968.)
Anyway, one day a few months ago when I examined the automatically transcribed results of the last week's dictation dump, I was startled to find (sometimes garbled, sometimes intelligible) snippets of conversations in the resulting text, that had nothing to do with what I had dictated!
What the ...? I went back and listened to the original audio files, which I normally didn't do (my standard protocol is to simply upload the audio data files and later inspect the resulting text).
The problem's source was immediately apparent. There were other voices in the background of some recordings, that had been picked up in the vicinity of where I had been dictating in stores, fast food eateries, and so on. Many of these voices were clear enough that the speech software had tried -- often successfully to a considerable degree -- to transcribe these along with my intended verbiage. I deleted the original audio files as is my standard practice, and edited out the erroneously collected text snippets.
The "foreign" remarks were all pretty much meaningless bits and pieces -- a few words here and there -- but why had this suddenly occurred and how had it gone on for days unnoticed? After all, I've been using this recorder for years, often in public places, and it had never picked up anything from other conversations before.
I found the reason. On the back of the recorder is a tiny little flush switch, that I had never knowingly altered, that selected between high and low microphone sensitivity. I had always left it on the "low" setting, which caused the unit to effectively ignore all but my own voice. Somehow that switch had moved to the "high sensitivity" position, causing the unit to pull in surrounding voices as well as my own. There was no obvious indication of this, and I didn't even notice the switch since I carry the recorder in a small case.
You know where I'm going with this. The accidental recording of very short ambient background speech snippets doesn't represent a real risk to anyone, just as Google's accidental recording of unencrypted Wi-Fi payload snippets was an unfortunate oversight, not an evil plot.
We need to understand the fact that unless we take steps to protect what we consider to be "confidential data" in public spaces, that data is vulnerable to be overheard not only accidentally as in both of these cases described above, but also by bad actors who truly have nefarious goals -- and it's the latter group that we really need to be concerned about.
This holds true in the world of Wi-Fi, and in the more mundane environs of the local burger joint ordering queue.
Trying to treat public spaces as if they were somehow legislatively "private on demand" is ultimately a fool's game.
--Lauren--
June 14, 2010
My Upcoming Internet Issues Interview on China Radio International (CRI)
Greetings. I've been booked as part of panel to discuss Internet issues on China Radio International's Today show airing live this Wednesday, 16 June, from 1000-1100 Beijing Time (that's 1900-2000 Tuesday, 15 June, Pacific Daylight Time).
China Radio International (CRI) is a state-owned media network heard over-the-air in major Chinese cities, through various AM and FM broadcast stations around the world, over the Internet (live and podcast), and via shortwave, satellite, etc. (More info.)
While I don't yet have a complete list of topics for the show, I believe they will be of broad interest relating to the Internet in general, and to issues focused on China's rapidly expanding Internet operations in particular.
--Lauren--




