Google News: Positives, Negatives, and the Rest

16 Nov

Google News is the sixth most visited news site, according to Alexa Web Traffic Rankings. Given its popularity, it deserves closer attention.

What is Google News? Google News is a news aggregation service that scours around ten thousand news sources, categorizes the articles and ranks them. What sets Google News apart is that it is not monetized. It doesn’t feature ads. Nor does it have deals with publishers. The other distinguishing part is that it is run by software engineers rather than journalists.

Criticisms

1. Copyright: Some argue that the service infringes on copyrights.

2. Lost Revenue: Some argue that the service causes news sources to lose revenue.

3. Popular is not the same as important or diverse: Google News highlights popular stories and sources. In doing so, it likely exacerbates the already large gap between popular news stories and viewpoints and the rest. The criticism doesn’t quite ring true. Google News merely mimics the information (news) and economic topography of the real world, which also underpins the virtual world: better-funded sites tend to be more popular, and firms that are more successful in the real world may have better-produced sites and hence may, in turn, attract more traffic. It does, however, raise the question of whether Google can do better than merely mimic the topography of the world. There are, of course, multiple problems associated with any such venture, especially for Google, whose search algorithm is built around measuring the popularity and authority of sites. The key problem is that news itself is not immune to being a popularity contest, shepherded by ratings-driven (a euphemism for financial interests) news media. A look at the New York Times homepage, with its extensive selection of lifestyle articles, gives one an idea of the depth of the problem. So if Google were to venture out and produce a list of stories sorted by relevance to, say, policy (not that any such thing can be done), there is a good chance that an average user would find the news articles irrelevant. Of course, a user-determined topical selection of stories would probably be very useful for users. While numerous social scientists have cautioned against adopting the latter approach, arguing that it may lead to further atomization and a decline in sociotropism, I believe that their appeals are disingenuous, given that specialized interest in narrowly defined topics and interest in global news can flower together.

4. Transparency: Google News is not particularly transparent in the way it functions. Given the often abstruse and economically constrained processes that determine the content of newspapers, I don’t see why the Google News process is any less transparent. I believe the objection primarily stems from people’s discomfort with automated processes determining the order and selection of news items. That a process is automated doesn’t mean it isn’t an adaptive system built on criteria commonly used by editors across newsrooms. More importantly, Google News works off the editorial decisions made by organizations across the board, for it uses details like the placement and section of an article within the news site as pointers to the relative importance of the article. At this point, we may also want to deal with the question of accountability as it pertains to the veracity of news items. Given that Google News provides a variety of news sources, it automatically provides users with a way to check for inconsistencies within and between articles. In addition, Google News relies on the fact that in this day and age, some blogger will post a correction to a story on a “Google News source” site, of which there are over ten thousand, and that correction may in turn be featured within Google News.

Positives

Google News gives people the ability to mine a gargantuan number of news sources, pull up a list of news stories on the same “topic” (or event), and search for a particular topic quickly. One can envision that both the user looking for a diversity of news sources and the user looking for quick information on a particular topic would be interested in other related information on the topic. More substantively, Google News may want to collate information from its web, video, and image search, along with links to key organizations mentioned in the articles, and put them right next to the link to the story. For example, the BBC offers a related link to India’s country profile next to a story on India. Another way Google News can add value for its users is by leveraging the statistics it compiles on when and where news stories were published, for instance stories published in the last 24 or 48 hours. I would love to see a feature called the “state of news” that shows statistical trends on which news items are getting coverage, patterns of coverage, and so on (this endeavor would be similar to Google Trends).
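As a rough illustration of the kind of “state of news” statistic suggested above, here is a minimal Python sketch that counts stories per topic within a recent time window. The function name `coverage_counts`, the `(topic, published_at)` record shape, and the sample data are all assumptions made for illustration; nothing here reflects how Google News actually stores or counts stories.

```python
from collections import Counter
from datetime import datetime, timedelta


def coverage_counts(stories, now, window_hours=24):
    """Count stories per topic published within the last `window_hours`.

    `stories` is a list of (topic, published_at) pairs; this record shape
    is a hypothetical stand-in for whatever an aggregator really keeps.
    """
    cutoff = now - timedelta(hours=window_hours)
    return Counter(topic for topic, published in stories if cutoff <= published <= now)


# Made-up sample data, for illustration only.
now = datetime(2006, 11, 16, 12, 0)
stories = [
    ("elections", now - timedelta(hours=2)),
    ("elections", now - timedelta(hours=20)),
    ("elections", now - timedelta(hours=30)),
    ("trade", now - timedelta(hours=5)),
]

last_24 = coverage_counts(stories, now, window_hours=24)
last_48 = coverage_counts(stories, now, window_hours=48)
print(last_24)
print(last_48)
```

Comparing the two windows gives a crude trend signal: a topic whose 24-hour count is close to its 48-hour count is receiving most of its coverage right now.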

Diversity of News Stories

What do we mean by diversity, and what kind of diversity would users find most useful? Diversity can mean diverse locations (publishers or datelines), viewpoints (for or against an issue), depth (a quick summary or a long tome), medium (video, text, or audio), or type of news (reporting versus analysis). Of course, Google can circumvent all of these concerns by setting up parallel mechanisms for all the measures it deems important. For example, a map/Google News “mashup” can prove useful in highlighting where news is currently coming from. Going back to the topic of ensuring diversity, conceptual diversity is possibly the hardest to implement. There can be a multitude of angles on a story, not just binary for and against positions, and facets can quickly become unruly, indefensible, and unusable. For example, if Google News splits news stories by news source (say, liberal or conservative), people will argue over whether the right categories were chosen, or even over the labeling (social conservatives versus fiscal conservatives, for example). The same goes for splitting by organizations cited: there is a good chance that an article using statistics from the Heritage Foundation leans in a conservative direction, but that is hardly a rule. Still, I feel that these measures can prove helpful in at least mining for a diversity of articles on the same topic. One of the challenges of categorization is to come up with “natural” categories, that is, a categorization that is “intuitive” for people. Given the conceptual diversity and the related abstruseness, Google may want to avoid offering these as clickable categories to users, though it may want to use the categorization technique to display “diverse” stories. Similarly, more complex statistical measures can also prove useful in subcategorization, for example a statistical reference to the most common phrases or keywords, or even Amazon-like statistics on the relative difficulty of reading. Google News may also just want to list the organizations cited in the news article and leave the decision of categorization to users.
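To make the “most common phrases or keywords” idea concrete, here is a small Python sketch that counts the most frequent non-trivial words in an article. The stopword list, the `top_keywords` name, and the sample text are illustrative assumptions; a real system would need a far better tokenizer and phrase detection.

```python
import re
from collections import Counter

# A tiny, hypothetical stopword list; real lists are much longer.
STOPWORDS = {"the", "a", "an", "of", "in", "on", "and", "to", "is", "for", "by", "from"}


def top_keywords(text, n=3):
    """Return the n most frequent words, skipping stopwords and very short words."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS and len(w) > 2)
    return counts.most_common(n)


# Made-up sample article text.
article = "Trade talks stall. Trade ministers blame tariffs; tariffs rise as trade slows."
keywords = top_keywords(article, n=2)
print(keywords)
```

Keyword lists like this could be shown next to each story as a quick reference, leaving any judgment about categories to the reader.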

Beyond Non-Profit

Google News’ current “philanthropic” model (some may argue otherwise, viewing it as a publicity stunt) is fundamentally flawed, for it may restrict the money Google News needs to innovate and grow. Hence, it is important that it explores possible monetization opportunities. There are two possible ways to monetize Google News: developing a portal (like Yahoo!) and developing tools or services that it can charge for. While Google is already forging ahead with its portal model, it has yet to make appreciable progress in offering widely incorporable tools for its Google News service. There is a strong probability that news organizations would be interested in buying a product that displays “related news items” next to news articles. This is something that Technorati already does for blogs, but there is ample room both for additional players and for improving the quality of the content. It would be interesting to see a product that helps display Google News results along with Google image, blog, and video search results.

Comments Please! The Future Of Blog Comments

11 Nov

Often times the comments sections of blogging sites suffer from a multiplicity of problems – they are overrun by spam or by repeated entries of the same or similar point, continue endlessly, and are generally overcrowded with grammatical and spelling mistakes. Comments sections that were once seen as an unmitigated good are now seen as something irrelevant at best, and a substantial distraction at worst. Here, I discuss a few ways we can re-engineer commenting systems to mitigate some of the problems in the extant models, and possibly add value to them.

Comments are generally displayed in a chronological or reverse chronological order, which implies that, firstly, the comments are not arranged in any particular order of relevance and, secondly, that users just need to repost their comments to position them in the most favorable spot – the top or the bottom of the comment heap.

One way to “fix” this problem is by having a user-based rating system for comments. A variety of sites have implemented this feature with varying levels of success. The downside of using a rating system is that people don’t have to explain their vote for, or against, the comment. This occasionally leads to rating “spam”. The BBC circumvents this problem on its news forums by allowing users to browse comments either in chronological order or in the order of readers’ recommendations.
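The two browsing modes described above can be sketched in a few lines of Python. This is only an illustration under assumptions: the `Comment` fields, the `order_comments` function, and the sample comments are hypothetical, and this is not a description of how the BBC implements its forums.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Comment:
    author: str
    text: str
    posted_at: datetime
    recommendations: int = 0  # hypothetical count of reader recommendations


def order_comments(comments, by="time"):
    """Return comments sorted chronologically or by reader recommendations."""
    if by == "time":
        return sorted(comments, key=lambda c: c.posted_at)
    if by == "recommendations":
        # Most recommended first; ties fall back to the older comment.
        return sorted(comments, key=lambda c: (-c.recommendations, c.posted_at))
    raise ValueError(f"unknown ordering: {by!r}")


# Made-up sample comments.
comments = [
    Comment("ann", "First!", datetime(2006, 11, 11, 9, 0), 0),
    Comment("bob", "The second paragraph misstates the date.", datetime(2006, 11, 11, 9, 5), 7),
    Comment("cy", "A useful related link.", datetime(2006, 11, 11, 9, 10), 3),
]

by_time = [c.author for c in order_comments(comments, by="time")]
by_votes = [c.author for c in order_comments(comments, by="recommendations")]
print(by_time)
print(by_votes)
```

Because ordering by recommendation ignores posting time except as a tie-breaker, reposting a comment no longer moves it to a favorable spot.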

Another way we can make comments more useful is by creating message-board-like commenting systems that separate comments under mini-sections or “topics”. One can envision topics like “factual problems in XYZ” or “readers’ suggested additional resources and links” that users can file their comments under. This kind of system can help in two ways: by collating wisdom (analysis and information) around specific topical issues raised within the article, and by making it easier for users to navigate to the topic, or informational blurb, of their choice. This system can alternatively be implemented by allowing users to tag portions of the article in place, much like a bibliographic system that links relevant portions of the story to comments.

The above two approaches deal with ordering the comments but do nothing to address the problem of short, irrelevant, repetitive comments. These are often posted by the same user under one or multiple aliases. One way to address this issue would be to set a minimum word limit for comments. This will encourage users to put in a more considered response. Obviously, there is a danger of angering the user, leading to him/her adding a longer, more pointless comment or just giving up. On average, I believe that it will lead to an improvement in the quality of the comments. We may also want to consider developing algorithms that disallow repeated postings of the same comment by a user.
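A minimal sketch of both checks, the minimum word limit and the rejection of repeated postings, might look like the following Python. The class name `CommentFilter`, the default threshold of 15 words, and the normalization rule are all assumptions chosen for illustration.

```python
import re


def normalize(text):
    """Lowercase and drop punctuation so trivially edited reposts still match."""
    return " ".join(re.findall(r"[a-z0-9']+", text.lower()))


class CommentFilter:
    def __init__(self, min_words=15):  # 15 is an arbitrary, assumed threshold
        self.min_words = min_words
        self.seen = {}  # user -> set of normalized comments already accepted

    def check(self, user, text):
        """Return (accepted, reason) for a proposed comment."""
        normalized = normalize(text)
        if len(normalized.split()) < self.min_words:
            return False, "too short"
        posted = self.seen.setdefault(user, set())
        if normalized in posted:
            return False, "duplicate"
        posted.add(normalized)
        return True, "ok"


# Usage with a low threshold so the example stays short.
f = CommentFilter(min_words=5)
r1 = f.check("ann", "Great post!")
r2 = f.check("ann", "The figures in paragraph two look outdated.")
r3 = f.check("ann", "the figures in paragraph two look OUTDATED")
print(r1, r2, r3)
```

Note that this sketch only catches repeats from the same user name. Detecting the same person posting under multiple aliases would need other signals, which are beyond this example.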

The best way to realize the value of comments is to ask somebody, preferably the author of the article, to write a follow-up article that incorporates relevant comments. Ideally, the author will use this opportunity to acknowledge factual errors and analyze points raised in the comments. Hopefully, this follow-up piece will solicit more comments, and the process will repeat, helping to take discussion and analysis forward.

Another way to go about incorporating comments is to use a wiki-like system of comments to create a “counter article” or critique for each article. In fact, it would be wonderful to see a communally edited opinion piece that grows in stature as multiple views get presented, qualified, and edited. Wikipedia does implement something like this in the realm of information but to bring it to the realm of opinions would be interesting.

One key limitation of most current commenting systems on news and blog sites is that they only allow users to post textual responses. As blog and news publishing increasingly leverages the multimedia capabilities of the web, commenting systems will need to be developed that allow users to post their responses in any medium. This will once again present a challenge in categorizing and analyzing relevant comments, but I am sure novel methods, aside from tagging and rating, will eventually be developed to help with that.

The few ideas that I have mentioned above are meant to be seen as a beginning to the discussion on this topic and yes, comments would be really appreciated!

Making Comments More Useful

10 Nov

Often, the comments sections of blogging sites suffer from a multiplicity of problems – they are overrun by spam or by repeated entries of the same or similar point, continue endlessly, and are generally overcrowded with grammatical and spelling mistakes. Comments sections that were once seen as an unmitigated good are now seen as something irrelevant at best and a substantial distraction at worst. Below, I discuss a few ways we can re-engineer commenting systems to mitigate some of the problems in the extant models, and possibly add value to them.

Comments are generally displayed in chronological or reverse chronological order, which implies, firstly, that the comments are not arranged in any particular order of relevance and, secondly, that users just need to repost their comments to position them in the most favorable spot – the top or the bottom of the comment heap. One way to “fix” this problem is by using a user-based rating system for comments. A variety of sites have implemented this feature with varying levels of success. The downside of using a rating system is that people don’t have to explain their vote for or against the comment (Phillip Winn), which occasionally leads to rating “spam”. The BBC circumvents this problem on its news forums by allowing users to browse comments either in chronological order or in the order of readers’ recommendations.

Another way we can make comments more useful is by creating message-board-like commenting systems that separate comments under mini-sections or “topics”. One can envision topics like “factual problems in XYZ” or “readers’ suggested additional resources and links” that users can file their comments under. This kind of system can help in two ways: by collating wisdom (analysis and information) around specific topical issues raised within the article, and by making it easier for users to navigate to the topic or informational blurb of their choice. This system can alternatively be implemented by allowing users to tag portions of the article in place, much like a bibliographic system that hyperlinks relevant portions of the story to comments.

The above two approaches deal with ordering the comments but do nothing to address the problem of short, irrelevant, repetitive comments, often posted by the same user under one or multiple aliases. One way to address this issue would be to set a minimum word limit for comments. This will prod users to put in a more considered response. Obviously, there is a danger of angering the user, leading to him/her adding a longer, more pointless comment or just giving up, but on average, I believe that it will lead to an improvement in the quality of the comments. We may also want to consider coding in algorithms that disallow repeated postings of the same comment by a user.

The best way to realize the value of comments is to ask somebody, preferably the author of the article, to write a follow-up article that incorporates relevant comments. Ideally, the author will use this opportunity to acknowledge factual errors and analyze points raised in the comments. Hopefully, this follow-up piece will then solicit more comments, and the process will repeat, helping to take discussion and analysis forward.

Another way to go about incorporating comments is to use a wiki-like system of comments to create a “counter article” or critique for each article. In fact, it would be wonderful to see a communally edited opinion piece that grows in stature as multiple views get presented, qualified, and edited. Wikipedia does implement something like this in the realm of information but to bring it to the realm of opinions would be interesting.

One key limitation of most current commenting systems on news and blog sites is that they only allow users to post textual responses. As blog and news publishing increasingly leverages the multimedia capabilities of the web, commenting systems will need to be developed that allow users to post their responses in any medium. This will once again present a challenge in categorizing and analyzing relevant comments, but I am sure novel methods, aside from tagging and rating, will eventually be developed to help with that.

The few ideas that I have mentioned above are meant to be seen as a beginning to the discussion on this topic and yes, comments would be really appreciated.

Muslim Issues, Humanitarian Issues

4 Aug

The latest Lebanese crisis (I cringe at using the word crisis, for it seems news organizations use it all too frequently to condense all human suffering and all other news into this pointless, pithy label) has been covered in the Arab media as a predominantly Muslim affair in which a Jewish state is attacking Muslims. While the thrust of the statement remains true, the fact of the matter is that what is happening in Lebanon is a humanitarian crisis, a human tragedy if you will, and has little or nothing to do with the people there being Muslims or non-Muslims. The portrayal is all the more bankrupt given that Lebanon’s population is about 40% Christian. Kashmir, Chechnya, Palestine, Lebanon, and Bosnia are humanitarian crises and should be treated as such, not as Muslim crises, by the Arab media. There is a subtext in all the coverage in the Arab media that a Saudi resident or an Arab should feel more for the Lebanese than, say, someone sitting in the EU. There is subtle and not-too-subtle racism that accentuates the us vs. them schism that has opened up between the world and Islam as a whole. Mitigating reasons are offered, including that the Arab press is deliberately framing it as a Muslim issue to demand action from their ostensibly Muslim governments, but then again, I think that gives too much credit to the Arab media for a deep-rooted problem that shows its face in all major Muslim media from Indonesia to Pakistan.

Of course, the Western media can’t get off scot-free either. Western media outlets, eager to portray Hezbollah as a Shiite militia backed by Iran and eager to portray the Lebanese as a bunch of ‘enemy terrorists’, have overlooked the fact that “Hezbollah is principally neither a political party nor an Islamist militia. It is a broad movement that evolved in reaction to Israel’s invasion of Lebanon in June 1982” (NY Times).

Robert Pape, in his NY Times op-ed piece, adds:

“Evidence of the broad nature of Hezbollah’s resistance to Israeli occupation can be seen in the identity of its suicide attackers. Hezbollah conducted a broad campaign of suicide bombings against American, French and Israeli targets from 1982 to 1986. Altogether, these attacks, which included the infamous bombing of the Marine barracks in 1983, involved 41 suicide terrorists.

In writing my book on suicide attackers, I had researchers scour Lebanese sources to collect martyr videos, pictures, and testimonials and the biographies of the Hezbollah bombers. Of the 41, we identified the names, birthplaces and other personal data for 38. Shockingly, only eight were Islamic fundamentalists. Twenty-seven were from leftist political groups like the Lebanese Communist Party and the Arab Socialist Union. Three were Christians, including a female high-school teacher with a college degree. All were born in Lebanon.”