Big Data

Army intell wants to scan social media from 40 countries

The Army wants to be able to anonymously scan social media platforms and open-source information from up to 40 countries and in 66 languages, and perform big data analytics on huge data sets of the information in search of trends in political, military, economic and other areas. And it wants to be able to do it from a smartphone.

In a sources sought notice published at FedBizOpps.gov, the Army Intelligence and Security Command (INSCOM) said it “anticipates the need” for a non-attributable (anonymous) service that will collect and analyze the data, and allow INSCOM personnel to perform their own analyses using customized big data tools.

Intelligence agencies, like businesses and political campaigns, recognize the value of social media in tracking trends, public sentiment and the kind of emerging public uprisings that took place during the Arab Spring. Agencies from the Homeland Security Department to the Defense Advanced Research Projects Agency have looked to use social media analytics for signs of terrorism or as a conduit during emergencies. The challenges have included the size of the data and the fractured language used on the likes of Twitter and Facebook.

INSCOM’s notice said it’s looking for a tool that can perform analytics on data from a changeable list of up to 40 countries and follow up to 10 analytical themes, including “political, military, economic, social, infrastructure, and information systems of foreign states,” as well as perform sentiment analysis and predictive analytics. The service should be available to INSCOM’s entire enterprise via smartphones or tablets, giving access to users anywhere in the field, the notice said.

Other features the Army is looking for include:

  • Automated reporting interfaces with additional focused analytics as needed.
  • The ability to search foreign social media and open source information, conduct predictive analysis and sentiment analysis, and deliver situational awareness.
  • Node clustering of big data utilizing a distributed, scalable and portable file-system and a software framework that allows for data-intensive applications.
  • A non-intrusive framework for processing parallelizable problems across huge datasets using a large number of clusters on all ingested data sources.

In addition to supporting basic machine language translation for 66 languages, the tools also should support synonym translation for Pashto, Arabic and Urdu.

The languages INSCOM wants to translate: Afrikaans, Albanian, Arabic, Armenian, Azerbaijani, Basque, Belarusian, Bengali, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Filipino, Finnish, French, Galician, Georgian, German, Greek, Gujarati, Haitian Creole, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Irish, Italian, Japanese, Kannada, Khmer, Korean, Lao, Latin, Latvian, Lithuanian, Macedonian, Malay, Maltese, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tamil, Telugu, Thai, Turkish, Ukrainian, Urdu, Vietnamese, Welsh, Yiddish.

Responses are due by Dec. 23 via email to Courtney Webber at Courtney.j.webber.civ@mail.mil.  

 

Reader Comments

Wed, Dec 18, 2013 Andy

Why does a mil branch feel they need to undertake this kind of all sources effort? I assumed this was already being done by our intell organizations. If the toxic political climate is preventing them from undertaking necessary work, we are making a mistake with the current privacy panic in DC. We should not be overlooking any low hanging fruit such as what people post on sites as this speaks volumes about where communities and nations are headed.
