Semantic News Analysis and Prediction

Orell, Seth R.

dc.contributor.advisor	Ngu, Anne Hee Hiong
dc.contributor.author	Orell, Seth R.
dc.contributor.committeeMember	Gao, Byron
dc.contributor.committeeMember	Podorozhny, Rodion
dc.date.accessioned	2011-09-14T19:41:19Z
dc.date.available	2011-09-14T19:41:19Z
dc.date.issued	2011-08
dc.description.abstract	Active stock trading firms need quick analysis of financial news items. News affects markets. Predicting how a news article may move a stock’s price can give a trader an edge over competitors, and doing so requires automatic understanding of a news item’s semantics. Years of research on semantic Web Services have yielded a variety of techniques to discern or provide meaning beyond the basic WSDL syntax. I believe that this research into Web Service semantics has relevance in other fields, specifically the content analysis of news as it applies to markets. The purpose of the present study is to determine whether specific academic models of Web-based semantic analysis can be used to provide market price predictions. The study’s design allows for an objective measure of accuracy by comparing predictions against actual market changes. In the study, I explore current “Top-Down” Web service semantic analyzers and distill their various approaches into abstract concepts. I take a common approach, textual content matching, and apply it with and without synonym analysis (a form of spreading activation), with promising results. Using the securities in the Russell 1000 Index (chosen for market liquidity and activity), I collected corresponding news articles from Reuters for 8 months. For each article, I pulled one-minute snapshots of market data for the article’s publishing date and corresponding security. I then divided the news items into two groups: an in-sample learning set and an out-of-sample input set. The in-sample set of news provided “predictions” for price movement, which I could contrast against what each input item actually did in the market. Simple semantic analysis produced encouraging results, with a rate of return (profit) better than random for shorter hold durations (one to five minutes). A synonym-based strategy showed a stronger return for longer hold periods (thirty to forty-five minutes). Both strategies performed better than a random matching approach, which lost money for every hold duration. These results show potential for similar and broader market analysis using established academic models of semantic Web analysis.
dc.description.department	Computer Science
dc.format	Text
dc.format.extent	63 pages
dc.format.medium	1 file (.pdf)
dc.identifier.citation	Orell, S. R. (2011). Semantic news analysis and prediction (Unpublished thesis). Texas State University-San Marcos, San Marcos, Texas.
dc.identifier.uri	https://hdl.handle.net/10877/2533
dc.language.iso	en
dc.subject	semantic web
dc.subject	stock market
dc.subject	lucene
dc.subject	web service composition
dc.title	Semantic News Analysis and Prediction
dc.type	Thesis
thesis.degree.department	Computer Science
thesis.degree.discipline	Computer Science
thesis.degree.grantor	Texas State University-San Marcos
thesis.degree.level	Masters
thesis.degree.name	Master of Science

Files

Original bundle

Name: ORELL-THESIS.pdf
Size: 1.38 MB
Format: Adobe Portable Document Format

Collections

Graduate Theses and Dissertations