2017年4月8日土曜日

2-3. Previous Research

Those studies above clearly suggest that “Sentimental Analysis” contains a lot of aspects which have been developed in various fields. Additionally, it would be beneficial for us to systematize the whole elements which are introduced the previous part.
It must be safe to say that the direct ancestors of “Sentimental Analysis” are “Content Analysis”, “Statistics” and “Computer Science” because the new style uses every component those preceding research have established for a long time. If we show the relationship in a figure, that would be like Figure.5 below. The illustration explains how different territories of linguistic synergize with each other as well as how Sentimental Analysis is constructed academically.
 In “Foundations of behavioural research (1986)”, Kerlinger stated that “Content Analysis” is characterized as “a method of studying and analyzing communication in a systematic, objective, and quantitative manner for the purpose of measuring variables (Content Analysis A method, p.2).” On the other hand, “Quantitative Content Analysis” is a part of “Content Analysis” and it especially features the aspect of “Statistics” more to attain the objective. Zhang and Wildemuth introduced the three definitions for “Quantitative Content Analysis”;

• “a research method for the subjective interpretation of the content of text data through the systematic classification process of coding and identifying themes or patterns”
• “an approach of empirical, methodological controlled analysis of texts within their context of communication, following content analytic rules and step by step models, without rash quantification”
• “any qualitative data reduction and sense-making effort that takes a volume of qualitative material and attempts to identify core consistencies and meanings” (Zhang & Barbara, p. 1)

Moreover, they also indicate the strong tie between the two approaches saying; “The quantitative approach produces numbers that can be manipulated with various statistical methods (Zhang & Barbara, p. 2).” Therefore, it would be rational inference to formulate that the combination of “Content Analysis” and “Statistics” creates “Quantitative Content Analysis”. The same thing can be said about the rapport between “Content Analysis” and “Computer Science” which creates “Computer Content Analysis”. “Computer Content Analysis” is a computer-assisted evaluation method that treats text or its contents (Alexa, 1997, p. 5). This modern technics has been evolved largely due to the abundant electronic text and databases from large variety of sources which are available for researchers who dedicate their time and effort to mine the information for text structures. One notable trait that machine-readable text has is that it is recognizable by application so that many data the target materials have can easily be collected and sorted in order to analyze them. As stated in Alexa’s report in 1997, computer systems such as "General Inquirer" which is an IBM 7090 program system that was developed at Harvard in the spring of 1961 by Philip J. Stone and his colleague and "WORDS" proposed in 1969 by H.P. Iker and N.I. Harway contributed to the establishment of “Computer Content Analysis” (Alexa, 1997, p. 5). In addition to those software for massive “Content Analysis”, Alexa pointed out that DeWeese’s proposals and technics shown in his two theses during late 70’s (DeWeese, 1976 & 1977) may be regarded as path-breaking works of the inception of “Computer Content Analysis”. The progress above clearly suggests that “Computer Content Analysis” consists of “Content Analysis” and “Computer Content Analysis”. Though there might be some possibilities about the involvement of “Statistics” in the aseessment, it is a logical determination to omit “Statistics” from the components of “Computer Content Analysis” because the main purpose of the computerized approach focuses on the exploration of the text structures, rather than numerical facts or data.
The third academic territory given birth of as a result of theoretical fusion is “Statistical Machine Learning (a.k.a: Statistical learning theory)” which is defined as a unification of statistics and the computational sciences.  “Statistical Machine Learning” generally means the self-educated computer system for “the automated detection of meaningful patterns in data (Shai, 2014, 9. 7).” Recently, this kind of technology has widely used for creating various applications we use every day. For example, search engines represented by Google, Yahoo, MSN, and portal sites like that are made up based on this methodical approach. Furthermore, “Statistical Machine Learning” has a unique feature that can execute any complicated tasks by its ability to “learn” and “adapt”. Those capacity is able to be achieved only by “Statistical Machine Learning”. This is because that it is impossible for human engineers to program a system to deal with every assignment which needs an explicit and detailed specification.  Thus, the mechanics also plays a crucial role in Artificial Intelligence which performs operations analogous to learning and decision making in humans.  As the name suggests, “Statistical Machine Learning” has deep relationship with “Statistics”. In a book “An Introduction to Statistical Learning”, the authors say that “statistical learning (“Statistical Machine Learning”) has emerged as a new subfield in statistics, focused on supervised and unsupervised modeling and prediction (James, 2013).”As shown above, “Sentimental Analysis” is a trinity complex including the three basic elements; “Content Analysis”, “Statistics” and “Computer Science”. Moreover, it is hugely affected by the derivatives like “Quantitative Content Analysis”, “Computer Content Analysis” and “Statistical Machine Learning”. In other word, “Sentimental Analysis” represents a fruits of long-term research from various fields.

In summary, “Content Analysis”, “Statistics” and “Computer Science” are distinct stand-alone area respectively. Nevertheless, by cooperating with each other, they successfully invented different genres. As a result, the movement make it possible to emerge the three subsidiaries and one next-generation technology; “Quantitative Content Analysis”, “Computer Content Analysis”, “Statistical Machine Learning” and “Sentimental analysis”. As for the three subsidiaries, it might be difficult to set a boundary between them strictly because all of them are influenced by the three basic parent elements to some extent. However, in this thesis, it would be convenient to treat them as independent existences for the purpose to grasp the latent functions of “Sentimental Analysis” systematically.

0 件のコメント:

コメントを投稿

The Ouster of John Bolton probably means the some signals for the North Korea?

Some western media have reported that Trump's National Security Advisor John Bolton was fired. There are various opinions about this dec...