Under the Hood
All of the data we collected for this project was organized using XML markup, with an individual XML file for each video or article. We then processed these files with tools such as oXygen and eXist to extract the data and display it as the lists, charts, and graphs you see. Below are links to the various XML and XSLT files we created, as well as the queries we used to transform the data.
- Our Comment XML Files - This folder contains all our XML files for both the YouTube videos and NBC News articles, as well as all associated schemas and transformation files. Each XML file is structured the same way, with all pertinent information on the video or article and its comments contained within appropriately marked-up elements. If the files inside the folder do not display correctly in your browser when clicked, please right-click the link and select "Save Link As..."
- Comment Schema - The schema document that defines how the videos, articles, and their comments should be marked up. The schema allowed us to create a consistent XML structure across all files, with detailed explanations of what each element or attribute was meant to define. Some of the features we defined in our schema, which can be seen in bar graph form on the Comments Analysis section of the site, were the tone of the comments, obscenities, sentence structure, excessive punctuation, emotes, and acronyms. All were meant to give us a framework by which to judge both the emotional content and the quality of the grammatical structure of each comment.
- Project Comment Puller - The XSLT file we used to pull and format the comments for the Comments Source section of the site. This simple code took both the tone of the comment and the text inside the sentence element from the XML files and transformed them into color-coded, organized lists of comments that we placed beneath each video/article. If the file does not display correctly in your browser, please right-click the link and select "Save Link As..."
- Tone Bar Graph XSLT File - The XSLT file we used to pull the tone attribute from the comments in our XML files and create an SVG bar graph displaying the percentages of positive, negative, and neutral comments. If the file does not display correctly in your browser, please right-click the link and select "Save Link As..."
- Tone Bar Graph XSLT File (YouTube) - The XSLT file we used to pull the tone attribute from the comments in our YouTube XML files and create an SVG bar graph displaying the percentages of positive, negative, and neutral comments. If the file does not display correctly in your browser, please right-click the link and select "Save Link As..."
- Tone Bar Graph XSLT File (NBC) - The XSLT file we used to pull the tone attribute from the comments in our NBC XML files and create an SVG bar graph displaying the percentages of positive, negative, and neutral comments. If the file does not display correctly in your browser, please right-click the link and select "Save Link As..."
- Quality Bar Graph XSLT File - The XSLT file we used to pull the structure attribute from the sentence elements in our XML files and create an SVG bar graph displaying the percentages of well-structured versus poorly structured sentences across all comments, as well as for YouTube and NBC comments specifically. If the file does not display correctly in your browser, please right-click the link and select "Save Link As..."
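
For readers who would like a feel for what the tone bar graph transformation does without opening the XSLT, here is a minimal sketch of the same idea in Python. It is not our actual stylesheet. The sample data, the `video` and `comment` element names, the structure values, the colors, and the function names `tone_percentages` and `tone_bar_graph` are all assumptions made for illustration; only the tone attribute, the sentence element, and the structure attribute come from the descriptions above.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical sample data. The element names (video, comment) and the
# structure values are assumptions; the real files follow the project schema.
SAMPLE = """<video>
  <comment tone="positive"><sentence structure="good">Great report.</sentence></comment>
  <comment tone="negative"><sentence structure="poor">this is bad!!!</sentence></comment>
  <comment tone="negative"><sentence structure="good">I disagree.</sentence></comment>
  <comment tone="neutral"><sentence structure="good">Posted yesterday.</sentence></comment>
</video>"""

TONES = ("positive", "negative", "neutral")
# Illustrative colors, not necessarily the ones used on the site.
COLORS = {"positive": "green", "negative": "red", "neutral": "gray"}


def tone_percentages(xml_text):
    """Return the share of comments (in percent) for each tone."""
    root = ET.fromstring(xml_text)
    counts = Counter(c.get("tone") for c in root.iter("comment"))
    total = sum(counts[t] for t in TONES)
    if total == 0:
        return {t: 0.0 for t in TONES}
    return {t: 100.0 * counts[t] / total for t in TONES}


def tone_bar_graph(percentages, bar_width=40, gap=20, max_height=100):
    """Build an SVG string with one bar per tone, scaled to max_height."""
    width = gap + len(TONES) * (bar_width + gap)
    svg = ET.Element("svg", {
        "xmlns": "http://www.w3.org/2000/svg",
        "width": str(width),
        "height": str(max_height),
    })
    for i, tone in enumerate(TONES):
        height = percentages[tone] / 100.0 * max_height
        rect = ET.SubElement(svg, "rect", {
            "x": str(gap + i * (bar_width + gap)),
            "y": str(max_height - height),  # SVG y coordinates grow downward
            "width": str(bar_width),
            "height": str(height),
            "fill": COLORS[tone],
        })
        # A <title> child is shown as a tooltip by most browsers.
        ET.SubElement(rect, "title").text = f"{tone}: {percentages[tone]:.1f}%"
    return ET.tostring(svg, encoding="unicode")


if __name__ == "__main__":
    print(tone_bar_graph(tone_percentages(SAMPLE)))
```

The quality bar graph could be sketched the same way by counting the structure attribute on sentence elements instead of the tone attribute on comments.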