Difference between revisions of "The Real A Team"
Revision as of 20:24, 15 March 2016
Determining Author By Style Of Writing
A Team Members
- Adrian Sauvageot, All
- ...
Progress
Pre-Assignment
I decided to write a new program to test a theory I had been told.
A professor told me she believed that, by looking at how two papers were written, she could tell whether they were written by the same author. Further, she believed that a computer could tell whether two pieces of text were written by the same author by looking at how each was written.
I decided to create a program that would analyze two pieces of text to try to determine whether the same person wrote both.
I decided to look at:
- average words/sentence
- average word length
- average sentences/paragraph
- average commas/sentence
- average colons/paragraph
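As a rough illustration of the five averages listed above, here is a minimal sketch in Python. It is not the original program: the function name `style_features` is hypothetical, and it assumes that paragraphs are separated by blank lines, that sentences end at `.`, `!` or `?`, and that words are runs of letters and apostrophes.

```python
import re


def style_features(text):
    """Return the five style averages for a piece of text.

    Assumptions (not taken from the original program): paragraphs are
    separated by blank lines, sentences end at '.', '!' or '?', and
    words are runs of letters and apostrophes.
    """
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)

    def ratio(numerator, denominator):
        # Return 0.0 instead of dividing by zero on empty input.
        return numerator / denominator if denominator else 0.0

    return {
        "words_per_sentence": ratio(len(words), len(sentences)),
        "word_length": ratio(sum(len(w) for w in words), len(words)),
        "sentences_per_paragraph": ratio(len(sentences), len(paragraphs)),
        "commas_per_sentence": ratio(text.count(","), len(sentences)),
        "colons_per_paragraph": ratio(text.count(":"), len(paragraphs)),
    }
```

Calling `style_features` on each of two texts gives two dictionaries with the same keys, which can then be compared feature by feature. The simple sentence splitter will miscount abbreviations such as "Dr.", so a real tool would likely need something more careful.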
I then used these averages to calculate how different two pieces are from each other. If their writing styles are within 5% of each other, a threshold I chose myself, the program suggests that the two pieces were written by the same person; otherwise it suggests they were written by two different people.
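The text does not say exactly how the difference between two pieces is calculated, so the following Python sketch is only one plausible reading of the 5% rule. It assumes the difference is the mean relative difference across all features; the names `style_difference` and `same_author` are hypothetical.

```python
def style_difference(features_a, features_b):
    """Mean relative difference between two feature dictionaries.

    Assumption: each feature contributes |a - b| / max(|a|, |b|), and the
    result is the plain average over all features. For non-negative
    feature values the result lies between 0.0 and 1.0.
    """
    if not features_a:
        return 0.0
    total = 0.0
    for name in features_a:
        a, b = features_a[name], features_b[name]
        largest = max(abs(a), abs(b))
        # Two zero values are identical, so they add no difference.
        total += abs(a - b) / largest if largest else 0.0
    return total / len(features_a)


def same_author(features_a, features_b, threshold=0.05):
    """Suggest the same author when the styles differ by at most 5%."""
    return style_difference(features_a, features_b) <= threshold
```

For example, two texts averaging 10.0 and 9.8 words per sentence with an identical average word length differ by 1% under this measure, so `same_author` would suggest a single author. Weighting every feature equally is a design choice made for this sketch; some features may discriminate between authors better than others.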
To test this, I ran the program on work by Shakespeare, one of my friends, and myself.
The program successfully determined which author wrote each piece of text.