Difference between revisions of "The Real A Team"

From CDOT Wiki
Jump to: navigation, search
(Progress)
(Project Name Goes here)
Line 1: Line 1:
 
{{GPU621/DPS921 Index | 20161}}
 
{{GPU621/DPS921 Index | 20161}}
= Project Name Goes here =
+
= Determining Author By Style Of Writing =
 
== A Team Members ==  
 
== A Team Members ==  
 
# [mailto:aasauvageot@myseneca.ca?subject=dps921 Adrian Sauvageot], All
 
# [mailto:aasauvageot@myseneca.ca?subject=dps921 Adrian Sauvageot], All
Line 21: Line 21:
 
# '''average commas/sentence'''
 
# '''average commas/sentence'''
 
# '''average colons/paragraph.'''
 
# '''average colons/paragraph.'''
 +
  
 
I then use this information to calculate how different two pieces are from each other. If they are within what I determined to be a 5% different writing style, I suggest the two pieces were written by the same person, otherwise I suggest they were written by two separate people.
 
I then use this information to calculate how different two pieces are from each other. If they are within what I determined to be a 5% different writing style, I suggest the two pieces were written by the same person, otherwise I suggest they were written by two separate people.
Line 29: Line 30:
  
  
=== Assignment 1 ===
+
=== Assignment ===
=== Assignment 2 ===
 
=== Assignment 3 ===
 

Revision as of 20:24, 15 March 2016


GPU621/DPS921 | Participants | Groups and Projects | Resources | Glossary

Determining Author By Style Of Writing

A Team Members

  1. Adrian Sauvageot, All
  2. ...

Email All

Progress

Pre-Assignment

I decided to create a new program to test a theory I was told.

I was told by a professor that she believed that by taking a look at how a paper was written, she could tell if it was written by the same author. Further, she believed that a computer could tell if two pieces of text were written by the same author by looking at how it was written.

I decided to create a program that would analyze two pieces of text to try and determine if the same person wrote both pieces.

I decided to look at:

  1. average words/sentence
  2. average word length
  3. average sentences/paragraph
  4. average commas/sentence
  5. average colons/paragraph.


I then use this information to calculate how different two pieces are from each other. If they are within what I determined to be a 5% different writing style, I suggest the two pieces were written by the same person, otherwise I suggest they were written by two separate people.

To test this I ran the program on work by Shakespeare, One of my friends, and myself.

The program successfully was able to determine which author wrote each piece of text.


Assignment