Open main menu

CDOT Wiki β

Fall 2017 SPO600 Project

Revision as of 09:58, 6 December 2017 by Chris Tyler (talk | contribs)

Overview

The Fall 2017 SPO600 Project consists of:

  • Stage 1: Identifying and Benchmarking a Hash or Checksum Function in an Open Source Software Project
  • Stage 2: Implementing and Benchmarking an Optimization


Stage 1

1. Find an open source software package that includes a checksum or hash function or method that is implemented in a language that compiles to machine code (C, C++, or Assembler). This could be the Murmurhash, SHA family of hashes, or a similar function.

2. Benchmark the performance of the current hash implementation (I'll refer to hash and checksums both as hashes from now on, and functions and methods as functions). Be sure to isolate the hash performance from the performance of all other functions. Prove the repeatability of your observations.

3. Identify the strategy you will use in Stage 2 to attempt to optimize the hash function.

4. Report on your results to this point.


Stage 2

1. Implement your optimizations to the hash function.

2. Prove that the optimized code produces equivalent results to the original code.

3. Benchmark the performance changes to the hash function with your optimizations.

4. Report on your results.


Reporting on Results

  • Blog about your work frequently as you are performing it: the package you have selected, the quality of the code you've found, benchmarking challenges, and so forth.
  • Write a final blog post for each stage to summarize the results.
  • Write in detail, and include resources (such as test results, links to modified source code, and so forth) so that others can reproduce your results.
  • Be thorough, professional, and convincing in your reported results.
  • Include your personal reflections on the process and your learning.


Bonus

A bonus of 5% of the course mark is available if you do either one of these two additional steps:

  • Report on the total impact of your changes on the application. For example, you may have sped up the hash function by 100%, which will have a total impact of 1.5% on the application when running a particular workload.

or

  • Get your code changes accepted by the upstream open source community.


Due Dates =

  • Stage I is due by 11:59 pm on December 14, 2017.
  • Stage II is due by 11:59 on January 8, 2017.


FAQ

  • Q: What if I can't improve performance?
    • A: You may have selected your area of focus poorly, or be using the wrong approach to optimization. Try alternate approaches.
  • Q: What if, after trying alternate approaches to optimization, I still can't improve performance?
    • A: You can complete the project by proving that the code and build instructions, as provided by upstream, cannot be further optimized. Note that in most cases this is much harder to do that to optimize some portion of the code.