Week 3
09.26.2016
Added REGEX to Excel-Parser to allow scrubbing of Data.
Added functioning list comparison prototype to Java Program.
Searching and implementation of comparison algorithms for Excel.
REGEX: "In theoretical computer science and formal language theory, a regular expression
(sometimes called a rational expression) is a sequence of characters that define
a search pattern, mainly for use in pattern matching with strings, or string matching,
i.e. "find and replace"-like operations. The concept arose in the 1950s,
when the American mathematician Stephen Kleene formalized the description of a regular
language, and came into common use with the Unix text processing utilities ed, an editor,
and grep, a filter.
In modern usage, "regular expressions" are often distinguished from the derived,
but fundamentally distinct concepts of regex[pronunciation?] or regexp,
which no longer describe a regular language."
Source: https://en.wikipedia.org/wiki/Regular_expression
09.27.2016
Created Excel-Writer to write the output into an Excel File.
Excel-Algorithms do not give the desired result.
We got new data from SCHURTER for the segmentation of the medical market. As example, one of the sources looks like this:
We extract the data from this and the other websites and intergrate the data into an Excel sheet to use in the comparison algorithm.
09.28.2016
Java Excel-Writer fully functional giving the desired output.
Added the «Jaro-Winkler distance»-algorithm to the Excel-Macro for the comparison. Example of how it works
09.29.2016
Changed format of the medical market list to also include the proper segment per customer”.
Java List-Comparison needs to be rewritten to allow usage of better and faster algorithms as well as different Industry input.
Working on the Excel-Macro and trying to find an algorithm that can be used additionally to the “Jaro-Winkler Distance” algorithm.
GUI Design, without touching the enrichment part of SPOSDES Version 2.0.
Version 2.0 is running perfectly, so we building the functionalities from Version 3.0 around Version 2.0:
09.30.2016
The structure from the Java comparison algorithm for the segmentation in EclipseNeon.
Rewrote the entire Java-Program to work with the new medical market list format.
The Heart of the SPOSDES Segmentation Algorithm where the comparison of the medical market list with the POS-data list of SCHURTER happens: