Rebecca Taylor

-----------------------------------------------------------------------------------------------------------------------------

-----------------------------------------------HONR269L--------------------------------------------------------------

-----------------------------------------------------------------------------------------------------------------------------

February 3rd, Sunday:

Samuel Howard and I decided on a meeting time with our professor for this research in IceCube, Erik Blaufuss. We will meet at 2:30 on Fridays.

February 6th, Wednesday:

Dr. Blaufuss told us to look at previous logbooks. I found that a lot of them used Python, and they made skymaps of where the neutrinos could have possibly come from, based on their point source searches. A lot of their work was confusing, since I do not know the technicalities of a lot of this yet.

February 7th, Thursday:

Sam and I discussed what we maybe wanted to do. He suggested something with machine learning, since he finds that really interesting, and he is a Computer Science major. I also find that interesting, although I do not know that much about it.

February 8th, Friday:

We met with Dr. Blaufuss for the first time. We discussed our ideas of a possible machine learning approach with him, and we talked about what we found in the previous logbooks. It seems like we have two options we could pursue: do similar work to previous years' teams, choosing a different part of the sky to search, or do a new type of work, with machine learning, in which it will be less known what to do.

If we were to pursue the machine learning approach, we would also have to decide if we wanted to use a Windows-based system or a Virtual Machine. Dr. Blaufuss said that Windows would probably be better, since it would be directly on our computers, but if we went with that it would be uncharted territory for all three of us.

February 11th, Monday:

Dr. Blaufuss emailed us with some papers to read before our next meeting.

One gave a general overview of how IceCube works: when charged particles travel through the ice faster than the speed of light in that material, they give off Cherenkov radiation. This is basically the equivalent of a sonic boom for light.

February 15th, Friday:

Sam and I decided that we wanted to pursue machine learning as opposed to the previous years' point source searches. We discussed this with Dr. Blaufuss, and he looked up online sites to get us started. Amazon Web Services (AWS) will be a good place to go if we want to do some cloud computing, or need more storage for data. He set up a classroom and requested access for the two of us to create an educate account there. In addition, he told us to make an account for Enthought Canopy and to get Scikit-learn.

Dr. Blaufuss suggested that we find papers on neutrino oscillations and neural networks.

February 20th, W:

I got an invitation today to join AWS. I had to confirm my email with them and create a password for my account.

February 25th, M:

We had to have a meeting on a Monday this week, because Dr. Blaufuss will be out of town on Friday. We talked about multiple types of cloud computing that we could use, mainly either Google Cloud or AWS. If we start to use this, we will have to decide which one to use.

We went over how to use AWS during the meeting. Dr. Blaufuss had tried it out some, and there are multiple resources and tutorials on there that assume you have a full account, as opposed to the Educate account that we have. So, he told us to add different Roles on the site so that we can use them. Later this week I will do this myself.

We also discussed beginning to use our computers for some machine learning. Dr. Blaufuss suggested using public census data; he will email us a link to the data and a tutorial on how to use machine learning with it.

Another thing we have to decide is how in-depth we want to go with our research. We can either go in depth in one thing--like boosted decision trees or cloud computing--or learn and use the basics of multiple things.

By next week, we should fully go through one analysis of data: training the computer with data, using it to predict aspects of something, and quantifying how accurately the computer was able to predict these. In the census data that we will use, the computer will predict the characteristics of people based on certain other demographics.

After we work with public data sets and get a feel for how it works, we can start using these skills and apply them to neutrino analysis.

March 1st, F:

I have started working on the tutorial Dr. Blaufuss sent us on how to train the computers to learn census data characteristics. We are using the Enthought Canopy program to do this on our computers, since the data files are not too large for our laptops to handle. The tutorial is found here: https://medium.com/district-data-labs/building-a-classifier-from-census-data-18f996c4d7cf and it is outlined with different subparts, which I will show.

Data Ingestion:

We created a function to download the census data from web files. At first I tried to do exactly the code on the tutorial, but I got this error:

So, I looked it up, and it turns out the page was written for an older version of Python. All I had to do was add a "b" to the "w" part so that it read "wb" (in Python 3 the downloaded content is bytes, so the file has to be opened in binary mode). This is the code:

--------

import os

import requests

CENSUS_DATASET = (
    "http://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.data",
    "http://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.names",
    "http://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.test",
)

def download_data(path='data', urls=CENSUS_DATASET):
    # Create the data directory if it does not exist yet
    if not os.path.exists(path):
        os.mkdir(path)

    # Download each file and save it under its original name
    for url in urls:
        response = requests.get(url)
        name = os.path.basename(url)
        # 'wb' because response.content is bytes
        with open(os.path.join(path, name), 'wb') as f:
            f.write(response.content)

download_data()

---------

Data Exploration:

In exploring the data, we used Pandas, and had to supply names for the header row to the computer, since the data did not contain that already. This is the code:

---------

import pandas as pd
import seaborn as sns

names = [
    'age',
    'workclass',
    'fnlwgt',
    'education',
    'education-num',
    'marital-status',
    'occupation',
    'relationship',
    'race',
    'sex',
    'capital-gain',
    'capital-loss',
    'hours-per-week',
    'native-country',
    'income',
]

data = pd.read_csv('data/adult.data', names=names)
data.head()

---------

Here's the output it gives us. The head command in the last line of the code tells it to show us the first five rows of the data, labeled with the header row that we supplied.

In this tutorial, we are trying to get the computer to predict the income of the people in the census. To help us visualize the actual data more, we can create plots that compare two characteristics. The orange bars show the number of people in that category with an income above 50 thousand dollars, and the blue bars show people with an income below that or equal to it. There is a simple code for this...

---------

sns.countplot(y='occupation', hue='income', data=data,)

---------

...in which we can change the "y" part to whatever characteristic we want to look at, and it will make the corresponding graph. Here's the output for occupation and race.

Later, I will continue this tutorial.

March 4th, M:

We had to meet with Dr. Blaufuss on a Monday again because he will be out of town starting tomorrow. In this meeting, we discussed a new idea that Dr. Blaufuss proposed for us to use machine learning for. At IceCube, they have an alert system that sends out a notification whenever a significant neutrino event occurs. The machine that sends out these alerts is trained to notify people when there is a cascade and a line happening right after one another in time. Usually, this is when an event occurs within the detector. Yet sometimes the machine sends out an alert when the two things are unrelated: the line does not come out of the cascade and is just another random particle. They then have to manually send out a second alert telling people to disregard the previous one. Using machine learning, we could try to train a machine to recognize the pattern in this, and to know when the line comes out of the cascade and when it does not. This would require a pattern-recognizing machine learning algorithm. It would also require both "good" samples--lines coming out of cascades--and "bad" samples--the false alarm alerts. We would need enough of these to train the machine, and it will be difficult for Dr. Blaufuss to find enough of the "bad" data samples if we were to pursue this.

In case we do decide to pursue this though, since it does seem interesting, Dr. Blaufuss told us to investigate different machine learning algorithms and what different approaches we could take to machine learning and classifying. Specifically for me, I plan to read into more of how machine learning actually works, since I do not fully grasp the fundamentals of that yet; I did not know much programming at all until this year. Dr. Blaufuss recommended that I just start with what I don't know (even if that's just the basics), figure it out, and go from there.

March 9th, Sa:

I watched a few videos on machine learning to check that I understood the concept thoroughly. I realized that machine learning comes in many different types and shows up in many places we don't realize, like Google Search giving us the best suggestions based on what we want, or the workings of self-driving cars.

I worked on the census data tutorial more. It does not actually take that long to do, but I am trying to really understand each step as I do it. After data exploration in the census tutorial comes data management.

Data Management:

We had to organize the data to be more readable by Scikit Learn. Using this code...

------

import json

meta = {
    'target_names': list(data.income.unique()),
    'feature_names': list(data.columns),
    'categorical_features': {
        column: list(data[column].unique())
        for column in data.columns
        if data[column].dtype == 'object'
    },
}

with open('data/meta.json', 'w') as f:
    json.dump(meta, f, indent=2)

------

...we made a meta.json file, which told the computer which part of the data was our target value. In this case, it's the people's incomes.

In the tutorial, the author's code reads a "readme" file, and I am not sure what it is for. This was their own file, though, and when I commented out the lines containing it, the code still worked. Here is the next code we used, with the readme parts commented out.

------

# Bunch lived in sklearn.datasets.base in the scikit-learn version we used;
# newer releases provide it as `from sklearn.utils import Bunch` instead
from sklearn.datasets.base import Bunch

def load_data(root='data'):
    # Load the meta data from the file
    with open(os.path.join(root, 'meta.json'), 'r') as f:
        meta = json.load(f)

    names = meta['feature_names']

    # Load the readme information
    #with open(os.path.join(root, 'README.md'), 'r') as f:
    #    readme = f.read()

    # Load the training and test data, skipping the bad row in the test data
    train = pd.read_csv(os.path.join(root, 'adult.data'), names=names)
    test = pd.read_csv(os.path.join(root, 'adult.test'), names=names, skiprows=1)

    # Remove the target from the categorical features
    meta['categorical_features'].pop('income')

    # Return the bunch with the appropriate data chunked apart
    return Bunch(
        data = train[names[:-1]],
        target = train[names[-1]],
        data_test = test[names[:-1]],
        target_test = test[names[-1]],
        target_names = meta['target_names'],
        feature_names = meta['feature_names'],
        categorical_features = meta['categorical_features'],
        #DESCR = readme,
    )

dataset = load_data()

------

I wasn't really sure what Bunch was, so I looked it up. It seems to be an object that acts like a dictionary, except that its entries can also be read as attributes, and it is used a lot with scikit-learn. It is a container object, used for datasets.

The code above basically splits the data into target and data variables. This makes them ready to be used by scikit-learn.
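To make the Bunch idea concrete, here is a minimal sketch with made-up values (not the census data). It assumes a recent scikit-learn, where Bunch is imported from sklearn.utils, and it is written for Python 3.

```python
from sklearn.utils import Bunch

# Hypothetical toy values, only to show how a Bunch behaves
toy = Bunch(data=[[25, 40], [52, 35]], target=['<=50K', '>50K'])

# Dictionary-style and attribute-style access return the same object
same_object = toy['target'] is toy.target

# Keys can be listed just like with a dictionary
keys = sorted(toy.keys())

print(same_object, keys)
```

This is why the tutorial code can later refer to things like dataset.data and dataset.target directly.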

March 11th, M:

I also realized that I do not really know what scikit-learn is either. So I went to the main scikit-learn website and tried to learn what it is. Scikit-learn is a library (like numpy and other libraries that we used first semester) specifically for Python. It is actually built ON numpy, scipy, and matplotlib, presumably using some of their code too. It is used for machine learning, since it includes code for analyzing and using data. It can classify data, predict the attributes of data (which is what happens at the end of the census tutorial), cluster similar data objects together, reduce the number of random variables, choose a good model based on comparisons with others, and transform text data into data ready for machine learning (which is what we did in the previous census tutorial step).

March 13th, W:

I started to look into more of the different types of machine learning, to see what our options are for our project.

March 18th, M:

Today I looked at the handwriting tutorial on Scikit Learn, found here: https://scikit-learn.org/stable/tutorial/basic/tutorial.html#introduction

At the beginning of this tutorial, I learned more about machine learning. There are two categories: supervised learning and unsupervised learning. In supervised learning, one part of the training data is the target data, which we want to predict in later data sets. Basically, you are trying to figure out how the input variable and the output variable are connected. Then, when we have new input data we will be able to predict the output data. Supervised learning also has two subgroups: regression and classification. Classification is when the output variable/target data is of a categorical form--like predicting whether something is going to be either blue or red. Regression is when the output variable/target data is a value--like predicting the exact income of somebody. The census data tutorial above is a classification problem because it does not predict their incomes, just whether they earn either more or less than 50k. With unsupervised learning, there is no correct answer that machines can learn from. Instead, they are supposed to find their own patterns and interesting aspects of the data.

Data sets are always in the form of a 2D array: each row is one sample and each column is one feature. The data has to be put in this form to be used by scikit-learn.
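As a small illustration of the 2D layout and of a classification problem, here is a sketch with hypothetical toy data (the values and the choice of a decision tree are only for illustration, and the code is written for Python 3).

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

# Hypothetical toy data: 4 samples (rows) with 2 features each (columns)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
# One categorical target per sample, so this is a classification problem
y = np.array(['blue', 'blue', 'red', 'red'])

clf = DecisionTreeClassifier(random_state=0)
clf.fit(X, y)

# New input data must have the same 2D layout: (n_samples, n_features)
new_samples = np.array([[1, 0], [0, 1]])
predictions = clf.predict(new_samples)
print(X.shape, list(predictions))
```

If the target were a number, such as an exact income, the same layout would be used with a regressor instead of a classifier.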

March 26th, Tu:

I decided to start meeting with Sam so that our work was more similar and I could ask him any questions I had, since he knows more about computing than I do.

Today we met and completed a task Dr. Blaufuss had wanted us to do in an email he sent on Sunday.

We are starting to apply our machine learning knowledge to the actual IceCube data. Dr. Blaufuss found some data that we can use, but it is very large and not accessible to the public. So, he wanted us to make sure we could read it with our Enthought Canopy first, and then we would figure out how to transfer the data from him to us. He gave us a set of around 5 or 6 data points that we could download and then make the computer read. He provided us with code we could use to accomplish this, once we downloaded the data:

--------

import cPickle as pickle

with (open("coinc.pkl", "rb")) as openfile:
    while True:
        try:
            event = pickle.load(openfile)
            ## do something with this event (a dictionary)
            print event
        except EOFError:
            break

--------

When I met with Sam, he had already run the code and gotten it to work. He had to download the file, make sure it was in the same place as his directory, and just change a minor detail in the code (which I will show later), and then it worked.

I downloaded the data, and moved it from my Downloads folder into my Users/Rebecca folder, since that is where my computing directory is located. However, when I copied the code into Canopy, it gave me unusual errors that Sam did not have. The first one said that I needed parentheses after the print function. Looking this up, we found that this is a requirement in Python 3. We realized that we were using two different Python versions--somehow I was using Python 3.5, and Sam was on 2.7. This was confusing, because it seemed to us that I should still have been using Python 2, since that was what I had selected in Canopy, as shown below.

However, it seems that this selection does not mean exactly what we had thought, and we are still not sure exactly what it means. So, we looked up how to change which version of Python to use in Canopy. Following the instructions there, it worked. In the main "Welcome to Canopy" window, I went to Edit, and then Preferences. Under the "Python" tab, I selected Create New Environment. From here, I named a new environment and selected the EDM Bundle file of Python 2.7 for it. Once I had done this, it took about half an hour to download. Then I restarted Canopy and chose the new environment from the drop-down list of EDM Environments. Once I had done all this, the code got the normal error that Sam said he had gotten. It was the error "could not convert string to float". Sam had previously looked this up, and found that we just need to change the rb in the code to solely r. Once I had done this, the code ran successfully.

--------

import cPickle as pickle

with (open("coinc.pkl", "r")) as openfile:
    while True:
        try:
            event = pickle.load(openfile)
            ## do something with this event (a dictionary)
            print event
        except EOFError:
            break

--------

The output of this code is a dictionary. It says {'Hits': all the data points, with an array of three numbers each--the string number, DOM number on the string, and time, 'Coincident': True or False}. There are 5 or 6 of these in the output.

March 29th, F:

Today we met with Dr. Blaufuss. In the meeting, we told him that we were able to read the small amount of data he sent us. He showed us that he had a lot more data for us--thousands of actual good events and around 400 coincident events, since these are rarer. In their IceCube computer environment, Dr. Blaufuss has a simulator that shows us the detection of particles in IceCube. Blue colors are later in time, while red colors are earlier. The size of the ball corresponds to the charge and energy of the event. Dr. Blaufuss will email us pictures of clear events and coincident events that we looked at during the meeting. In addition, he will add the larger sample of data to Dropbox, since it is too large to send via email, and we will begin applying different types of machine learning to this.

Also within the meeting, we discussed any questions we had about the data and other subjects. As stated earlier, there is a number in the data that corresponds to time. However, this number is usually around 10,000, so Sam and I had no idea what it meant. Dr. Blaufuss described how, because of the speed of neutrinos, scientists decided that looking at a window of 20,000 nanoseconds was a good amount. When the detector detects a significant event, the 0 value of this window is placed so that the event happens in the middle of the window. This way, they can look at what happened before and after the event. That is why a lot of the data was around 10,000.

Within some of the reconstructed event pictures, there is a line in the detector cube where no DOMs are lit. Dr. Blaufuss explained that this was because of dust within the ice, from a volcanic eruption. The dust absorbs more light, so it usually causes a gap in the data. However, sometimes the particles go close enough to a string that this light is still detected, since it is more intense right at the path of the particle.

We made plans for the upcoming week to start to use machine learning with the data that Dr. Blaufuss will send us. First we will do this without the time data, and later, once we are more skilled with what we are doing, we will try to add in the time data to see if that helps. Dr. Blaufuss will also email us the geometry of the IceCube strings and DOMs in case we think that will help the computer too, but he thinks that the computer can figure that out by itself.

March 30th, Sa:

These are the emailed pictures.

These two are notable, single events:

While these two are coincident events:

April 2nd, Tu:

Today Sam and I met together to begin working with the data that Dr. Blaufuss had sent us. Our plan is to base our machine learning off of the census data steps that we previously completed. Here is the code we have so far:

-------

# import cpickle for reading and writing data
import cPickle as pickle

# import libraries for handling data
import pandas as pd
import seaborn as sns

coinc_test = open("coinc_test.pkl", "r")
coinc_test1 = pickle.load(coinc_test)

print coinc_test1['Coincident']
print coinc_test1['Hits'][0][0]

-------

In order to use machine learning, we will have to convert the dictionaries, that our data came as, into pandas data structures. We thought about how to do this, and ended up finding a website that seemed straightforward and useful: https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.from_dict.html .

April 5th, F:

Today we met with Dr. Blaufuss, since it is Friday. It was a very short meeting this week, since we had not really done much that we could actually show him. We had mainly just acquired the data, learned how to read it, and planned our next steps. Dr. Blaufuss did add a little to our code, though. At first, we were only actually using one event, when we had wanted to use all of them. So Dr. Blaufuss added the line in the code that I will show later, in which .append was used.

April 8th, M:

Sam and I met again, to proceed more with the machine learning steps and discuss what we had done over the weekend. Sam shared code he had come up with that changed the data into a data array. He was able to accomplish this with code for the first event. Our plan with this code was to make a data array, full of 1s and 0s. The rows would be each event. The columns would correspond to each DOM. Since there are 86 strings and 60 DOMs on each string, we numbered the DOMs 1-5160, with 1-60 on string number 1, and so on. To make it simple, we are neglecting the time value for right now. A 1 in the array will mean that that certain DOM was hit. A 0 will mean it was not. Hopefully, once we tell the machine whether the events were significant or not, it can use this data to train itself.

We were able to successfully make the code have an array of 1s and 0s. Then, we wanted a column at the end to show whether each event was significant or not. So we put a boolean at the end that says that. We already know what it is, because the sets of data given to us were each labeled. The machine then should be able to use these labels accordingly, and print out its own once we test with the testing events.

Here is the end result. The actual array is very large, this is just a preview of the end of it:

This array of events has 394 events in total. There are 5161 columns because of the 5160 total DOMs, plus the boolean column at the end.

Here is the full code that we came up with:

With this code, we first made all the values in the array zero. Then we went through the data and put a one everywhere there was a hit. The data that was given to us was just the hit data, not the non-hits.
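A minimal sketch of this idea, not the exact code we ran: it assumes strings are numbered 1-86 and DOMs 1-60, and the helper names (dom_column, events_to_array) and the toy events are hypothetical. It is written for Python 3 with numpy.

```python
import numpy as np

N_STRINGS = 86          # IceCube strings
DOMS_PER_STRING = 60    # DOMs on each string
N_DOMS = N_STRINGS * DOMS_PER_STRING  # 5160

def dom_column(string, dom):
    """Map (string, dom), both counted from 1, to a column index 0..5159."""
    return (string - 1) * DOMS_PER_STRING + (dom - 1)

def events_to_array(events):
    """Build one row per event: 5160 hit flags plus the label in column 5160."""
    # Start with all zeros, then mark only the DOMs that were hit
    array = np.zeros((len(events), N_DOMS + 1), dtype=np.int8)
    for row, event in enumerate(events):
        for string, dom, _time in event['Hits']:
            array[row, dom_column(string, dom)] = 1
        array[row, N_DOMS] = int(event['Coincident'])
    return array

# Hypothetical events in the same dictionary layout as the pickled data
toy_events = [
    {'Hits': [[1, 1, 9800.0], [1, 2, 9950.0]], 'Coincident': False},
    {'Hits': [[86, 60, 10200.0]], 'Coincident': True},
]
hit_array = events_to_array(toy_events)
print(hit_array.shape)
```

Starting from zeros matters because the data only lists the DOMs that were hit, so every DOM that never appears in an event stays at 0.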

April 10th, W:

Today we transferred this code to work with all four data sets that we have. The above code is specific to the Coincident Test sample, the smallest one. So, using find and replace, we just changed all the names of files within the code to correspond to the Coincident Train, Single Test, and Single Train data (when we used find and replace coincident to single, we had to cancel the part that changed 'Coincident' to 'Singlecident'). This was successful.

However, when we tried to run the code for the Single Train data set, Sam's computer froze for two hours. It was too much for our personal laptops to handle. We are thinking of making the code better and more efficient, or using a GPU or computer that we could hopefully gain access to through IceCube and Dr. Blaufuss. For now, we just shrank the size of the data. We made the Single Train data set consist of only 2000 events, and the Single Test data set consist of only 1000 events. From this, we were able to run the code. We then pickled the data, so that we can access it later by unpickling it. So now all four data sets are in pickle files, each as an array.

April 15th, M:

We met with Dr. Blaufuss today and showed him our code. We also told him how the data was too big, and how we had shrunk it. He pointed out that we are making the computer hold all the events in its memory at once when it runs the code. Instead, we should make it go through each event and then move on to the next one, so that only one is in its memory at one time. That way, we will have enough memory to use all of the events in the data set. This is what Dr. Blaufuss did within his code, which we have in our large code in the part where it opens the IceCube dataset file (labeled by a commented line). It adds each of the events on once it is done, and then takes them out of its immediate memory.
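The one-event-at-a-time idea can be sketched with a generator. This is only an illustration under assumptions: the function name iter_events, the temporary file, and the toy events are hypothetical, and it is written for Python 3 (where the pickle file is opened with "rb").

```python
import os
import pickle
import tempfile

def iter_events(path):
    """Yield the pickled events in a file one at a time."""
    with open(path, 'rb') as openfile:
        while True:
            try:
                yield pickle.load(openfile)
            except EOFError:
                # No more events in the file
                break

# Write a few hypothetical events back-to-back into one file
toy_events = [
    {'Hits': [[1, 1, 9800.0]], 'Coincident': False},
    {'Hits': [[2, 5, 10100.0], [2, 6, 10150.0]], 'Coincident': True},
    {'Hits': [[86, 60, 10200.0]], 'Coincident': False},
]
with tempfile.NamedTemporaryFile(suffix='.pkl', delete=False) as tmp:
    for event in toy_events:
        pickle.dump(event, tmp)
    toy_path = tmp.name

# Only one event is held in memory at a time; we keep just a small summary
hit_counts = [len(event['Hits']) for event in iter_events(toy_path)]
n_coincident = sum(event['Coincident'] for event in iter_events(toy_path))
os.remove(toy_path)
print(hit_counts, n_coincident)
```

Each event is processed and then dropped before the next one is loaded, so memory use does not grow with the number of events in the file.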

April 16th, Tu:

We started applying machine learning to our files. Here is the code so far:

------------

# This file is for applying machine learning to IceCube Neutrino data

# import libraries for handling data
import pandas as pd
import numpy

# Obtain the DataFrames
coinc_test_events_frame = pd.read_pickle("coinc_test_events_frame.pkl")
coinc_train_events_frame = pd.read_pickle("coinc_train_events_frame.pkl")
single_test_events_frame = pd.read_pickle("single_test_events_frame.pkl")
single_train_events_frame = pd.read_pickle("single_train_events_frame.pkl")

#print coinc_test_events_frame
#print coinc_train_events_frame
#print single_test_events_frame
#print single_train_events_frame

#######################################
#
# Model Build
#

# Import DummyClassifier from scikit-learn
from sklearn.dummy import DummyClassifier

# Create a Dummy Classifier
clf = DummyClassifier()

# untouched versions (.copy() so that .pop below does not change them too)
untouched_coinc_train_events_frame = coinc_train_events_frame.copy()
untouched_single_train_events_frame = single_train_events_frame.copy()
untouched_coinc_test_events_frame = coinc_test_events_frame.copy()
untouched_single_test_events_frame = single_test_events_frame.copy()

# single and coinc targets (column 5160 holds the Coincident boolean)
coinc_target = coinc_train_events_frame.pop(5160)
single_target = single_train_events_frame.pop(5160)

# entire target and data
train_events_frame = single_train_events_frame.append(coinc_train_events_frame)
target = single_target.append(coinc_target)

# train the clf classifier using .fit(x, y)
clf.fit(train_events_frame, target)

# entire test frame
test_events_frame = single_test_events_frame.append(coinc_test_events_frame)
y_true = test_events_frame.pop(5160)
y_pred = clf.predict(test_events_frame)

#print y_pred

# Classification Report
from sklearn.metrics import classification_report

# execute classification report
print "Classification Report"
print classification_report(y_true, y_pred)

################################

# Import MLPClassifier from scikit-learn
from sklearn.neural_network import MLPClassifier

# Create a MLP classifier
#clf = MLPClassifier(activation="logistic", solver ="adam", learning_rate = "adaptive")
mlpclf = MLPClassifier()

# train the mlpclf classifier using .fit(x, y) as described above
mlpclf.fit(train_events_frame, target)

# Now use the classifier to predict the labels for the test events
mlp_y_pred = mlpclf.predict(test_events_frame)

#print mlp_y_pred

#
# Classification Reports
#

# execute classification report (classification_report was imported above)
print "MLP Classification Report"
print classification_report(y_true, mlp_y_pred)

------------

The code has three parts. The first part just gains access to the pickled files--the data arrays--that we made in the previous code. Then, after the line of #s, we are using the Dummy Classifier. First we made untouched versions of our dataframes, in case we need those. Then we took out the boolean from the end of the data frames, and made this its own variable, the target. We put the single and coincident parts together by using ".append". We predicted the boolean for the test data set, using the training from the train data set, and compared our results to the actual boolean of the test set. In the next part of the code, after the next line of #s, we did the same thing but with an actual classifier, the MLP classifier. We got this code from the digits data set tutorial on scikit-learn, and changed various names and parts so that it could be used with our data.

This code was able to give us this output, when we ran it twice on Sam's computer:

The first report in each run is from the dummy classifier, which is basically just what would happen if it guessed the labels randomly. The second reports are from an actual type of classifier; right now we are using MLP. We can see that it worked somewhat better than the dummy classifier, since the percentages are higher. However, it is still not very accurate, especially with the Coincident: True samples. We would like more Coincident data, but IceCube does not have any. We plan to try to get the code to be able to use more of the Single data, and hopefully this will help make the predictions more accurate.
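The classification report lists precision and recall for each class. As a hedged illustration of why the rare Coincident: True class can score poorly even when most predictions are right, here is a small sketch that computes those two numbers by hand. The labels are made up and the helper precision_recall is hypothetical; the code is written for Python 3.

```python
def precision_recall(y_true, y_pred, positive=True):
    """Precision and recall for one class, computed from paired label lists."""
    pairs = list(zip(y_true, y_pred))
    true_pos = sum(1 for t, p in pairs if t == positive and p == positive)
    false_pos = sum(1 for t, p in pairs if t != positive and p == positive)
    false_neg = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = true_pos / (true_pos + false_pos) if true_pos + false_pos else 0.0
    recall = true_pos / (true_pos + false_neg) if true_pos + false_neg else 0.0
    return precision, recall

# Hypothetical labels: 8 single events (False) and 2 coincident events (True)
y_true = [False] * 8 + [True] * 2
# A classifier that finds one coincident event and raises one false alarm
y_pred = [False] * 7 + [True] + [True, False]

precision, recall = precision_recall(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(precision, recall, accuracy)
```

In this toy case 8 out of 10 labels are correct, yet only half of the coincident events are found and only half of the coincident predictions are right.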

April 24th, W:

This week, we made our code process each event one after the other, so that all of the events are not stored in memory at once. This way, we can use more data. We based this new version of the code on Dr. Blaufuss' code that he gave us to read the events. This code uses a While True loop instead of a For loop, so we did the same in changing our code. However, we still have not been able to use the full amount of data, and the computers are being very slow.

This is our old code that we had, that stored all the events in memory at once, and therefore crashed Sam's computer for two hours:

We put all the outer For loops into this one While True loop, and the inner For loop is still a For loop within this. Here is the new code:

With this code, we were able to use larger sample sizes. However, using this new code within the machine learning part of the full code, the computer still got stuck. We have been running different iterations of that code, and it still gets stuck. We also tried different numbers of samples. It seems to work with 20000 samples at least, which is much greater than the original 1000 we had. We hope to figure out what is wrong soon and get larger samples, since the limited sample size keeps the machine learning from being very successful.

April 30th, Tu:

In the past week, we have tried various machine learning tactics, to see which one is the best. In addition to the MLP Classifier, we used a decision tree, which seemed to produce better results. We also added the time values, to see if they would help. We had thought an unhit DOM would need a non-numeric value, since 0 could be mistaken for an actual time value. However, scikit-learn does not take NaN, so we tried to use 0. This did not produce good results. We tried to use -1, and this did help. For all of these changes, we just switched out simple lines of code. To use the decision tree, we just used that code instead of the MLP code. To input time, we changed the 1 that we had put into the dataframe into the third value from the data (the time value), and we changed the 0, unhit, to -1. We ended up with four results, each including the Dummy, MLP, and Decision Tree classifiers.
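A minimal sketch of the time-value version, not the exact code we ran. The toy events, the helper event_to_time_row, and the choice to keep the last time when a DOM appears more than once are all assumptions made for illustration; the code is written for Python 3.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

N_DOMS = 86 * 60   # 5160 DOMs
UNHIT = -1.0       # value used for "this DOM was not hit"

def event_to_time_row(event):
    """One feature row: the hit time for each hit DOM, UNHIT everywhere else."""
    row = np.full(N_DOMS, UNHIT)
    for string, dom, time in event['Hits']:
        # Assumption: if a DOM appears twice, the later entry overwrites
        row[(string - 1) * 60 + (dom - 1)] = time
    return row

# Hypothetical events in the same dictionary layout as the pickled data
toy_events = [
    {'Hits': [[1, 1, 9800.0], [1, 2, 9900.0]], 'Coincident': False},
    {'Hits': [[1, 1, 9850.0], [1, 2, 9950.0]], 'Coincident': False},
    {'Hits': [[1, 1, 9800.0], [40, 30, 15000.0]], 'Coincident': True},
    {'Hits': [[1, 2, 9900.0], [40, 31, 4000.0]], 'Coincident': True},
]
X = np.array([event_to_time_row(event) for event in toy_events])
y = np.array([event['Coincident'] for event in toy_events])

# Swapping classifiers only changes these two lines
tree = DecisionTreeClassifier(random_state=0)
tree.fit(X, y)
print(X.shape, X.min())
```

Because -1 lies outside the range of real hit times, a decision tree can separate "not hit" from "hit at some time" with a single threshold, which 0 does not allow as cleanly.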

First we have the original code, with 0s and 1s in the dataframe, with 1000 single events to train it:

Then we put in more data--training it with 20000 single events:

Then we added time values, and computed it with both less and more data again.

Less data:

More data:

We are not sure why these results are the way they are. However, we can look at them and try to analyze it. The best scenario for predicting Coincident False events was using more data, without time values. However, the best scenario for predicting Coincident True events was using less data, with time values.

May 8th, Wednesday:

We are done with our research for this semester. There are many ways to continue this; here are three that we would have tried, had we had more time:

1. We could modify the settings of the classifiers, instead of using them just as they come in scikit-learn. For instance, we can set the depth of the decision trees, which will give us different results. For neural networks, we can change the activation, which is the function used to transform the value coming out of each neuron. Some options for this are identity, logistic, tanh, and relu. We do not yet know exactly how to do this or which one would be better.

2. We could try more classifiers, since so far we have only had time to try MLPs and Decision Trees.

3. We could try regression instead of classification. Regression gives a value instead of a category. With this, we could make the computer predict the probability, as a number, of an event being Coincident True.

-----------------------------------------------------------------------------------------------------------------------------

-----------------------------------------------HONR268N--------------------------------------------------------------

-----------------------------------------------------------------------------------------------------------------------------

Useful commands and shortcuts (HW 2)

-cmsenv puts you into the cms environment

-you can do shift>ctrl>v to paste into the terminal

-if you're writing a command but forgot to do something first, ctrl>u cuts the line; type and run the thing you need, and then ctrl>y brings back what you were in the middle of writing

-you can open a terminal in a specific place by right clicking on that directory

-ctrl c force stops a command!

-man intro shows you some useful commands and introduction to it

-cd changes the directory

-ls lists what's in the directory; ls -l shows a longer list with details; adding -h (ls -lh) makes the file sizes more human readable

-mkdir makes a directory

-pwd shows what current directory you are in (print working directory)

-history lists all the previous commands you have done

-cal with the month and year as numbers afterwards presents that month

-cat filename displays what is in a file

-The > operator writes a command's output (for example, a word from echo) into the text file given, replacing all other text in the file. The >> operator adds the output on a new line at the end of the text file, and leaves what is already there.

-mv filename filenamenew renames a file!

-when you click somewhere and start typing, it puts things BEFORE the cursor

-CTRL S SEARCHES IN EMACS

9/25/18

I got a VirtualBox error whenever I tried to open it, saying that the interface "VirtualBox Host-Only Ethernet Adapter" is not a host-only adapter interface (E_FAIL 0x80004005). I think this may have been caused when I updated my computer or when my computer shut down with everything still running. To fix this, I had to go to settings>network, where adapter 2 was checked but was not attached to any specific adapter. So in the "attached to" dropdown I clicked "not attached" instead, and that fixed the problem.

HW 3

-in scripts, the first line needs to be #! followed by the path to the program that runs that type of script (for example #!/bin/tcsh) for them to work

-putting a command in single quotes keeps any variables inside it from being filled in, so if you want variables in a command you have to use double quotes -- aka, single quotes take the literal sense of exactly what you say, not interpreting anything

-set ARG=$1 sets a variable called ARG to the first argument we type after the script name when executing it ($2 would be the second argument)

HW 4

all of these are within src:

also, this is a good site to reference to learn about types of loops and other things: http://www.cplusplus.com/doc/tutorial/control/

OGMAIN.CPP

#include <iostream> ---> this tells the compiler to include the iostream header from the standard library, which is where input/output tools like cout and endl are declared

using namespace std;

int main() { --> main is the function where the program starts running

cout <<"Hello World!" << endl; //Print hello world to screen followed by end line (endl)

--> cout sends output to the screen. "Hello World!" is the text we tell it to display. endl ends the line and flushes the output so that it appears right away. The semicolon marks the end of each statement

return 0; //Exit the program --> returns 0 to the operating system, which signals that the program finished successfully

}

output:

Hello World!

COMMANDS

g++ main.cpp --> compiles the file as C++. It translates the source code we wrote into machine code and, by default, saves the executable as a.out

./a.out --> executes it

TEST.CPP

#include <iostream> --> tells the compiler to include the iostream header, where cout and endl are declared

using namespace std;

int main() {

cout << "hello world" << endl; --> displays hello world and ends the line

int i=2; --> makes the variable i equal to 2; the word int says it is an integer (usually stored in 32 bits)

cout << "i = " <<i<<endl; --> displays "i = " and then the value of i, since the second i is not in quotes; then ends the line

double a=3.3; --> makes the variable a equal to 3.3; double is a decimal (floating-point) number, usually stored in 64 bits

cout << "a = " <<a<<endl; --> displays a=WhatItEquals

int j = a*i; -->makes the variable j equal to a times i

cout << "a*i = "<<j<<endl; --> since j is just an int, it outputs 6 instead of 6.6. If we were to change it to double, then it would output 6.6.

return 0; --> ends main and signals that the program finished successfully

}

output:

hello world

i = 2

a = 3.3

a*i = 6

NUMBERS.CPP

#include <iostream>

using namespace std;

int main() {

int n=10; --> the initial n is ten

cout << "n is "<<n<<endl;

n--; --> makes the n variable go down an integer (it becomes 9)

cout<<"n is now "<<n<<endl;

n++; --> makes the n variable go up an integer (becomes 10 again)

cout<<"n is now "<<n<<endl;

return 0;

}

output:

n is 10

n is now 9

n is now 10

TRUEFALSE.CPP

#include <iostream>

using namespace std;

int main() {

bool prop;

prop = (5>1); --> checks whether 5 is greater than 1 and stores the answer (true) in prop

cout<<"prop is "<<prop<<endl; --> since prop is true, it displays prop is 1 ("prop is" is displayed exactly as written in quotes, and the prop not in quotes displays the true or false value as a 1 or 0)

prop = (1>5); --> checks whether 1 is greater than 5, which is false

cout<<"prop is "<<prop<<endl; --> since the prop is false, it will display prop is 0

prop = (1 != 5); --> states that 1 is not equal to 5

cout << "prop is " <<prop<<endl; --> since the prop is true, it will display prop is 1

return 0;

}

output:

prop is 1

prop is 0

prop is 1

LOOP.CPP

#include <iostream>

using namespace std;

int main() {

int n=10; --> the initial n is 10

while(n>0) { --> the loop repeats as long as n is greater than 0; the last value displayed is 1, because once n reaches 0 the condition of the loop is no longer met

cout<<"n is "<<n<<endl; --> displays what n is

n--; --> subtracts one from n each time

} --> ends the looping command

return 0;

}

output:

n is 10

n is 9

n is 8

n is 7

n is 6

n is 5

n is 4

n is 3

n is 2

n is 1

FORLOOP.CPP

#include <iostream>

using namespace std;

int main() {

// when we declare a for loop, we also initialize the loop variable,

// specify the exit condition, and tell the program how to modify the

// loop variable at the end of each loop

for (int n=10; n>0; n--) { --> a FOR loop--does the same thing as loop.cpp did, just all together in one line

cout<<"n is "<<n<<endl;

} --> here the loop ends

// in a for loop, the loop variable (in this case, 'n') only exists in

// the loop. we are not able to call 'n' from out here

// uncomment the following line and see for yourself

// cout<<"n outside the loop: "<<n; --> when I uncommented this, g++ did not accept my code because n was not defined outside of the loop

return 0;

}

output:

same as above (when the line is commented)

PRACTICE1.CPP

#include <iostream>

using namespace std;

int main() {

int n=0, m=0; -->starts both the n and m variables at 0

while(n<10) { --> while n is less than 10 it does all the things enclosed in the brackets

// this is the slow (or outer) loop

cout << "n is " << n << ": "; --> displays what number n is, followed by a colon

m=0;

while(m<=n) { --> while m is less than or equal to n it does the following commands

// this is the fast (or inner) loop

// in this loop, the slow loop variable (n) is a constant

// this loop must run to completion before the slow loop

// can progress (during every iteration of the slow loop!)

cout << m;

m++; --> increases m by 1 until it stops meeting the m<=n condition

}

// now the fast loop has finished and the slow loop can

// continue with the current iteration

cout << endl; --> makes it go to the next line in the output

n++; --> after the inner loop has increased m one at a time until it passed n, it goes back to the outer loop, increases n by 1, and does it all again until the outer loop's condition stops being met

}

return 0 ;

}

gives the output:

n is 0: 0

n is 1: 01

n is 2: 012

n is 3: 0123

n is 4: 01234

n is 5: 012345

n is 6: 0123456

n is 7: 01234567

n is 8: 012345678

n is 9: 0123456789

MY NEW VERSION, WITH A FOR LOOP (PRACTICE2.CPP)

#include <iostream>

using namespace std;

int main() {

for (int n=0, m=0; n<10; n++) { --> the initialization is executed, then the condition is checked. if it is met, it continues on to the commands inside the brackets

cout << "n is " <<n << ": "; --> tells it to display what n is followed by a colon (displays directly what is in quotes, and for the n not in quotes it displays the value of n)

for (int m=0; m<=n; m++) { --> another loop, which runs to completion before it "goes back up" to the previous for command

cout << m; --> displays m; after each pass it adds 1 to m and repeats as long as the m<=n condition is met. When that loop finishes, it goes way back up and finishes the first for command by adding 1 to n. Then it loops and does all this again

}

cout << endl;

}

return 0 ;

}

gives same output!

HW 5

notes: &p is the address of p, *p makes it find the data within that address

LOGICSTATEMENT.CPP

#include <iostream>

using namespace std;

int main() {

int n = 10; --> starts n at 10

while (n>=10) { --> while n is greater than or equal to 10:

if(n>5) { --> if n is greater than five, do the following command

cout<<"n is "<<n<<endl; --> displays what n is (says the content in quotes directly, then after that it says n, but since it's not in quotes it substitutes in the actual value of n)

}

else { --> otherwise (in this case though, never, since all values greater than or equal to 10 are greater than 5)

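cout<<"n = "<<n<<endl; --> display what n is (just like above)

} --> ends the else

n--; --> subtract one from n, so n becomes 9 and the while condition stops being met

} --> ends the while loop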

return 0;

}

output:

n is 10

POINTERS.CPP

#include <iostream>

using namespace std;

int main() {

int i = 10;

cout << "The memory address of i is " << &i << "\n"; --> \n ends the line (but make sure the slash is going in this direction!!!)

cout << "The data stored at memory address " << &i << " is " << i << "\n";

int* p = &i;

cout << "The value of p is " << p << "\n";

cout << "We say that p 'points at' the memory location referenced by address " << p << "\n";

cout << "The data stored at memory address " << p << " is " << *p << "\n";

return 0;

}

PROGRAM1.CPP

#include <iostream>

using namespace std;

int main(){

int i = 10; --> declares i to be 10

int j = i; --> declares j to be the value of i at that moment---if i changes, j will still just be 10 because that is what i was when we said int j = i

cout << "i= " << i << " and j= " << j << "\n";

i=5;

cout << "i= " << i << " and j= " << j << "\n";

j=1;

cout << "i= " << i << " and j= " << j << "\n";

return 0;

}

PROGRAM2.CPP

#include <iostream>

using namespace std;

int main(){

int i = 10;

int* p = &i; --> points to the data inside the "mailbox" of i -- the * after int means that the p is a pointer not just a variable; also, if we wanted to declare a pointer to a double, we just say double* p =

cout << "i= " << i << " and *p= " << *p << "\n"; --> *p is the data that p points to

i=5; --> when i changes, so does *p, because both are referencing the same data

cout << "i= " << i << " and *p= " << *p << "\n";

*p=1; --> similarly, when *p is changed, so is i

cout << "i= " << i << " and *p= " << *p << "\n";

return 0;

}

output:

i= 10 and *p= 10

i= 5 and *p= 5

i= 1 and *p= 1

NEWPOINTER.CPP

#include <iostream>

using namespace std;

int main(){

int* p = new int(5); --> puts the data of 5 into an available memory location (chosen for us at run time), and then the p variable points to this location

cout << "p points at address " << p << "\n";

cout << "The data stored in address " << p << " is " << *p << "\n";

*p = 10; --> when we change the value of *p, since p points at the memory location with the 5 in it, the data inside that memory location is what gets changed

cout << "Now the data stored in address " << p << " is " << *p << "\n";

return 0;

}

output:

p points at address 0x1bd7010

The data stored in address 0x1bd7010 is 5

Now the data stored in address 0x1bd7010 is 10

HW5CODE.CPP

#include <iostream>

using namespace std;

int main() {

int n = 5;

cout << "we start off with n equal to 5." << endl;

while (n<=45) {

if (n<21) {

n=n*2; --> at first, I tried just saying n*2, but it does not actually multiply it by 2 if you do that. This is what you need to do; you can also do "n*=2" as a shortcut way to do it

cout << "here, when n (before being multiplied) is less than 21, we multiply it by two: " <<endl<< "n is " <<n<<endl;

}

else {

n++;

cout << "otherwise, aka once n rises above 21, we just add 1 to n each time: " <<endl<< "n is " <<n<<endl;

}

} --> ends the while loop, so the next line is only displayed once, after the loop has finished

cout << "the original while statement was while n is less than or equal to 45, so once n goes above 45 the commands stop being executed" <<endl;

return 0;

}

output:

we start off with n equal to 5.

here, when n (before being multiplied) is less than 21, we multiply it by two:

n is 10

here, when n (before being multiplied) is less than 21, we multiply it by two:

n is 20

here, when n (before being multiplied) is less than 21, we multiply it by two:

n is 40