
Conversation

@dsanghavi (Contributor) commented:

Update contains:

  1. ngram in evaluation/__init__.py
  2. SpokenMNIST in datasets/__init__.py and eth_spokenMNIST.py

@dsanghavi requested a review from djsaunde on March 7, 2018 at 04:45.
import pickle as p
import numpy as np
import scipy.io.wavfile
from scipy.fftpack import dct
@djsaunde (Collaborator) commented:

Move this down to the from imports and correct the spacing.

    Data is divided by an 80-20 split into train and test
    '''
    def __init__(self, path=None):
        self.data_dir = '/home/darpan/sem4/free-spoken-digit-dataset/recordings/'
@djsaunde (Collaborator) commented:

This is a hard-coded path and will fail on other machines. Take a look at the MNIST class for a default way of doing this.
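For reference, a default along those lines might look like this (a minimal sketch; the fallback directory name and the os.path construction are assumptions, not the actual MNIST class code):

import os

class SpokenMNIST:
    def __init__(self, path=None):
        # Fall back to a directory relative to this module instead of a
        # machine-specific absolute path (directory name is hypothetical).
        if path is None:
            path = os.path.join(os.path.dirname(__file__), '..', 'data', 'spoken_mnist')
        self.data_dir = path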

class SpokenMNIST:
    '''
    Data is divided by an 80-20 split into train and test
    '''
@djsaunde (Collaborator) commented:

More descriptive docstring, plus punctuation.
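Something like the following, for example (the wording here is only a suggestion, not the PR's final text):

class SpokenMNIST:
    '''
    Handles loading and pre-processing of the Spoken MNIST audio digit dataset.
    Data is divided by an 80-20 split into train and test sets.
    '''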

            labels.append(label)
        return audios, torch.Tensor(labels)

    def pre_process(self, file):
@djsaunde (Collaborator) commented:

Maybe preprocess instead of pre_process?

        filter_banks = 20 * np.log10(filter_banks)  # dB

        return filter_banks, label

@djsaunde (Collaborator) commented:

In general, this method seems really messy / hard-coded. Perhaps some of the parameters should be arguments to the function (e.g., NFFT, frame_size, frame_stride, etc.).
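A sketch of what lifting those constants into keyword arguments could look like (the default values shown are common MFCC-style choices and are assumptions, not necessarily the values used in this PR; scipy.io.wavfile is assumed imported at module level):

    def preprocess(self, file, frame_size=0.025, frame_stride=0.01, NFFT=512, n_filters=40):
        # Framing and FFT parameters become keyword arguments rather than
        # magic numbers buried in the method body.
        sample_rate, signal = scipy.io.wavfile.read(file)
        frame_length = int(round(frame_size * sample_rate))  # samples per frame
        frame_step = int(round(frame_stride * sample_rate))  # samples between frame starts
        # ...remainder of the filter bank computation, driven by NFFT and n_filters.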

spike train from that timestep onwards
Inputs must be non-negative. Spike inter-arrival times are inversely proportional to
input magnitude, so data must be scaled according to desired spike frequency.
@djsaunde (Collaborator) commented:

But, it's not a "mixture" per se, is it? I feel like the function header is misleading.


    # Yield Bernoulli-distributed spike trains.
    return s

@djsaunde (Collaborator) commented:

I think we can simply add functionality to get_bernoulli in which the time dimension is implicit; e.g., we could pass time=None, and based on this, discard the explicit time dimension.
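Roughly what that could look like (a sketch under assumed input shapes, not the actual bindsnet implementation):

import torch

def get_bernoulli(datum, time=None, max_prob=1.0):
    # Interpret the scaled, non-negative input as spike probabilities.
    datum = torch.clamp(datum * max_prob, 0, 1)
    if time is None:
        # Implicit time dimension: a single Bernoulli draw per element.
        return torch.bernoulli(datum)
    # Explicit time dimension: an independent draw at each timestep.
    return torch.bernoulli(datum.repeat(time, *([1] * datum.dim())))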

Most important functions to use:
confidence_weighting()
ngram()
@djsaunde (Collaborator) commented:

Dunno what this string is for. Could you remove it?

'''

def get_fire_order(example):
@djsaunde (Collaborator) commented:

Unclear what this function is doing without a docstring. I suggest get_firing_order.
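For instance, the renamed function with a docstring might read as follows (the shape of example and the docstring format are assumptions):

import torch

def get_firing_order(example):
    '''
    Return neuron indices in the order in which they spiked.

    Inputs:
        example (torch.Tensor): Spikes, of shape (time, n_neurons).

    Returns:
        (list): Neuron IDs in order of spike occurrence.
    '''
    firing_order = []
    for timestep in example:
        for n_id in torch.nonzero(timestep).flatten().tolist():
            firing_order.append(n_id)
    return firing_order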

        fire_order.append(n_id)
    return fire_order

def normalize_probability(v):
@djsaunde (Collaborator) commented:

Is it necessary to put this in a separate function? In general, you should avoid proliferation of functions inside of library modules, since they can be imported by end-users.

Top-k
Precision
Recall
'''
@djsaunde (Collaborator) commented:

Rewrite the docstring to conform to the style used elsewhere in the package.

if not tuple(seq) in ngrams:
    ngrams[tuple(seq)] = np.zeros(10)
ngrams[tuple(seq)][true_label] += 1

@djsaunde (Collaborator) commented:

Consider folding both of these functions into the ngram function.
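Folded together, it might look something like this (a sketch: the signature and the 10-class default follow the snippet above, but the details are guesses; get_firing_order is the helper sketched earlier, which could itself be inlined):

import numpy as np

def ngram(spikes, true_labels, n=2, n_classes=10):
    # Build the n-gram -> class-count table with the firing-order and
    # normalization steps folded in, so no helpers leak into the module API.
    ngrams = {}
    for example, true_label in zip(spikes, true_labels):
        fire_order = get_firing_order(example)
        for i in range(len(fire_order) - n + 1):
            seq = tuple(fire_order[i:i + n])
            if seq not in ngrams:
                ngrams[seq] = np.zeros(n_classes)
            ngrams[seq][true_label] += 1
    # Normalize counts into per-sequence class probabilities in place.
    for seq in ngrams:
        total = ngrams[seq].sum()
        if total > 0:
            ngrams[seq] /= total
    return ngrams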

from nodes import AdaptiveLIFNodes, LIFNodes, Input
from analysis.plotting import plot_input, plot_spikes, plot_weights

from evaluation import *
@djsaunde (Collaborator) commented:

Fix spacing.

images, labels = MNIST(path='../data').get_train()
images /= (255 * min_isi)  # Normalize and enforce minimum expected inter-spike interval.
images = images.view(images.size(0), -1)  # Flatten images to one dimension.
labels = [int(lbl) for lbl in labels]
@djsaunde (Collaborator) commented:

Why do this? Having the labels lazily returned from a generator is a good thing. Same with images, which should be returned as a generator as in the get_poisson and get_bernoulli functions.
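For comparison, the lazy version the reviewer has in mind could be as simple as this (a sketch of the generator pattern only; the function name is hypothetical and this is not the actual get_train API):

def iter_labels(labels):
    # Yield converted labels one at a time instead of materializing
    # the whole list up front, mirroring get_poisson / get_bernoulli.
    for label in labels:
        yield int(label)

Training code would then consume one (image, label) pair per step without ever holding the fully converted lists in memory.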

print('Begin training.\n')
start = default_timer()

train_spikes = []
@djsaunde (Collaborator) commented on Mar 7, 2018:

Consider just appending spike_record to disk every update_interval. This way, we won't have to store n_samples * time * n_neurons spikes.
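A sketch of that periodic-save idea (the loop index i and the torch.save file naming are illustrative assumptions; spike_record and update_interval come from the comment above):

if (i + 1) % update_interval == 0:
    # Flush the current window of spikes to disk and reset the buffer,
    # keeping memory at O(update_interval * time * n_neurons) instead of
    # O(n_samples * time * n_neurons).
    torch.save(spike_record, 'spikes_%d.pt' % ((i + 1) // update_interval))
    spike_record = torch.zeros_like(spike_record)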

@djsaunde (Collaborator) left a review:

See inline code comments.

coopersigrist added a commit that referenced this pull request on Oct 29, 2019.