Levi-Heath's picture
Update README.md
a32eb7c verified
|
raw
history blame
1.87 kB
metadata
license: cc-by-nc-nd-4.0
language:
  - en
pipeline_tag: image-classification
tags:
  - ImageNet-1k
  - image classification
datasets:
  - ILSVRC/imagenet-1k
metrics:
  - accuracy
authors:
  - Bo Deng, University of Nebraska-Lincoln
  - Levi Heath, University of Colorado Colorado Springs

Towards Errorless Training ImageNet-1k

This repository host MATLAB code and models for the manuscript, Towards Errorless Training ImageNet-1k, which is available at https://arxiv.org/abs/2508.04941. We give 6 models trained on the ImageNet-1k dataset, which we list in the table below. Each model is featured model of archtecture 17x40x2. That is, each model is made up of 17x40x2=1360 FNNs, all with homogeneous architecture (900-256-25 or 900-256-77-25), working in parallel to produce 1360 predictions which determine a final prediction using the majority voting protocol.

We trained the 6 models using the following transformation of the 64x64 downsampled ImageNet-1k dataset:

  • downsampled images to 32x32, using the mean values of non-overlapping 2x2 grid cells and
  • trimmed off top row, bottom row, left-most column, and right-most column.

This transformed data results in 30x30 images, hence 900-dimensional input vectors.

For a thorough description of our models trained on the ImageNet-1k dataset, please read our preprint linked above.

Model Training Method FNN Architecture Accuracy (%)
Model_S_h1_m1 SGD 900-256-25 98.247
Model_S_h1_m2 SGD 900-256-25 98.299
Model_S_h2_m1 SGD 900-256-77-25 96.990
Model_T_h1_m1 SGD followed by GDT 900-256-25 98.289
Model_T_h1_m2 SGD followed by GDT 900-256-25 98.300
Model_T_h2_m1 SGD followed by GDT 900-256-77-25 97.770
*SGD = stochastic gradient descent
**GDT = gradient descent tunneling