berkeley-data / hpt
License: MIT License
dict(type='RandomGrayscale', p=0.2),
The initial tricky part of using an RGB dataset (like RESISC) for evaluation is that the model's 3-channel input is not expecting RGB, but rather the output of our input_modules (I think the right way to handle this is to perform a bit of MoCo training on the target dataset where we freeze all params except an appropriate input_module -- but this will take a little engineering to set up).
So for now, it'll be easier if we use Sen1 and Sen2 for evaluation, and use their corresponding input_module that was trained during MoCo pretraining.
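A minimal sketch of the freezing step described above; the submodule names `input_module` and `backbone` are assumptions about the model layout, not the actual hpt code:

```python
import torch.nn as nn

# Toy stand-in for the real model: an input_module feeding a shared backbone.
class ToyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.input_module = nn.Conv2d(13, 3, kernel_size=1)  # e.g. Sen2 bands -> 3 channels
        self.backbone = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU())

model = ToyModel()

# Freeze everything except the input_module before MoCo training on the target dataset.
for name, p in model.named_parameters():
    p.requires_grad = name.startswith("input_module")
```

With this, the optimizer only updates the input_module while the pretrained backbone stays fixed.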
Have a look at https://arxiv.org/pdf/2004.13390.pdf (Section 4.1), which is based on the SEN12MS dataset.
"The optical and radar images were resampled to 10 m ground sampling distance and span 256×256 px in height and width. The original dataset uses tile-overlaps of 50%. For this work, we removed the overlap to ensure independence of support and query datasets, which yielded 200 306 128×128 px tiles."
They removed the overlap.
Geography-Aware Self-supervised Learning Paper
Functional Map of the World (FMoW)
@TsungChinHanKen
This is created based on the following in Colorado's email:
Getting started goals:
crguest
Just a thought: since we are tracking the first part as a separate task, maybe we can perform the remaining two tasks together via Zoom, as it's a shared guest user account.
Include the augmentation done by @surya.
dict(
# type='RandomAppliedTrans',
# transforms=[
# dict(
# type='ColorJitter',
# brightness=0.4,
# contrast=0.4,
# saturation=0.4,
# hue=0.4)
# ],
# p=0.8),
https://github.com/Berkeley-Data/hpt/blob/taeil/references/model_architectures.md
download and extract
Hey folks,
Let's take a look at a more straightforward contrastive learning baseline, using the following set of pretraining techniques, i.e., each of the following steps is a separate pretraining technique, ordered by complexity:
The idea here is that in (1) we're doing instance discrimination where an input image is really a composition of 2 images. In (2) we're doing instance discrimination where an input image can also come from only s1 or s2. In (3), an input image can come from either (1) or (2).
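The composition in (1) can be sketched as channel-wise stacking of co-located patches; the band counts below are illustrative, not taken from the actual configs:

```python
import numpy as np

def compose_pair(s1, s2):
    """Stack co-located s1 and s2 patches along the channel axis into one input image."""
    return np.concatenate([s1, s2], axis=0)

# Illustrative band counts: Sentinel-1 has 2 SAR bands, Sentinel-2 has 13 optical bands.
s1 = np.zeros((2, 128, 128), dtype=np.float32)
s2 = np.zeros((13, 128, 128), dtype=np.float32)
x = compose_pair(s1, s2)  # one composed "image" with 15 channels
```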
The Geography-Aware paper uses temporal images, and they have some notation (like many papers) specific to time, as shown in the following figure:
Since our project uses spatially aligned (same-location) images instead of temporal ones, it would be great to have notation that adds similar value to the paper.
According to Colorado, most people will not read the full paper; they will only glance at the first figure. We need to come up with some explainer diagrams and share them with the group for feedback.
Based on the discussion in the team meeting (03/13 8:00 am), first we need to come up with a rough sketch of the current architecture. No need to focus on graphic creation.
I searched the issues in the wandb git repo. There were no such issues last year; only this year have a couple of similar issues been reported. So I tested by installing older versions, starting from the latest version 0.10.22. The issue is reproducible down to 0.10.0. Below that version, the issue is not reproducible and multiple workers work fine.
Please install wandb version 0.9.7, which was released on Sep 8, 2020.
We can activate the conda env and install it using 'pip install wandb==0.9.7'.
The wandb team releases patches frequently; they have released 13 versions since Sep 8, 2020. Hopefully they will fix the issue soon.
Originally posted by @SuryaTechie in #2 (comment)
Check this dataset out. 180k triplets, georeferenced, multi band, multi modal, multi resolution: "SEN12MS -- A Curated Dataset of Georeferenced Multi-Spectral Sentinel-1/2 Imagery for Deep Learning and Data Fusion" https://arxiv.org/abs/1906.07789
We should modify main_train to convert and load in the same script, to reduce the number of steps.
Getting the following warnings... Need to investigate and see if they're going to impact the results; if yes, we need to fix them.
/home/taeil/anaconda3/envs/hptest/lib/python3.7/site-packages/sklearn/metrics/_classification.py:1493: UndefinedMetricWarning: F-score is ill-defined and being set to 0.0 in labels with no true nor predicted samples. Use zero_division
parameter to control this behavior.
average, "true nor predicted", 'F-score is', len(true_sum)
/home/taeil/anaconda3/envs/hptest/lib/python3.7/site-packages/sklearn/metrics/_classification.py:1245: UndefinedMetricWarning: Precision is ill-defined and being set to 0.0 in labels with no predicted samples. Use zero_division
parameter to control this behavior.
_warn_prf(average, modifier, msg_start, len(result))
/home/taeil/anaconda3/envs/hptest/lib/python3.7/site-packages/sklearn/metrics/_classification.py:1245: UndefinedMetricWarning: Recall is ill-defined and being set to 0.0 in labels with no true samples. Use zero_division
parameter to control this behavior.
_warn_prf(average, modifier, msg_start, len(result))
Validation microPrec: 0.540000 microF1: 0.540000 sampleF1: 0.540000 microF2: 0.540000 sampleF2: 0.540000
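The warnings above fire for classes with no true or predicted samples; scikit-learn's `zero_division` parameter handles this case explicitly. A minimal illustration with toy labels (not our actual evaluation code):

```python
from sklearn.metrics import f1_score

# Class 2 never appears in y_pred, which would normally trigger UndefinedMetricWarning.
y_true = [0, 1, 2, 0]
y_pred = [0, 1, 0, 0]

# zero_division=0 sets the undefined per-class scores to 0.0 without warning.
score = f1_score(y_true, y_pred, average="macro", zero_division=0)
```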
Updated the data loader and script to run without an error, but the result does not look correct.
The following code needs to be updated.
for sample in loader:
    x = sample["image"]
    data = torch.stack([x.mean([1, 2]), (x * x).mean([1, 2])])
    results += data.sum(2)
    Nproc += data.shape[0]
    i += 1
    print("batch: {}/{}".format(i, NB))
    if i >= NB:
        break
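For reference, once sums of x and x² are accumulated across batches, the per-channel mean and std follow from E[x²] − E[x]². A self-contained numpy sketch of that finalization (shapes and names are illustrative, not the loop's actual variables):

```python
import numpy as np

# Toy batch of images: (N, C, H, W)
x = np.arange(2 * 3 * 2 * 2, dtype=np.float64).reshape(2, 3, 2, 2)

# Accumulate per-channel sums of x and x*x over all pixels, as the loop above does per batch.
sum_x = x.sum(axis=(0, 2, 3))
sum_x2 = (x * x).sum(axis=(0, 2, 3))
n_pix = x.shape[0] * x.shape[2] * x.shape[3]

mean = sum_x / n_pix
std = np.sqrt(sum_x2 / n_pix - mean * mean)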
We were using an old version of torchvision.transforms, which only supports PIL images.
https://pytorch.org/docs/1.6.0/torchvision/transforms.html#transforms-on-torch-tensor
Either change to a PIL image when loading the image, or do our own transformation (which is probably a bad idea).
Need to see if we can upgrade torchvision only.
If we try transforms.ToPILImage, we get this error: https://discuss.pytorch.org/t/error-while-using-transforms-topilimage-and-randomresizedcrop/12861
Colorado's change with custom data loader:
Berkeley-Data/OpenSelfSup#3
how to transfer 10 channel weights
the best model so far by @suryatechie
great job!
classification/main_train.py --exp_name finetune --data_dir data/sen12ms --label_split_dir splits --IGBP_simple --label_type multi_label --threshold 0.1 --model Moco --lr 0.00001 --decay 1e-5 --batch_size 64 --num_workers 4 --data_size 4096 --epochs 100 --pt_name vivid-resonance-73 --pt_type qe --pt_dir data/pretrained/moco --eval --use_s2 --use_s1
Create some initial augmentations like GaussianBlur, ElasticTransform, Blur, VerticalFlip, HorizontalFlip, RandomBrightnessContrast.
Write a simple util function to create positive pairs
The positive pair should be multiple images from different satellites at the same location.
The following is a positive pair:
ROIs1158_spring_lc_100_p101.tif
ROIs1158_spring_s1_100_p101.tif
ROIs1158_spring_s2_100_p101.tif
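A sketch of the pairing util, assuming the SEN12MS naming pattern shown above (season prefix, modality token, scene and patch ids); the function name and regex are hypothetical:

```python
import re

# Hypothetical helper: given any one tile name, return the co-located (lc, s1, s2) triplet.
PATTERN = re.compile(r"^(ROIs\d+_[a-z]+)_(lc|s1|s2)_(\d+_p\d+\.tif)$")

def positive_pair(name):
    m = PATTERN.match(name)
    if m is None:
        raise ValueError("unexpected tile name: " + name)
    prefix, _, suffix = m.groups()
    return tuple("{}_{}_{}".format(prefix, mod, suffix) for mod in ("lc", "s1", "s2"))
```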
Note: The input can be a pkl file
Need code snippet and exploration around matching the dimensions.
Here is the current code to load an s1 image.
def load_s1(path, imgTransform):
    with rasterio.open(path) as data:
        s1 = data.read()
    s1 = s1.astype(np.float32)
    s1 = np.nan_to_num(s1)
    s1 = np.clip(s1, -25, 0)
    if not imgTransform:
        s1 /= 25
        s1 += 1
    s1 = s1.astype(np.float32)
    return s1
Please help modify it.
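The normalization branch above clips SAR backscatter to [-25, 0] dB and maps it into [0, 1]. A self-contained numpy sketch of just that step, pulled out so it can be reused or modified independently of the rasterio I/O (the function name is an assumption):

```python
import numpy as np

def normalize_s1(s1):
    """Clip SAR backscatter to [-25, 0] dB and rescale into [0, 1]."""
    s1 = np.nan_to_num(s1.astype(np.float32))
    s1 = np.clip(s1, -25.0, 0.0)
    return s1 / 25.0 + 1.0
```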
This issue was mentioned in our meeting with Colorado last Friday. Pixel-level training refers to using the land cover images as labels for each pixel. There are 4 land cover bands (4×256×256). The first slice corresponds to the full label classification; the other 3 assign other values to some of the classes that we know.
# dict(type='RandomResizedCrop', size=224, scale=(0.2, 1.)),
Using 1 or 2, pre-training collects loss; with 4, it fails to collect (or calculate) the loss. Need to debug and see what is happening.
Pretrained models are downloaded and placed on s3 (s3://sen12ms/pretrained_sup) for faster download, along with other pre-trained models.
Once this is done, please convert this issue to a discussion for documentation purposes.
no epoch progress for pre-training (sen12ms)
Currently it saves all checkpoints, which uses up project storage on wandb too fast. We need to save only the best model to the wandb folder.
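A minimal sketch of keeping only the best checkpoint, as a pure-Python stand-in (in the real code the save would be torch.save into the wandb run folder; the class name is hypothetical):

```python
import os
import pickle
import tempfile

class BestCheckpoint:
    """Overwrite a single checkpoint file whenever the monitored metric improves."""

    def __init__(self, path):
        self.path = path
        self.best = float("inf")

    def update(self, metric, state):
        if metric < self.best:
            self.best = metric
            with open(self.path, "wb") as f:
                pickle.dump(state, f)  # torch.save(state, self.path) in practice
            return True
        return False
```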
Assuming our method works, we need to start writing the paper and share the draft/structure with Colorado and the professors.
Here is the current draft.
Yeah - SEN12MS may not be the perfect evaluation because of the mislabeling issue, but it's a good diagnostic to make sure our system is working properly -- and we can report the results either way.
It'll be easier if we use Sen1 and Sen2 for evaluation, and use their corresponding input_module that was trained during MoCo pretraining.
A 200 epoch model is at:
/scratch/crguest/vivid-resonance-73_sen12ms_no_aug_200epoch.pth
where the wandb training is here: https://wandb.ai/cjrd/BDOpenSelfSup-tools/runs/3qjvxo2p?workspace=user-cjrd
(transfer learning)
Run sen12ms training on s1 and s2 separately, using the same pre-training model.
Additional Update on March 26th
Evaluate a subset of the test data and reduce the 128-dimensional output to 2 or 3 components, to plot them and describe the visual classification.
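A sketch of the reduction step using a plain SVD-based PCA on a feature matrix; the 128-dim features and component count come from the text above, everything else is illustrative:

```python
import numpy as np

def pca_reduce(features, n_components=2):
    """Project (n_samples, n_features) embeddings onto their top principal components."""
    centered = features - features.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T

# e.g. 100 test embeddings of dimension 128 -> 2-D points for a scatter plot
rng = np.random.default_rng(0)
coords = pca_reduce(rng.normal(size=(100, 128)), n_components=2)
```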
/home/taeil/anaconda3/envs/taeil/lib/python3.7/site-packages/torch/nn/modules/module.py:795: UserWarning: Using a non-full backward hook when the forward contains multiple autograd Nodes is
deprecated and will be removed in future versions. This hook will be missing some grad_input. Please use register_full_backward_hook to get the documented behavior.
warnings.warn("Using a non-full backward hook when the forward contains multiple autograd Nodes "
/home/taeil/anaconda3/envs/taeil/lib/python3.7/site-packages/torch/nn/modules/module.py:760: UserWarning: Using non-full backward hooks on a Module that does not return a single Tensor or a
tuple of Tensors is deprecated and will be removed in future versions. This hook will be missing some of the grad_output. Please use register_full_backward_hook to get the documented behavior.
I think everyone should replace their p3 instance with this one, since we may not use the previous one any more.
Updated the instructions here.
Currently running on a small dataset. Need to expand training to a bigger dataset.
# dict(type='RandomHorizontalFlip'),
According to Colorado, the full train set includes the test set.
Please help fix the train set so it doesn't include the test set.
Also, the test set contains only 8 classes. Please help confirm this and decide how to address it.
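A minimal sketch of the de-overlap step, assuming samples are identified by tile names (function and variable names are illustrative):

```python
def remove_test_overlap(train_ids, test_ids):
    """Drop any train sample whose id also appears in the test set."""
    test = set(test_ids)
    return [tid for tid in train_ids if tid not in test]
```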