Skip to content

server 옮기면서 training 다시 시작했을 때 에러들 #31

@yskim0

Description

@yskim0

(1) 수동으로 체크포인트 폴더를 만들어서 gcp에서 하던 체크포인트 하나 옮김

python3 flow —model ./cfg/handlang-small.cfg —labels ./labels.txt —trainer adam —dataset ./Data/handlang-data-1-400/dataset/ —annotation ./Data/handlang-data-1-400/annotations/ —train —summary ./logs —batch 20 —epoch 2000 —save 200 —keep 200 —lr 1e-04 —gpu 1.0 —load 105435

=> 못불러옴
비슷하게 load ./handlang-small-105435
load ./handlang-small-105435.meta
다 안됨

(2) gcp로 하던거 올바른 체크포인트에서 savepb하고 수동으로 built-graph 디렉토리 만들어서 옮김

python3 flow —pbLoad ./built-graph/handlang-small.pb —metaLoad ./built-graph/handlang-small.meta —labels ./labels.txt —trainer adam —dataset ./Data/handlang-data-1-400/dataset/ —annotation ./Data/handlang-data-1-400/annotations/ —train —summary ./logs —batch 20 —epoch 2000 —save 200 —keep 200 —lr 1e-04 —gpu 1.0

=> ValueError: No variables to optimize.
에러. 비슷하게 built-graph 폴더 지우고 그냥 밖(./handlang-small.pb)에서 했을 때도 똑같은 에러 발생

(3) meta가 어쨌든 weights에 대응하는 거니까 cfg, meta 섞어봄.

python3 flow —model ./cfg/handlang-small.cfg  —metaLoad ./handlang-small.meta —labels ./labels.txt —trainer adam —dataset ./Data/handlang-data-1-400/dataset/ —annotation ./Data/handlang-data-1-400/annotations/ —train —summary ./logs —batch 20 —epoch 2000 —save 200 —keep 200 —lr 1e-04 —gpu 1.0

=> 에러는 뜨지 않으나 그냥 처음부터 train 하는 결과와 같음.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions