Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is it trained after loading the pre training model or not #2

Open
xiaowuge1201 opened this issue Aug 11, 2020 · 16 comments
Open

Is it trained after loading the pre training model or not #2

xiaowuge1201 opened this issue Aug 11, 2020 · 16 comments
Labels
good first issue

Comments

@xiaowuge1201
Copy link

@xiaowuge1201 xiaowuge1201 commented Aug 11, 2020

As described in the title

@argusswift
Copy link
Owner

@argusswift argusswift commented Aug 11, 2020

It is better for you to conduct your training program after the pre training model is loaded, which will hugely improve the validation accuracy.

@xiaowuge1201
Copy link
Author

@xiaowuge1201 xiaowuge1201 commented Aug 12, 2020

It is better for you to conduct your training program after the pre training model is loaded, which will hugely improve the validation accuracy.

I understand what you said. I mean, under what circumstances did your model train? Did you train with or without the author's weight

@argusswift
Copy link
Owner

@argusswift argusswift commented Aug 13, 2020

yeah...I had used the author's weight (yolov4.weights) to get my results.For you, you can follow the instructions in read.me to conduct your experiment.

@xiaowuge1201
Copy link
Author

@xiaowuge1201 xiaowuge1201 commented Aug 14, 2020

yeah...I had used the author's weight (yolov4.weights) to get my results.For you, you can follow the instructions in read.me to conduct your experiment.

I don't think that if you want to verify the performance of your reproduced code, you should not load the original author's weight during training
In particular, I expect you to validate your code and feed back your test results without loading the original author weight file.
Thanks.

@argusswift
Copy link
Owner

@argusswift argusswift commented Aug 23, 2020

yeah...I had used the author's weight (yolov4.weights) to get my results.For you, you can follow the instructions in read.me to conduct your experiment.

I don't think that if you want to verify the performance of your reproduced code, you should not load the original author's weight during training
In particular, I expect you to validate your code and feed back your test results without loading the original author weight file.
Thanks.

yeah,you are excellent. I have changed my source code to verify the results. In this section, I have already loaded the CSPDarknet-53 weight file in the feature extraction network.

@xiaowuge1201
Copy link
Author

@xiaowuge1201 xiaowuge1201 commented Aug 25, 2020

yeah...I had used the author's weight (yolov4.weights) to get my results.For you, you can follow the instructions in read.me to conduct your experiment.

I don't think that if you want to verify the performance of your reproduced code, you should not load the original author's weight during training
In particular, I expect you to validate your code and feed back your test results without loading the original author weight file.
Thanks.

yeah,you are excellent. I have changed my source code to verify the results. In this section, I have already loaded the CSPDarknet-53 weight file in the feature extraction network.

哈哈,才知道你是中国人, 咨询下在不加载预训练模型情况下进行训练时,你的代码训练的效果是多少???

@xiaowuge1201
Copy link
Author

@xiaowuge1201 xiaowuge1201 commented Aug 25, 2020

yeah...I had used the author's weight (yolov4.weights) to get my results.For you, you can follow the instructions in read.me to conduct your experiment.

I don't think that if you want to verify the performance of your reproduced code, you should not load the original author's weight during training
In particular, I expect you to validate your code and feed back your test results without loading the original author weight file.
Thanks.

yeah,you are excellent. I have changed my source code to verify the results. In this section, I have already loaded the CSPDarknet-53 weight file in the feature extraction network.

你这个是最新的代码吗????

@argusswift
Copy link
Owner

@argusswift argusswift commented Aug 26, 2020

yeah...I had used the author's weight (yolov4.weights) to get my results.For you, you can follow the instructions in read.me to conduct your experiment.

I don't think that if you want to verify the performance of your reproduced code, you should not load the original author's weight during training
In particular, I expect you to validate your code and feed back your test results without loading the original author weight file.
Thanks.

yeah,you are excellent. I have changed my source code to verify the results. In this section, I have already loaded the CSPDarknet-53 weight file in the feature extraction network.

哈哈,才知道你是中国人, 咨询下在不加载预训练模型情况下进行训练时,你的代码训练的效果是多少???

这个没有测试过,但结果应该不会很差。

@argusswift
Copy link
Owner

@argusswift argusswift commented Aug 26, 2020

yeah...I had used the author's weight (yolov4.weights) to get my results.For you, you can follow the instructions in read.me to conduct your experiment.

I don't think that if you want to verify the performance of your reproduced code, you should not load the original author's weight during training
In particular, I expect you to validate your code and feed back your test results without loading the original author weight file.
Thanks.

yeah,you are excellent. I have changed my source code to verify the results. In this section, I have already loaded the CSPDarknet-53 weight file in the feature extraction network.

你这个是最新的代码吗????

这个暂时是最新的,后面还有新的模块后续还会更新。

@xiaowuge1201
Copy link
Author

@xiaowuge1201 xiaowuge1201 commented Aug 26, 2020

没有测试过,但结果应该不

这个我进行过测试, 训练不出来模型, 所以就请教你下

@argusswift
Copy link
Owner

@argusswift argusswift commented Aug 26, 2020

没有测试过,但结果应该不

这个我进行过测试, 训练不出来模型, 所以就请教你下

是用我的代码没有训练出模型?不加载预训练模型的话,还是可以训练的,不过训练会比较慢,很难拟合大型数据集并且精度会有所降低。

@xiaowuge1201
Copy link
Author

@xiaowuge1201 xiaowuge1201 commented Aug 27, 2020

我从新训练下,不加载模型在VOC2007上我训练了51个epoch,map能达到64%

@xiaowuge1201
Copy link
Author

@xiaowuge1201 xiaowuge1201 commented Aug 27, 2020

不过你的数据处理部分是没有mosaic的

@sudo-rm-covid19
Copy link
Contributor

@sudo-rm-covid19 sudo-rm-covid19 commented Oct 12, 2020

Hi @argusswift ,
I think there is a bug in the forward function of CSPStage and CSPFirstStage:

        x0 = self.split_conv0(x)
        x1 = self.split_conv1(x)

        x1 = self.blocks_conv(x1)

        x = torch.cat([x0, x1], dim=1) # where [x1, x0] should be used as it is in the original implementation
        x = self.concat_conv(x)

Thanks.

@argusswift
Copy link
Owner

@argusswift argusswift commented Oct 13, 2020

Hi @argusswift ,
I think there is a bug in the forward function of CSPStage and CSPFirstStage:

        x0 = self.split_conv0(x)
        x1 = self.split_conv1(x)

        x1 = self.blocks_conv(x1)

        x = torch.cat([x0, x1], dim=1) # where [x1, x0] should be used as it is in the original implementation
        x = self.concat_conv(x)

Thanks.

Thanks, your are right.If possible, you can pull requests with the modified code.Thank you again!

@mercuryson
Copy link

@mercuryson mercuryson commented May 30, 2021

这个提供的YOLOV4 DARKNET PRE-TRAINED WEIGHT是在COCO上训练的吗?为什么上来TOTAL LOSS非常的大。。。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue
Projects
None yet
Development

No branches or pull requests

4 participants