Jiapeng Wu issues

Results 7 issues of Jiapeng Wu

What's the correct value of "conf" during training?

Hello, thanks for your excellent work. When I train on the VisDrone dataset, the conf during training decreases to around 0.002 after 35 epochs, and the Precision is good but...

Multi-class MOT in QDTrack

Thanks for your error report and we appreciate it a lot. **Checklist** 1. I have searched related issues but cannot get the expected help. 2. I have read the [FAQ...

Where is ckpts/mmdet/faster_rcnn_r50_caffe_fpn_person_ap551.pth?

Thanks for your amazing work. May I ask where to download the checkpoint file that is referenced in the MOT config file: `ckpts/mmdet/faster_rcnn_r50_caffe_fpn_person_ap551.pth`? Many thanks.

Go here if you want to compute evaluation metrics

The `TrackEval` section in `track.py` provides inaccurate evaluation for multi-class datasets because, in the multi-class case, it can only treat all classes as valid classes; otherwise the metrics are usually surprisingly low. I therefore recommend evaluating the generated result txt files directly with the authoritative [TrackEval](https://github.com/JonathonLuiten/TrackEval) library. If you find TrackEval too cumbersome to use, you can refer to my repository [Easier_To_Use_TrackEval](https://github.com/JackWoo0831/Easier_To_Use_TrackEval).

Attention please

Recommending a better repo to everyone

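When I wrote this code, I did not expect so many people to see it. However, **I must admit that this code was written to integrate as much as possible and includes my own understanding, so some parts may differ from the original papers, and the results are not necessarily the best.** **For this reason, I recommend a mature repo: [https://github.com/mikel-brostrom/yolo_tracking](https://github.com/mikel-brostrom/yolo_tracking)** You can use my code for learning, that is, for getting familiar with the MOT pipeline. If you are after better results, I suggest adopting the more mature repos. I will keep listening to everyone's questions and suggestions, and I hope we can learn together!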

Attention please

add referring tracker tool for track anything

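Hello authors! I am a student in the second session of the InternLM (书生·浦语) LLM training camp. For the project phase, I wanted to do a little research on agents in connection with my own research area (object tracking). I found that agentlego has some tools for detection and segmentation, but none for tracking. If a user inputs a video and wants the LLM to track a particular object in it, the tracking tool I wrote may come in handy. Technically, it uses GLIP or SAM for detection or segmentation, and then a ByteTrack-like algorithm for tracking. To support video (or image-sequence folder) input, I also changed some content in `types.py`. The changed parts are `agentlego/agentlego/tools/tracking` and `types.py`. Below is an example and its result; the LLM is InternLM-1.8b or 7b. - prompt: `Please segment and track the laptop and cup in the video `test.mp4`` - Result: ![nan22](https://github.com/InternLM/agentlego/assets/49953745/3a29da17-285f-4ceb-963e-9e79c16fc632)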
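To make "a ByteTrack-like algorithm" more concrete, here is a minimal sketch of the two-stage association idea: match existing tracks to high-confidence detections first, then try the leftover tracks against low-confidence detections. This is not the code from the pull request. The function names, the thresholds, and the box format `(x1, y1, x2, y2)` are assumptions made for illustration, and greedy IoU matching stands in for the Kalman-filter prediction and Hungarian assignment that ByteTrack itself uses.

```python
def iou(a, b):
    """IoU of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def greedy_match(tracks, dets, iou_thresh):
    """Greedily pair track ids with detection indices, highest IoU first.

    tracks: {track_id: box}; dets: list of (box, score).
    Returns {track_id: detection_index}.
    """
    pairs = sorted(
        ((iou(box, dets[j][0]), tid, j)
         for tid, box in tracks.items()
         for j in range(len(dets))),
        reverse=True,
    )
    matches, used_dets = {}, set()
    for overlap, tid, j in pairs:
        if overlap < iou_thresh:
            break  # all remaining pairs overlap even less
        if tid in matches or j in used_dets:
            continue
        matches[tid] = j
        used_dets.add(j)
    return matches


def associate(tracks, detections, high=0.5, iou_thresh=0.3):
    """Two-stage association (hypothetical thresholds).

    Returns (updated_tracks, unmatched_high_confidence_boxes).
    """
    high_dets = [d for d in detections if d[1] >= high]
    low_dets = [d for d in detections if d[1] < high]

    # Stage 1: all tracks against high-confidence detections.
    first = greedy_match(tracks, high_dets, iou_thresh)
    updated = {tid: high_dets[j][0] for tid, j in first.items()}

    # Stage 2: still-unmatched tracks against low-confidence detections.
    remaining = {tid: box for tid, box in tracks.items() if tid not in first}
    second = greedy_match(remaining, low_dets, iou_thresh)
    updated.update({tid: low_dets[j][0] for tid, j in second.items()})

    # Unmatched high-confidence detections are candidates for new tracks.
    matched_high = set(first.values())
    new_boxes = [d[0] for j, d in enumerate(high_dets) if j not in matched_high]
    return updated, new_boxes


tracks = {1: (0, 0, 10, 10), 2: (20, 20, 30, 30)}
detections = [
    ((1, 1, 11, 11), 0.9),    # high confidence, overlaps track 1
    ((21, 21, 31, 31), 0.2),  # low confidence, overlaps track 2
    ((50, 50, 60, 60), 0.8),  # high confidence, overlaps nothing
]
updated, new_boxes = associate(tracks, detections)
```

In this toy frame, track 2 is kept alive only through the second stage, which is the point of using low-confidence detections instead of discarding them.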

It seems that the loss easily becomes NaN when adopting the checkpoint of Stable Diffusion v2.1

The reason is that when calculating the attention, some elements of the product of `Q` and `K` become extremely large (`~3e5`) at line `319`, in `class SparseCausalAttention`. But when I changed...
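One way a value near `3e5` can lead to NaN is sketched below. The snippet does not say which precision was used, so this assumes half precision (fp16), whose largest finite value is 65504: a logit of about `3e5` then becomes `inf`, and the usual max-subtracted softmax computes `inf - inf`, which is NaN. The helper names `to_fp16` and `softmax` and the example logits are hypothetical and are not taken from the repository.

```python
import math
import struct

FP16_MAX = 65504.0  # largest finite IEEE 754 half-precision value


def to_fp16(x: float) -> float:
    """Round x to half precision; out-of-range values become +/-inf."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        # struct refuses to pack values beyond the fp16 range;
        # treat that as an overflow to infinity for this illustration.
        return math.copysign(math.inf, x)


def softmax(xs):
    """Max-subtracted softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical attention logits for one query; one entry is around 3e5.
logits = [3e5, 1.0, 2.0]

probs_fp32 = softmax(logits)                        # well defined: [1.0, 0.0, 0.0]
probs_fp16 = softmax([to_fp16(x) for x in logits])  # every entry is NaN
```

If this is what happens, keeping the logits inside the fp16 range (for example by computing the `Q`-`K` product in higher precision) would avoid the `inf`; whether that matches the change the issue author made is not visible in the truncated text.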