---
# For reference on model card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/modelcard.md?plain=1
# Doc / guide: https://huggingface.co/docs/hub/model-cards
{}
---

# CoReS: Orchestrating the Dance of Reasoning and Segmentation (ECCV 2024)


### Model Sources

<!-- Provide the basic links for the model. -->

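- **Project:** [https://chain-of-reasoning-and-segmentation.github.io/](https://chain-of-reasoning-and-segmentation.github.io/)
- **Paper:** [https://arxiv.org/abs/2404.05673](https://arxiv.org/abs/2404.05673)
- **Code:** [https://github.com/baoxiaoyi/CoReS](https://github.com/baoxiaoyi/CoReS)
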
## Uses

<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->

### Citation

If you find this project useful in your research, please consider citing:

```
@inproceedings{bao2024cores,
  title={{CoReS}: Orchestrating the Dance of Reasoning and Segmentation},
  author={Bao, Xiaoyi and Sun, Siyang and Ma, Shuailei and Zheng, Kecheng and Guo, Yuxin and Zhao, Guosheng and Zheng, Yun and Wang, Xingang},
  booktitle={European Conference on Computer Vision},
  pages={187--204},
  year={2024},
  organization={Springer}
}
```

### Acknowledgement
- This work builds upon [LISA](https://github.com/dvlab-research/LISA), [LLaVA](https://github.com/haotian-liu/LLaVA), and [SAM](https://github.com/facebookresearch/segment-anything).