Question about memory consolidation methods #150

Open
AbbeGo opened this issue Nov 6, 2024 · 4 comments

@AbbeGo

AbbeGo commented Nov 6, 2024

This is a result figure from an ablation experiment in your XMem paper. How did you conduct the comparison with the Random method, and do you have any hypotheses about why Random performs better than K-means? Additionally, the Random method does not perform significantly better than the Usage-based method. Looking forward to your response.
[image: ablation result figure from the XMem paper]

@hkchengrex
Owner

Hello, thank you for the question!

We do find random selection to be surprisingly effective. One hypothesis is that the more "used" memory locations are also over-represented (e.g., large objects or background regions), so a random selection is likely to pick them as well.
K-means is known to struggle in higher dimensions. Combined with the intuition above, it might under-represent those large clusters by assigning only one centroid and selecting more isolated points.
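
To make the first hypothesis concrete, here is a minimal toy sketch (not XMem code). It assumes a hypothetical memory bank in which each element is labeled with the region it came from, and it checks how uniform random selection distributes prototypes across regions. The function name, region sizes, and prototype count are illustrative assumptions only.

```python
import numpy as np


def random_selection_share(region_sizes, num_prototypes, seed=0):
    """Return the fraction of randomly selected prototypes that falls in each region."""
    rng = np.random.default_rng(seed)
    # Label every memory element with the index of the region it came from.
    labels = np.repeat(np.arange(len(region_sizes)), region_sizes)
    # Uniform selection without replacement over all memory elements.
    picked = rng.choice(labels.size, size=num_prototypes, replace=False)
    counts = np.bincount(labels[picked], minlength=len(region_sizes))
    return counts / num_prototypes


# Hypothetical memory bank: one large region (e.g., background) and ten small ones.
region_sizes = [900] + [10] * 10
share = random_selection_share(region_sizes, num_prototypes=64)
print(f"large region: {share[0]:.2f} of prototypes, {900 / 1000:.2f} of memory")
```

In this toy setting, random selection picks each region roughly in proportion to its size, so an over-represented region also receives most of the prototypes. It says nothing about how K-means or the usage-based criterion would behave on real memory features.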

@AbbeGo
Author

AbbeGo commented Nov 9, 2024

Thank you for your response, which has given me a new understanding of the memory mechanism.

AbbeGo closed this as completed Nov 9, 2024
AbbeGo reopened this Dec 17, 2024
@AbbeGo
Author

AbbeGo commented Dec 17, 2024

I would like to ask one more question: in your comparative experiments on different long-term memory mechanisms, were the experiments conducted using only long-term memory, or were the results in the table obtained with sensory memory and working memory also included? I am particularly interested in whether you have ever used only long-term memory for video tracking and how the results compared. I look forward to your response.

@hkchengrex
Owner

The other types of memory remain in use. Using only long-term memory would probably degrade performance too much.
