Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data without so much context? #2

Open
Evelynhuang opened this issue Feb 21, 2023 · 1 comment
Open

Data without so much context? #2

Evelynhuang opened this issue Feb 21, 2023 · 1 comment

Comments

@Evelynhuang
Copy link

Hi,

Thank you for this very interesting paper and important research question and contribution. I just have a question. Is this model more suitable for long text data/documents and not so suitable for short texts data or data without much context (ex. very short open-ended survey answers)?

Best,
Evelyn

@ArthurSpirling
Copy link
Collaborator

@Evelynhuang -- thanks for your query. @prodriguezsosa put together some code for looking specifically at open-ended survey answers, actually. See this part of the guide

Assuming you already have some reasonable pretrained embeddings, we found that you can get reasonable ALC embeddings from just a single instance of a term (and its surrounding ~12 words in context)---see the trump/Trump example in the paper. But I'm not sure if that's the sort of thing you're asking about.

If you clarify your use-case a little more, I will try to say something more definitive.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants