Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Google Scholar citation_xx metadata tags no longer visible to the crawler #3804

Open
misilot opened this issue Jan 7, 2025 · 2 comments
Open
Labels
bug cannot reproduce Unable to reproduce at this time, so the ticket either needs more information or needs closing component: SEO Search Engine Optimization

Comments

@misilot
Copy link
Contributor

misilot commented Jan 7, 2025

Describe the bug

We received this message from Google, we are running 7.6.2 for reference

I hope that this message finds you well! My name is Ollie, and I work at Google Scholar.

I'm writing regarding krex.k-state.edu -- I hope you're still the right contact.

The indexing system recently alerted us that the citation_xx metatags (which Scholar uses for indexing) are no longer visible to the crawler. It's possible that a recent update resulted in the metatags being loaded by JavaScript and not server-side rendered.

For example:
view-source:https://krex.k-state.edu/items/19dfeba9-33a1-4173-98fd-170115ec868f
view-source:https://krex.k-state.edu/items/1f6701b3-08fa-459a-8d1d-5f1b3c8ea9dc

both show the citation_xx metatags in a browser, but not for the crawler.

Would it be possible to take a look?

Let me know if you have any questions.

Many thanks in advance,

To Reproduce

Steps to reproduce the behavior:

  1. Do this
  2. Then this...

Expected behavior

A clear and concise description of what you expected to happen.

Related work

Link to any related tickets or PRs here.

@misilot misilot added bug needs triage New issue needs triage and/or scheduling labels Jan 7, 2025
@tdonohue
Copy link
Member

tdonohue commented Jan 7, 2025

@misilot : we'd need more information here. It looks, to me, like the "citation_*" tags exist in your server-side rendered HTML. If I "Disable Javascript" in Chrome, and access one of those pages, I can "view source" and see the tags (search for "citation_title" for example).

This looks like the same behavior that we see on https://demo.dspace.org/

So, I don't see a bug here. Or, I don't have enough information to reproduce it.

@misilot
Copy link
Contributor Author

misilot commented Jan 7, 2025

I can reach out to Google and see if I can get more information or ways to for us to reproduce what they are seeing.

@tdonohue tdonohue added component: SEO Search Engine Optimization cannot reproduce Unable to reproduce at this time, so the ticket either needs more information or needs closing and removed needs triage New issue needs triage and/or scheduling labels Jan 7, 2025
@tdonohue tdonohue moved this to ❓ Needs Info in DSpace Backlog Jan 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug cannot reproduce Unable to reproduce at this time, so the ticket either needs more information or needs closing component: SEO Search Engine Optimization
Projects
Status: Needs Info
Development

No branches or pull requests

2 participants