-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make the dataset search search on more fields #93
Comments
Thanks for this @andylolz, let me know if there is anything I can help with - explanations, testing etc |
Great! So the issue is with how I’m using the registry API. The query that’s run at the moment is: Line 11 in ccb6e01
So that’s just looking at the title. If you can figure out a better query, let me know! Or if you have a list of the fields that should be searched. The CKAN docs are here: https://docs.ckan.org/en/2.8/api/index.html |
Just a note to say that we are making some progress one this e.g. https://iatiregistry.org/api/3/action/package_search?fq=extras_filetype:organisation&q=BMZ returns files with the search string located outside of the title field - but we are now need to narrow down what it returns e.g. https://iatiregistry.org/api/3/action/package_search?fq=extras_filetype:organisation&q="DE" also returns files with the word 'description' in the returned data which we don't want. More soon..... |
Hi Andy - got a pull request coming for this coming from @kndm - a programmer I work with - we have done a few quick tests of a modified plugin and think it works well e.g. "bmz" now picks up the Germany file, "GB" returns all the orgs based in the UK etc, without being information overload or too many false positives. But you might decide that it is better with a narrower search field, or perhaps some other way.....let us know |
Oh, nice! Looks cool – I’ll test it out very shortly. |
Fixed in #96. |
@andylolz @kndm - in 1.3.1 a search for 'asdb' isn't bringing up the Asian Development Bank file....i.e. https://iatiregistry.org/api/3/action/package_show?id=asdb-org - which includes the string 'asdb' several times - have we missed a search field? |
Just checked, and this was the case at 1.3.0 too – so (thankfully!) unrelated to that change.
Line 17 in 2d96453
I’m not sure why that isn’t working! I was a bit suspicious of the underscore separator before merging this PR, but I tested it and it did seem to be doing the right thing. So I’m at a loss, I’m afraid! It might be worth us checking with CKAN developers (or even on a solr mailing list) to find out the best search string here, since this is not an IATI-specific problem. |
I'm not sure if it is related to this string in particular but the dash
("-") seems to be an issue in some searches, i.e the following query:
https://iatiregistry.org/api/3/action/package_search?fq=extras_filetype:organisation&q=name:asdb
Returns nothing
https://iatiregistry.org/api/3/action/package_search?fq=extras_filetype:organisation&q=name:asdb*
Returns the expected result by using the wildcard operator (*) which is
strange because the first query should already be listing it as a result.
El mar., 27 nov. 2018 a las 17:49, Andy Lulham (<[email protected]>)
escribió:
… Just checked, and this was the case at 1.3.0 too – so (thankfully!)
unrelated to that change.
It might be worth us checking with CKAN developers (or even on a solr
mailing list) to find out the best search string here, since this is not
IATI-specific.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#93 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AF35l8PlO0vJC4aStDDTV6hF8fCahyHZks5uzPyPgaJpZM4YfkaR>
.
|
Upon further inspection it seems some of the fields are not mapped properly, i.e organization_name may not be the name of the field for organization -> name (key) @andylolz do you happen to have any leads on to where I could better see documentation for these fields returned? :) |
@kndm I added a couple of links above:
But I think you’d be better off asking a solr mailing list, or possibly a CKAN mailing list. If you’re happy to keep looking into this, that would be great – I’ll be happy to review and merge a PR. |
Reopening this, since it still needs work (though thanks for the improvements so far, @kndm and @matmaxgeds!) |
This was raised again recently by two users separately, both times with the specific example of AfDB / African Development Bank. The former search works, the latter doesn’t (despite it being listed as the organisation name). |
The CKAN-dev mailing list page suggests searching the archive via: So e.g. this or perhaps this. There’s plenty of reading material there – I’d bet the answer lies within! |
@kndm happy for you to skip a bit of Somalia work to have another look at this - maybe you also got a reply to your post on the CKAN forums? |
I’ve posted the following: https://lists.okfn.org/pipermail/ckan-dev/2018-December/023005.html Does that look okay? Fingers crossed for a response! |
Thanks for the very detailed info Andy, I will be looking at this (and the
mailing list, though I got no response) in the following hours!
…On Tue, 18 Dec 2018 at 2:50 AM Andy Lulham ***@***.***> wrote:
I’ve posted the following:
https://lists.okfn.org/pipermail/ckan-dev/2018-December/023005.html
Does that look okay?
Fingers crossed for a response!
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#93 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AF35l2lyODPdzPSF3vMNcEWfIBjFUf0Yks5u59ltgaJpZM4YfkaR>
.
|
Okay, so it seems like the answer is: this isn’t possible without changes to the registry API :( |
Ooof/thanks for the detective work - is that something we can request changes to, as from my unenlightened position it is hard to understand why the org title field can't be queried? I guess the alternative is to download all the org files ourselves which isn't particularly appealing. |
Yep, same.
That would work, but I’m really not keen to do it because I think the registry API should be able to handle it. I’ve raised a ticket on the registry github (IATI/ckanext-iati#226), asking about the possibility of a plugin. |
Reported by Matt Geddes on discuss:
The text was updated successfully, but these errors were encountered: