Fix a few CSS selector issues #178

poire-z · 2018-05-05T15:34:41Z

Fix some of the issues reported in #176.

- Fix standalone #ID (it worked only as ELEM#ID).
- Fix non-lowercase element names (elements are lowercased internally, so the selector's element name must be lowercased too for any match to be possible).
- Fix the E+F selector, which should skip any text nodes immediately preceding F and test the first element node it meets.
- Add the E~F selector (like E+F, but *any* preceding sibling is considered for a match, not only the immediate predecessor).
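The difference between the two sibling combinators can be sketched as follows. This is only an illustration under assumptions: `Node`, `matchesAdjacentSibling` and `matchesGeneralSibling` are hypothetical names working on a flat list of siblings, not crengine's actual node API or the code in this PR.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical, simplified sibling model (not crengine's ldomNode).
struct Node {
    bool isElement;    // false for text nodes
    std::string name;  // lowercased element name, empty for text nodes
};

// E + F: starting just before siblings[index] (the F candidate), skip text
// nodes and test only the first element node met.
bool matchesAdjacentSibling(const std::vector<Node>& siblings,
                            std::size_t index, const std::string& e) {
    assert(index < siblings.size());
    for (std::size_t i = index; i-- > 0; ) {
        if (!siblings[i].isElement)
            continue;                   // ignore intervening text nodes
        return siblings[i].name == e;   // first element decides the match
    }
    return false;                       // no preceding element at all
}

// E ~ F: any preceding element sibling named E is enough.
bool matchesGeneralSibling(const std::vector<Node>& siblings,
                           std::size_t index, const std::string& e) {
    assert(index < siblings.size());
    for (std::size_t i = index; i-- > 0; ) {
        if (siblings[i].isElement && siblings[i].name == e)
            return true;
    }
    return false;
}
```

With siblings `h1, text, p, text, p`, the first `p` matches `h1 + p` (the text node in between is skipped), while the second `p` matches only `h1 ~ p` and `p + p`.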


Frenzie · 2018-05-05T15:36:14Z

crengine/include/lvstsheet.h

@@ -94,6 +94,7 @@ enum LVCssSelectorRuleType
    cssrt_parent,        // E > F
    cssrt_ancessor,      // E F
    cssrt_predecessor,   // E + F
+    cssrt_predsibling,   // E ~ F


Frenzie · 2018-05-05T15:37:57Z

crengine/src/lvstsheet.cpp

        else if ( css_is_alpha( *str ) )
        {
            // ident
            char ident[64];
            if (!parse_ident( str, ident ))
                return false;
-            _id = doc->getElementNameIndex( lString16(ident).c_str() );
+            _id = doc->getElementNameIndex( lString16(ident).lowercase().c_str() );


What's the effect on the XML vs HTML split?

HTML files (including those found in EPUB) are parsed by HTMLParser, a subclass of XMLParser that sets m_citags=true for case-insensitive tags (XMLParser has it set to false), which makes it lowercase the element names. So they are all lowercased internally when added to the DOM tree.

doc->getElementNameIndex( "DIV" ) would work (if I remember correctly) like: oh, you're asking me for DIV, I haven't seen it yet, let's make a new ID for DIV for you, here it is - but there would be no element with that new ID in the DOM.
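A minimal sketch of why the missing lowercasing could never match, assuming the name table behaves as recalled above (an unknown name gets a fresh ID). `NameTable` and `toLower` are hypothetical stand-ins, not crengine's document class or `lString16::lowercase()`.

```cpp
#include <cassert>
#include <cctype>
#include <map>
#include <string>

// Hypothetical stand-in for the document's element-name table.
class NameTable {
public:
    // Returns the ID for `name`, allocating a fresh one if it was never seen.
    int getElementNameIndex(const std::string& name) {
        assert(!name.empty());
        auto it = ids_.find(name);
        if (it != ids_.end())
            return it->second;
        int id = next_++;
        ids_[name] = id;
        return id;
    }

private:
    std::map<std::string, int> ids_;
    int next_ = 1;
};

// ASCII lowercasing helper for the sketch.
std::string toLower(std::string s) {
    for (char& c : s)
        c = static_cast<char>(std::tolower(static_cast<unsigned char>(c)));
    return s;
}
```

If the parser registered `div` while building the DOM, a selector that asks for `DIV` gets a different, freshly allocated ID that no node carries; asking for `toLower("DIV")` returns the ID the nodes actually use.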

And this CSS rule only applies to HTML?

It is applied to each DOM node: the selector gets that node's element name ID (which will always be for a lowercase name) and compares it to its own _id (which is now for a lowercase name too).
I don't think we build DOM from anything but HTML.

So what happens to the XHTML? I thought you just said it's handled case sensitive? :-P

Both standalone HTML and the HTML/XHTML found in EPUB are parsed by the lowercasing HTMLParser (only the other XML files in the EPUB, such as OPF and NCX, are parsed by XMLParser).
This HTMLParser then feeds a "documentWriter" with tags, attributes and text.
Standalone HTML uses ldomDocumentWriterFilter, which handles autoclose and LIB.RU.
EPUB uses ldomDocumentFragmentWriter, which doesn't do autoclose, but deals with appending the bodies of multiple HTML files as docFragments into a single DOM.
But both are fed with lowercased element tag names.
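The parser-to-writer flow described above might be pictured like this. It is a rough sketch under assumptions: `TagSink`, `RecordingSink` and `SketchParser` are made-up names, and the `caseInsensitiveTags` flag only mimics the role of `m_citags`; the real classes do far more.

```cpp
#include <cctype>
#include <string>
#include <vector>

// Hypothetical writer interface: receives tag names from a parser.
struct TagSink {
    virtual void onTagOpen(const std::string& name) = 0;
    virtual ~TagSink() = default;
};

// Records what it is fed, standing in for a DOM-building writer.
struct RecordingSink : TagSink {
    std::vector<std::string> names;
    void onTagOpen(const std::string& name) override { names.push_back(name); }
};

// Hypothetical parser: lowercases tag names only when the flag is set,
// so the sink behind it never sees the original case in that mode.
class SketchParser {
public:
    SketchParser(TagSink& sink, bool caseInsensitiveTags)
        : sink_(sink), ciTags_(caseInsensitiveTags) {}

    void emitTag(std::string name) {
        if (ciTags_) {
            for (char& c : name)
                c = static_cast<char>(std::tolower(static_cast<unsigned char>(c)));
        }
        sink_.onTagOpen(name);
    }

private:
    TagSink& sink_;
    bool ciTags_;
};
```

Whichever sink sits behind a parser constructed with the flag set to `true`, it receives `div` for a source tag `DIV`; with the flag set to `false` the name is passed through unchanged.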

Weird, but okay in practice I suppose. :-p

Frenzie reviewed May 5, 2018

Frenzie approved these changes May 5, 2018

poire-z merged commit 5cd06ec into koreader:master May 5, 2018

poire-z deleted the css_fixes branch May 5, 2018 16:56

Frenzie added a commit to Frenzie/koreader-base that referenced this pull request May 5, 2018

bump crengine for: Fix a few CSS selector issues

93ff538

koreader/crengine#178

Frenzie mentioned this pull request May 5, 2018

bump crengine for: Fix a few CSS selector issues koreader/koreader-base#668

Merged

Frenzie added a commit to koreader/koreader-base that referenced this pull request May 5, 2018

bump crengine for: Fix a few CSS selector issues (#668)

c835232

koreader/crengine#178

poire-z mentioned this pull request May 6, 2018

bump base/crengine: stylesheet handling improvements koreader/koreader#3934

Merged

poire-z mentioned this pull request Jun 13, 2018

Adjacent sibling CSS selectors are not supported koreader/koreader#2845

Closed


