[lexical-markdown] Bug Fix: support link and inline code text formats #7004

AlessioGr · 2024-12-30T18:23:02Z

Fixes #5148. Additionally, it fixes an issue where formatted inline code, e.g.

**`code`**

is not exported to markdown correctly after it has been imported.

This PR refactors the logic for applying text match and text format transformers to enable support for nested text formats within text match transformers, such as link nodes.

Unlike the previous attempt to fix it, this change does not include any node-specific logic. It fixes the root cause of the issue by ensuring that nested combinations of text match and text format transformers are applied in the correct order.

Before

CleanShot.2024-12-30.at.11.56.42.mp4

After

CleanShot.2024-12-30.at.11.56.03.mp4

The Problem

Previously, the import process for text transformers roughly followed this sequence:

ElementTransformers ($importBlocks) => TextFormatTransformers => (if no match found) TextMatchTransformers

For link nodes containing formatted text, the process failed as follows:

1. Executed $importBlocks
2. Found code markdown within link => Run code text format transformer => create code textnode
3. The input text is split into 3 nodes: normal text, code text, and normal text. However, this fragmented structure prevents the link text match transformer from recognizing and creating a link node.
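
To make the failure concrete, here is a minimal sketch that uses plain strings in place of TextNodes and a simplified link regex. Both are assumptions for illustration, not the code in this PR. It only shows that once the text has been split, no single node contains the full link markdown anymore.

```typescript
// Hypothetical simplification: each string stands for the text of one TextNode.
// LINK_REGEX is a reduced pattern for illustration, not the transformer's regex.
const LINK_REGEX = /\[([^\]]+)\]\(([^)\s]+)\)/;

// Before any text format transformer runs, the link markdown sits in one node.
const singleNode: string[] = ['See [`docs`](https://lexical.dev) here'];

// After the code transformer ran first, the same text is spread over three
// nodes: normal text, code text (backticks consumed), and normal text.
const splitNodes: string[] = ['See [', 'docs', '](https://lexical.dev) here'];

// A text match transformer inspects one node at a time.
const matchesAny = (nodes: string[]): boolean =>
  nodes.some((text) => LINK_REGEX.test(text));

console.log(matchesAny(singleNode)); // true
console.log(matchesAny(splitNodes)); // false
```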

Initially, I attempted to solve this issue by adjusting the sequence to prioritize text match transformers:

ElementTransformers ($importBlocks) => TextMatchTransformers => TextFormatTransformers

While this resolved the issue with nested formats, it introduced a new problem in scenarios where links were wrapped by text format markdown, like this:

Text **boldstart [text](https://lexical.dev) boldend** text

Now the link is created first, producing normal text, link, normal text. However, the bold text transformer can then no longer identify and apply formatting to the entire outer bold range.

The Solution

Text format transformers already include logic to identify the outermost match, allowing them to handle scenarios like:

One **two _three_ four**

In this case, the bold transformer runs first, followed by the italic transformer, ensuring proper formatting.

However, text match transformers currently lack similar logic. Consider this example:

Text **boldstart [`text`](https://lexical.dev) boldend** text

The existing sequence processes it as:

1. Bold text
2. Code text
3. Link

However, to achieve correct results, the sequence should be:

1. Bold text
2. Link
3. Code text

To address this, the PR introduces logic for identifying the outermost match across both text match and text format transformers.

With this change, text match and text format transformers are treated as equals in priority, allowing their results to be compared directly. This ensures that the outermost match, whether from a text match or a text format transformer, is correctly identified and then applied, enabling nested text transformations to be handled correctly.

Implementation details

1. Find (not apply) outermost text match
2. Find (not apply) outermost text format
3. Determine if found text match or text format is the outermost match
4. Apply said text match or text format
5. Repeat this for the matched node, and the node before / afterwards if text was split, until no more text matches and text formats are found
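
The steps above can be sketched on plain strings. This is a simplified illustration under stated assumptions, not the implementation in this PR: the three regexes, the `Match` type, and the `findOutermost` and `render` helpers are hypothetical, and "outermost" is approximated as "starts earliest, and on a tie ends last".

```typescript
// Hypothetical model of a candidate match; not a type from @lexical/markdown.
interface Match {
  kind: 'bold' | 'code' | 'link';
  start: number; // index of the first markdown character
  end: number; // index just past the last markdown character
  inner: string; // text wrapped by the markdown
}

// Simplified patterns for illustration only.
const PATTERNS: Array<{ kind: Match['kind']; regex: RegExp }> = [
  { kind: 'bold', regex: /\*\*(.+?)\*\*/ }, // stands in for a text format
  { kind: 'code', regex: /`([^`]+)`/ }, // stands in for a text format
  { kind: 'link', regex: /\[([^\]]+)\]\([^)\s]+\)/ }, // stands in for a text match
];

// Steps 1-3: find (not apply) one candidate per pattern, keep the outermost.
function findOutermost(text: string): Match | null {
  let best: Match | null = null;
  for (const { kind, regex } of PATTERNS) {
    const m = regex.exec(text);
    if (m === null) continue;
    const candidate: Match = {
      kind,
      start: m.index,
      end: m.index + m[0].length,
      inner: m[1],
    };
    if (
      best === null ||
      candidate.start < best.start ||
      (candidate.start === best.start && candidate.end > best.end)
    ) {
      best = candidate;
    }
  }
  return best;
}

// Steps 4-5: apply the outermost match, then repeat for the matched text and
// for the text before and after it, until nothing matches anymore.
function render(text: string): string {
  const match = findOutermost(text);
  if (match === null) return text;
  const before = text.slice(0, match.start);
  const after = text.slice(match.end);
  // Code content is not transformed further; everything else is.
  const inner = match.kind === 'code' ? match.inner : render(match.inner);
  return render(before) + `${match.kind}(${inner})` + render(after);
}

console.log(render('Text **boldstart [`text`](https://lexical.dev) boldend** text'));
// Text bold(boldstart link(code(text)) boldend) text
```

Because every recursive call receives a strictly shorter string, the sketch always terminates. The nesting in the output (bold, then link, then code) matches the order described above.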

vercel · 2024-12-30T18:23:06Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
lexical	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Jan 21, 2025 6:43am
lexical-playground	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Jan 21, 2025 6:43am

github-actions · 2024-12-30T18:24:49Z

size-limit report 📦

Path	Size
lexical - cjs	29.07 KB (0%)
lexical - esm	28.89 KB (0%)
@lexical/rich-text - cjs	38.04 KB (0%)
@lexical/rich-text - esm	30.98 KB (0%)
@lexical/plain-text - cjs	36.57 KB (0%)
@lexical/plain-text - esm	28.26 KB (0%)
@lexical/react - cjs	39.84 KB (0%)
@lexical/react - esm	32.34 KB (0%)

AlessioGr · 2025-01-07T05:47:31Z

Just squeezed in another fix for formatted inline code markdown, e.g.

**`code`**

Didn't want to open a separate PR, as this one is a relatively large refactor and I wanted to make sure this fix is compatible

etrepum

I didn't look too carefully at the logic in the while loop other than to confirm that it should still terminate. I think the unit test coverage is probably sufficient to show that it's not more wrong than the status quo.

packages/lexical-markdown/src/MarkdownExport.ts

etrepum · 2025-01-13T20:51:40Z

packages/lexical-markdown/src/MarkdownTransformers.ts

-    } else {
-      return linkContent;
-    }
+      ? `[${textContent}](${node.getURL()} "${title}")`


Presumably this title should be escaped because there could be embedded " and/or )?

Done - I escaped this with some simple regex.

Should we consider a library like dompurify to protect against XSS attacks here? While it's just markdown, XSS could still be an issue if (and depending on how) this is rendered to HTML.

I have reverted the escaping change. This broke a unit test and did not work when the markdown was re-imported to lexical.

The latter is a separate issue I have experienced, where escaped markdown is not un-escaped properly when imported. E.g. \*text\* is imported as <span>\*text\*</span> instead of <span>*text*</span>.

I think fixing this properly may not be trivial and is out of scope for this PR anyways, since this PR did not introduce this unescaped title being outputted
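
For reference, the kind of title escaping discussed in this thread could look like the sketch below. It is an illustration only: `escapeLinkTitle` and `formatLink` are hypothetical names, not functions from @lexical/markdown, and the change was reverted in this PR because the importer does not un-escape such output.

```typescript
// Hypothetical helper: backslash-escape characters that would end or corrupt a
// double-quoted markdown link title.
function escapeLinkTitle(title: string): string {
  // Escape backslashes first so the escapes added for quotes are not doubled.
  return title.replace(/\\/g, '\\\\').replace(/"/g, '\\"');
}

// Hypothetical exporter for a link with an optional title.
function formatLink(text: string, url: string, title?: string): string {
  return title
    ? `[${text}](${url} "${escapeLinkTitle(title)}")`
    : `[${text}](${url})`;
}

console.log(formatLink('docs', 'https://lexical.dev', 'say "hi"'));
// [docs](https://lexical.dev "say \"hi\"")
```

A closing parenthesis inside a double-quoted title does not need escaping in CommonMark, so only the quote and the backslash are handled here.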

etrepum · 2025-01-13T20:57:38Z

packages/lexical-markdown/src/importTextTransformers.ts

+      result.nodeAfter &&
+      $isTextNode(result.nodeAfter) &&
+      !result.nodeAfter.hasFormat('code')


With how often this expression is repeated it's probably worth making a function to cover $isTextNode(node) && !node.hasFormat('code'). Checking for non-null/undefined is redundant because $isTextNode already does that

done! I went for canContainTransformableMarkdown as that conveys the intent and makes the code where it's used more readable

While doing this, found another area that I was able to optimize:

I think we can move the remaining logic up to the $transform call out of the editor.update() call as well - what do you think?
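
A rough, self-contained sketch of the helper named in this thread. The name `canContainTransformableMarkdown` comes from the comment above; the body is an assumption based on the expression quoted in the review, and `NodeLike`, `TextNodeLike`, and `isTextNodeLike` are hypothetical stand-ins for Lexical's node types and `$isTextNode` so that the example runs without the library.

```typescript
// Minimal stand-ins for Lexical node types (assumptions for illustration).
interface NodeLike {
  getType(): string;
}
interface TextNodeLike extends NodeLike {
  hasFormat(format: string): boolean;
}

// Stand-in for $isTextNode: it also rejects null and undefined, which is why a
// separate non-null check before calling it is redundant.
function isTextNodeLike(node: NodeLike | null | undefined): node is TextNodeLike {
  return node != null && node.getType() === 'text';
}

// Text nodes with the code format are left alone: their content is literal.
function canContainTransformableMarkdown(
  node: NodeLike | null | undefined,
): node is TextNodeLike {
  return isTextNodeLike(node) && !node.hasFormat('code');
}

// Usage with plain objects standing in for nodes.
const plainText: TextNodeLike = { getType: () => 'text', hasFormat: () => false };
const codeText: TextNodeLike = {
  getType: () => 'text',
  hasFormat: (format) => format === 'code',
};
console.log(canContainTransformableMarkdown(plainText)); // true
console.log(canContainTransformableMarkdown(codeText)); // false
console.log(canContainTransformableMarkdown(null)); // false
```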

AlessioGr added 7 commits December 29, 2024 19:46

fix(markdown): incorrect markdown import for links containing text formats

01ecf0e

simpler test examples

3628b82

fix: markdown export for link nodes with text formats

eb0ca84

new text format & text match importing logic. This ensures that outermost matching applies to both text format and text match transformers, instead of just text format transformers

957bc66

add tests

054c865

fix export

320a569

remove console.log

5c4e11b

AlessioGr requested review from zurfyx, fantactuka, acywatson, Fetz, ivailop7, Sahejkm and potatowagon as code owners December 30, 2024 18:23

facebook-github-bot added the CLA Signed label Dec 30, 2024

Merge branch 'main' into fork/fix-md-link-text-formats

81f0a13

AlessioGr mentioned this pull request Dec 30, 2024

richtext-lexical: bold link markdown conversion not working payloadcms/payload#8279

Closed

fix lint errors

14cda52

fix build

b66b8cc

fix build

4dbded1

AlessioGr added a commit to payloadcms/payload that referenced this pull request Dec 30, 2024

fix(richtext-lexical): formatted link markdown conversion not working (#10269) Fixes #8279 Ports over facebook/lexical#7004

885e966

AlessioGr added 2 commits January 6, 2025 22:41

fix: formatted inline code block are not formatted in the correct order

8f051ad

Merge branch 'main' into fork/fix-md-link-text-formats

8afcb0d

AlessioGr changed the title ~~[lexical-markdown] Bug Fix: support link text formats~~ [lexical-markdown] Bug Fix: support link and inline code text formats Jan 7, 2025

more reliable fix

7f04ef3

etrepum added the extended-tests label Jan 13, 2025

etrepum previously approved these changes Jan 13, 2025

Merge branch 'main' into fork/fix-md-link-text-formats

113860c

AlessioGr added 2 commits January 20, 2025 22:48

perf: less array.include calls

4fd26bb

perf: shared canContainTransformableMarkdown function, avoid editor.update call if node can not contain transformable markdown

7987e04

AlessioGr dismissed etrepum’s stale review via 7987e04 January 21, 2025 06:03

Merge branch 'main' into fork/fix-md-link-text-formats

a820427

sanitize link title

a0898cd

undo change

51bf22b

AlessioGr requested a review from etrepum January 21, 2025 06:47
