Lazy-load emoji module to improve performance #8109

LukasMasuch · 2024-02-08T11:50:49Z

Describe your changes

The emoji data is the biggest object when running a blank Streamlit app. Compiling the regex is also slightly expensive. However, the emoji data is only required if there is a check for emojis; many apps might not require this. Therefore, this PR makes the emoji module to lazy load only if it is actually required.

This also adds a precheck for emoji checks to make sure that the string even contains non alphanumeric characters before using the more expensive emoji regex.

GitHub Issue Link (if applicable)

Related to #6066

Testing Plan

Added e2e test to check that some lazy-loaded modules are not imported in an almost blank Streamlit app.

Contribution License Agreement

By submitting this pull request you agree that all contributions to this project are made under the Apache 2.0 license.

kajarenc · 2024-02-08T15:39:19Z

lib/streamlit/string_util.py

@@ -40,14 +36,29 @@ def clean_text(text: "SupportsStr") -> str:
    return textwrap.dedent(str(text)).strip()


+def _contains_special_chars(text: str) -> bool:


@LukasMasuch If I understand correctly this code, this method could be replaced with python built-in
str.isalnum method

## Describe your changes The emoji data is the biggest object when running a blank Streamlit app. Compiling the regex is also slightly expensive. However, the emoji data is only required if there is a check for emojis; many apps might not require this. Therefore, this PR makes the emoji module to lazy load only if it is actually required. This also adds a precheck for emoji checks to make sure that the string even contains non alphanumeric characters before using the more expensive emoji regex. ## GitHub Issue Link (if applicable) Related to streamlit#6066 ## Testing Plan - Added e2e test to check that some lazy-loaded modules are not imported in an almost blank Streamlit app. --- **Contribution License Agreement** By submitting this pull request you agree that all contributions to this project are made under the Apache 2.0 license.

LukasMasuch added 4 commits February 8, 2024 12:39

Lazy-load emojis data

2191887

Update comment

3ecf520

Update comment

cc7076e

Remove duplicated regex

671bbc7

LukasMasuch added the security-assessment-completed label Feb 8, 2024

LukasMasuch changed the title ~~Lazy-load the emoji data~~ Lazy-load the emoji data to improve performance Feb 8, 2024

LukasMasuch added change:refactor impact:users labels Feb 8, 2024

LukasMasuch changed the title ~~Lazy-load the emoji data to improve performance~~ Lazy-load emoji data to improve performance Feb 8, 2024

Add e2e tests

2e3c1da

LukasMasuch marked this pull request as ready for review February 8, 2024 13:27

LukasMasuch changed the title ~~Lazy-load emoji data to improve performance~~ Lazy-load emoji module to improve performance Feb 8, 2024

Update comment

ddf7f53

kajarenc approved these changes Feb 8, 2024

View reviewed changes

LukasMasuch merged commit 7c5b6a0 into develop Feb 8, 2024
38 checks passed

kajarenc reviewed Feb 8, 2024

View reviewed changes

LukasMasuch mentioned this pull request Feb 9, 2024

Streamlit startup time could be reduced from 1s to 400ms #6066

Closed

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lazy-load emoji module to improve performance #8109

Lazy-load emoji module to improve performance #8109

LukasMasuch commented Feb 8, 2024 •

edited

kajarenc Feb 8, 2024

		@@ -40,14 +36,29 @@ def clean_text(text: "SupportsStr") -> str:
		return textwrap.dedent(str(text)).strip()


		def _contains_special_chars(text: str) -> bool:

Lazy-load emoji module to improve performance #8109

Lazy-load emoji module to improve performance #8109

Conversation

LukasMasuch commented Feb 8, 2024 • edited

Describe your changes

GitHub Issue Link (if applicable)

Testing Plan

kajarenc Feb 8, 2024

Choose a reason for hiding this comment

LukasMasuch commented Feb 8, 2024 •

edited