Name		Name	Last commit message	Last commit date
parent directory ..
main.py		main.py
readme.md		readme.md

readme.md

Character Sets in Python 🅰️🅱️🔤

In this section, we will explore how Python handles different character sets in source files. Understanding character encoding is crucial for writing Python code that works across different languages and platforms.

Unicode and ASCII in Python 🌐

Python v3

Python v3 source files can use any Unicode character, encoded as UTF-8.
- UTF-8 is a variable-width character encoding that can represent every character in the Unicode character set.
- ASCII characters (with codes between 0 and 127) are encoded in UTF-8 as single bytes, making an ASCII text file perfectly valid as a Python v3 source file.

Python v2

Python v2 source files are typically made up of characters from the ASCII set.
- ASCII characters have codes between 0 and 127.

Specifying a Different Encoding 🎨

In both Python v2 and v3, you can inform Python that a source file uses a different encoding.
Python will use this specified encoding to read the file.

Example:

# coding: utf-8

This comment should be placed at the very beginning of your Python source file (right after the “shebang line” if present).
The coding: directive is also known as an encoding declaration.

Using Non-ASCII Characters ✒️

In Python v2, non-ASCII characters can only be used in comments and string literals.
The coding directive allows you to use these characters by specifying the appropriate encoding.

Example:

# coding: iso-8859-1
# This is a comment with a non-ASCII character: ñ
print("Hola, señor!")

Best Practices for Encoding 🚀

It is considered best practice to use UTF-8 for all your text files, including Python source files.
- UTF-8 is widely supported and ensures that your code can handle a wide variety of characters from different languages.

Example:

# coding: utf-8
print("こんにちは, 世界!")  # Prints "Hello, World!" in Japanese

This section covered how Python handles character sets and encodings. By understanding and using the correct encoding, you can write Python code that is robust, flexible, and compatible across different systems. 🌍

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

03_character_sets

03_character_sets

readme.md

Character Sets in Python 🅰️🅱️🔤

Unicode and ASCII in Python 🌐

Python v3

Python v2

Specifying a Different Encoding 🎨

Example:

Using Non-ASCII Characters ✒️

Example:

Best Practices for Encoding 🚀

Example:

Files

03_character_sets

Directory actions

More options

Directory actions

More options

Latest commit

History

03_character_sets

Folders and files

parent directory

readme.md

Character Sets in Python 🅰️🅱️🔤

Unicode and ASCII in Python 🌐

Python v3

Python v2

Specifying a Different Encoding 🎨

Example:

Using Non-ASCII Characters ✒️

Example:

Best Practices for Encoding 🚀

Example: