
Feature request: read_excel to support hyperlinks split into label and hyperlink #13439

@DocViper666

Situation:
An .xlsx file has a column whose cells contain a label with a hyperlink attached (see screenshot collage).
In Excel, the cell shows the label "Python", which links to e.g. www.python.org.

Code Sample, a copy-pastable example if possible

import pandas as pd
data = pd.read_excel('Book1.xlsx')
The column contains just "Python". The hyperlink information is lost.
(see screenshot)

[Screenshot: excel-hyperlink]

Expected Output, Proposal:

Add an argument, for example named hyperlinkparser, to read_excel()
that offers the options "label", "hyperlink" and "split":
label ... just the label (this is how it is handled now)
hyperlink ... just the hyperlink, drop the label
split ... create two separate columns, one for the label and one for the hyperlink

With the example above, the "split" result could look like:
  | MyLinks | MyLinks_hyperlink
0 | Python  | www.python.org
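
To make the three proposed behaviours concrete, here is a minimal sketch that applies them to data already in memory. The helper apply_hyperlink_mode and its parameters are hypothetical and exist only for illustration; read_excel() has no such option, and the sketch assumes the hyperlink targets have already been extracted into a list.

```python
import pandas as pd


def apply_hyperlink_mode(df, column, links, mode="label"):
    """Hypothetical helper (not part of pandas) mimicking the proposed options.

    df     -- DataFrame whose `column` currently holds the cell labels
    links  -- hyperlink targets, one per row, assumed to be extracted already
    mode   -- "label", "hyperlink" or "split", as described in the proposal
    """
    if mode not in {"label", "hyperlink", "split"}:
        raise ValueError(f"unknown mode: {mode!r}")

    out = df.copy()
    if mode == "hyperlink":
        # Replace the labels with the link targets.
        out[column] = links
    elif mode == "split":
        # Keep the labels and add a second column right after them.
        position = out.columns.get_loc(column) + 1
        out.insert(position, f"{column}_hyperlink", links)
    # mode == "label": leave the labels untouched (current behaviour).
    return out


labels = pd.DataFrame({"MyLinks": ["Python"]})
targets = ["www.python.org"]

result = apply_hyperlink_mode(labels, "MyLinks", targets, mode="split")
print(result)
```

The column suffix _hyperlink follows the example table; any other naming scheme would work just as well.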

By the way, openpyxl 2.4+ now includes this feature (it was broken in previous versions). Maybe this helps with the implementation.
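
Until something like this exists in read_excel(), the hyperlinks can probably be read with openpyxl directly. The sketch below is one possible workaround, not a pandas feature: it builds a small workbook in memory as a stand-in for Book1.xlsx, then reads each cell's label and hyperlink target. The function name read_labels_and_links and the assumption that the links sit in column A below a header row are illustrative only.

```python
from io import BytesIO

from openpyxl import Workbook, load_workbook

# Build a tiny workbook in memory (stand-in for Book1.xlsx).
wb = Workbook()
ws = wb.active
ws.append(["MyLinks"])                          # header row
ws["A2"] = "Python"                             # cell label
ws["A2"].hyperlink = "https://www.python.org"   # link attached to the cell
buffer = BytesIO()
wb.save(buffer)
buffer.seek(0)


def read_labels_and_links(source, column_letter="A"):
    """Return (label, target) pairs for one column, skipping the header row.

    Hypothetical helper: assumes the first worksheet and a single header row.
    """
    sheet = load_workbook(source).active
    pairs = []
    for cell in sheet[column_letter][1:]:
        # cell.hyperlink is None when the cell carries no link.
        target = cell.hyperlink.target if cell.hyperlink else None
        pairs.append((cell.value, target))
    return pairs


pairs = read_labels_and_links(buffer)
print(pairs)
```

The resulting pairs could then be turned into a DataFrame with one column for the labels and one for the targets.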

Thank you!
