Support multi-lines text in table cells with write_html #91

Lucas-C · 2021-01-23T15:10:31Z

Example HTML to use:

<table border="1"><thead><tr>
    <th width="30%">First name</th><th width="30%">Last name</th><th width="15%">Age</th><th width="25%">City</th>
</tr></thead><tbody><tr>
    <td>Jean Abdul William</td><td>Smith</td><td>34</td><td>San Juan</td>
</tr></tbody></table>

The text in the first cell of the first row currently overflows.

A starting point would be to add a dedicated test in test_html.py.

In terms of code, HTML2FPDF.handle_data could be changed to call self.pdf.multi_cell(... ln=3) instead of FPDF.cell.
Inspiration could be taken from this test: https://github.com/PyFPDF/fpdf2/blob/master/test/cells/test_multi_cell.py#L85
Corresponding output: https://github.com/PyFPDF/fpdf2/blob/master/test/cells/multi_cell_ln_3_table.pdf

This issue can count as part of Hacktoberfest.

RubendeBruin · 2021-04-04T13:27:40Z

As a contribution, I modified the write-table example so that the height of the whole row is increased:

from fpdf import FPDF

pdf = FPDF()
pdf.add_page()
pdf.set_font("Times", size=10)

data = (
    ("First name", "Last name", "Age", "City"),
    ("Jules", "Smith", "34", "San Juan"),
    ("Mary", "Ramos\n \n Ramos \n    Ramos", "45", "Orlando"),
    ("Lucas", "Cimon", "Saint-Mahturin-sur-Loire - it may even be so long that multiple lines are needed to write it down completely", "49"),
    ("Carlson", "Banks", "19", "Los Angeles"),
)

line_height = pdf.font_size * 1.5
col_width = pdf.epw / 6  # same width for every column

for row in data:
    # First pass: count the wrapped lines in each cell to find the tallest one.
    row_height_lines = 1
    lines_in_row = []
    for datum in row:
        output = pdf.multi_cell(col_width, line_height, datum, border=1, ln=3, split_only=True)
        lines_in_row.append(len(output))
        row_height_lines = max(row_height_lines, len(output))

    # Second pass: pad shorter cells with newlines so every border spans the full row height.
    for tlines, datum in zip(lines_in_row, row):
        text = datum.rstrip('\n') + (1 + row_height_lines - tlines) * '\n'
        pdf.multi_cell(col_width, line_height, text, border=1, ln=3)
    pdf.ln(row_height_lines * line_height)

Lucas-C · 2021-04-13T06:33:34Z

Thank you for sharing this code snippet @RubendeBruin

Yes, I think this is a good start, and this approach could be used in HTML2FPDF.handle_data to implement this feature.

RubendeBruin · 2021-08-17T09:10:13Z

Hi @Lucas-C, I've started to implement this.

Handling only the text is quite straightforward with multi_cell. Adding the borders and background is trickier, as you need to know the height of the cell, which may be dictated by unprocessed cells to the right of it.

To draw the borders and cell backgrounds you need to parse the whole row. handle_data only gives information about the current cell. I do not see an easy way to look ahead in the parser to see what is coming. This means I need to log all the cell info in handle_data until I reach the end of a row (the </tr> tag) and then process it.

The most logical implementation, I think, is to build a dataset with the row data and process it when handle_endtag encounters a tr:

handle_starttag : prepares an empty list for cell data
_insert_td : adds to the cell-data list, including the x-position from self._td_x()
handle_endtag : processes the cell data: looks at the number of text rows in each of the cells, then pads shorter cells with '\n' so that all the texts have the same number of rows (for drawing the border)
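
The three steps above can be sketched with the standard library alone. This is only an illustration of the buffering idea, not the HTML2FPDF code: the class name RowBufferingParser, the chars_per_line parameter and the _layout_row helper are hypothetical, and textwrap.wrap stands in for the line splitting that multi_cell would perform.

```python
from html.parser import HTMLParser
import textwrap


class RowBufferingParser(HTMLParser):
    """Hypothetical sketch: buffer the cells of a row and process them on </tr>."""

    def __init__(self, chars_per_line=10):
        super().__init__()
        self.chars_per_line = chars_per_line  # assumed stand-in for a real column width
        self.current_row = None  # list of cell texts, or None when outside a row
        self.in_cell = False
        self.rows = []  # processed rows: one list of wrapped lines per cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.current_row = []  # prepare an empty list for cell data
        elif tag in ("td", "th") and self.current_row is not None:
            self.current_row.append("")  # start a new, empty cell
            self.in_cell = True

    def handle_data(self, data):
        if self.in_cell:
            self.current_row[-1] += data  # only record the text, render nothing yet

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self.in_cell = False
        elif tag == "tr" and self.current_row is not None:
            # The whole row is known now, so its height can be computed.
            self.rows.append(self._layout_row(self.current_row))
            self.current_row = None
            self.in_cell = False

    def _layout_row(self, cells):
        # Wrap every cell, then pad shorter cells to the line count of the tallest one.
        wrapped = [textwrap.wrap(text, self.chars_per_line) or [""] for text in cells]
        height = max((len(lines) for lines in wrapped), default=1)
        return [lines + [""] * (height - len(lines)) for lines in wrapped]


parser = RowBufferingParser(chars_per_line=10)
parser.feed("<table><tr><td>Jean Abdul William</td><td>Smith</td></tr></table>")
print(parser.rows)
```

Deferring all layout to the end of the row means no look-ahead in the parser is needed: by the time the closing tr is seen, every cell of the row has been collected.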

Would this make sense or is there an easier way?

Lucas-C · 2021-08-17T14:58:23Z

Sounds reasonable to me!

This is very likely to induce some refactoring though.
Feel free to get rid of some properties, like self.table_col_index or self.table_col_width, if they are not needed anymore with this new approach!

Lucas-C · 2022-03-23T11:06:05Z

Status update: the PR attempts have been closed, so this issue is open for contributions!

Lucas-C · 2023-04-14T09:20:40Z

Since #703 has been released, multi-line content in cells is now supported!

Lucas-C added enhancement good first issue up-for-grabs labels Jan 23, 2021

Lucas-C changed the title ~~Support multi-lines text in write_html~~ Support multi-lines text in table cells with write_html Jan 23, 2021

Lucas-C mentioned this issue Apr 25, 2021

how to make long text in <td> break automatically? #137

Closed

divyaman777 added a commit to divyaman777/fpdf2 that referenced this issue Aug 29, 2021

Support multi-lines text in table cells with write_html py-pdf#91

5ff3a8d

This was referenced Sep 7, 2021

Failing unit-tests? #212

Closed

WIP: multi-line cells in table #213

Closed

Lucas-C removed good first issue up-for-grabs labels Oct 19, 2021

Lucas-C mentioned this issue Nov 12, 2021

Support multi-lines text in table cells with write_html - fix #91 #211

Closed

Lucas-C added the html label Nov 15, 2021

Lucas-C mentioned this issue Dec 9, 2021

Support html table with long text for <td> table data attribute #307

Closed

Lucas-C mentioned this issue Jan 29, 2022

Right-aligned HTML paragraph breaks inline font styling #151

Closed

Lucas-C mentioned this issue Mar 22, 2022

write_html() does not generate proper cell border if a cell contains a multi line text. #370

Closed

Lucas-C added up-for-grabs hacktoberfest labels Mar 23, 2022

Lucas-C mentioned this issue Apr 28, 2022

text running out of the <td><> #405

Closed

Lucas-C mentioned this issue Aug 17, 2022

Is there a way to set row height according to the highest column height in the row without calculating the highest column height? #494

Closed

Lucas-C added the table label Feb 14, 2023

Lucas-C mentioned this issue Apr 14, 2023

Bug: <center> does not support internal HTML tags #640

Closed

Lucas-C closed this as completed in d4b3745 Apr 14, 2023

gmischler mentioned this issue Oct 1, 2023

Text regions layout management #897

Merged

5 tasks
