Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deleting/changing word #134

Closed
pijaniintergral opened this issue Jul 16, 2018 · 3 comments
Closed

Deleting/changing word #134

pijaniintergral opened this issue Jul 16, 2018 · 3 comments
Assignees

Comments

@pijaniintergral
Copy link

Is it possible to add to flat option to delete unwanted text? I understand that a "word" cannot be completly deleted due to how the structure is working in Folia, but maybe adding an option so that the unwanted text can be edited in text field to some sign or space o somehow to be marked?
My problem is when tranforming from html to text, a lot of unwanted text stays in the document I would like to have possiblity to somehow "delete" it. I cannot pre-clean the text, because I am trying to automate the process of transforming html to text and it is not so easy (for me) to predict all possible unwanted text.

@proycon proycon self-assigned this Jul 16, 2018
@proycon
Copy link
Owner

proycon commented Jul 16, 2018

Good point and a valid request indeed. FoLiA and FLAT do have extensive correction facilities which would already allow you to explicitly mark the deletion of a word (using the Correction edit form, . https://flat.readthedocs.io/en/latest/user_guide.html#edit-forms), but you seem to require a more basic manipulation of words. This has actually long been planned in the form of a separate structure editor in FoLiA (#5), which would allow deleting and adding structure elements (including words). Implementation of this is in a very unfinished (=unusable) state as it never been a high priority due to there not being any demand for it yet.

Perhaps this request can provide the necessary incentive to take that up again, although it would still take a while (at least a month) before I can get to it. In the meantime, perhaps the more complex correction facilities may help. You can also use the foliacorrect --corrected command-line tool (part of the FoLiA-tools) to strip the explicit corrections from the FoLiA again and retain only the new tokens.

@pijaniintergral
Copy link
Author

Thanks for a quick responce! Will try the suggested ways of dealing with my unwanted words :).

@proycon
Copy link
Owner

proycon commented Feb 7, 2024

I believe this was implemented a while back, deleting the text of a word now deletes the word (I think).. Closing this old issue.

@proycon proycon closed this as completed Feb 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants