This repository has been archived by the owner on Mar 18, 2024. It is now read-only.
Added model file - CallmeMehdi #66
Merged
Changes from 7 commits
Commits (12, all authored by CallmeMehdi)
c3892d3 Added model file - CallmeMehdi
fa36749 Fixed license, added information about model, and fixed participant l…
00f487b Fixeed directory name issue
9c62795 Rename CallmeMehdi.md to callmemehdi.md
1432393 Fixed model asset link
8f431de Fixed model path
5e861bd Fixed file by remove top-level directory
d8a9140 Fixed license link
3c015a4 Added usage example of the model
48a3a4e Merge branch 'master' of https://github.com/CallmeMehdi/tfhub.dev
2941e8d Added description of output, fixed model casing, and added hub load i…
7c98143 Fixed readme typo, and removed output[0] from readme
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,13 @@ | ||
# Publisher callmemehdi | ||
Mehdi Mabrouki. | ||
|
||
[![Icon URL]](https://avatars.githubusercontent.com/u/47258917?v=4) | ||
|
||
## Details | ||
Mehdi Mabrouki ([@CallmeMehdi](https://github.com/CallmeMehdi)). | ||
A Tunisian Software Engineer and Artificial Intelligence enthusiast, interested in Machine and Deep Learning. | ||
Google Summer of Code 2021 participant. | ||
|
||
**GitHub:** [@CallmeMehdi](https://github.com/CallmeMehdi)\ | ||
**LinkedIn:** [Mehdi Mabrouki](https://www.linkedin.com/in/mehdimabrouki/)\ | ||
**Personal Website:** [portfolio](https://portfolio.callmemehdi.vercel.app/) |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,44 @@ | ||
# Module callmemehdi/arabert/1 | ||
|
||
AraBERT is an Arabic pretrained language model based on Google's BERT architecture. | ||
|
||
<!-- asset-path: https://gsoctfarabert.web.app/arabert.tar.gz --> | ||
<!-- task: text-embedding --> | ||
<!-- fine-tunable: true --> | ||
<!-- format: saved_model_2 --> | ||
<!-- license: custom --> | ||
|
||
## Overview | ||
|
||
AraBERT is obtained by pre-training BERT specifically for the Arabic language, in pursuit of the same success that BERT achieved for English. The performance of AraBERT is compared to multilingual BERT (mBERT) from Google and other state-of-the-art approaches. | ||
MorganR marked this conversation as resolved.
|
||
|
||
## Performance | ||
|
||
Performance of AraBERT on Arabic downstream tasks compared to mBERT. | ||
Results: | ||
|
||
Task | Metric | mBERT | AraBERT | ||
------|--------|------ | ------- | ||
SA (HARD) | Acc. | 95.7 | 96.1 | ||
SA (ASTD) | Acc. | 80.1 | 96.5 | ||
SA (ArsenTD-Lev) | Acc. | 51.0 | 59.4 | ||
SA (AJGT) | Acc. | 83.6 | 93.8 | ||
SA (LABR) | Acc. | 83.0 | 86.7 | ||
NER (ANERcorp) | macro-F1 | 78.4 | 81.9 | ||
QA (ARCD) | macro-F1 | 61.3 | 62.7 | ||
|
||
MorganR marked this conversation as resolved.
|
||
|
||
## Reference | ||
|
||
``` | ||
@inproceedings{antoun2020arabert, | ||
title={AraBERT: Transformer-based Model for Arabic Language Understanding}, | ||
author={Antoun, Wissam and Baly, Fady and Hajj, Hazem}, | ||
booktitle={LREC 2020 Workshop Language Resources and Evaluation Conference 11--16 May 2020}, | ||
pages={9} | ||
} | ||
``` | ||
|
||
## License | ||
|
||
[SOFTWARE LICENSE AGREEMENT - AraBERT](https://github.com/snakers4/silero-models/blob/master/LICENSE) | ||
Review comment: Shouldn't this be https://github.com/aub-mind/arabert/blob/master/arabert/LICENSE?
Reply: Yes, absolutely, I think I didn't save when I changed it. Changing it right now.
Please update the 'arabert' casing here or in the folder name so that they match
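
A casing mismatch like this one could also be caught mechanically before review. The following is only a minimal sketch, not part of the PR or of any tfhub.dev tooling. It assumes the documentation file sits at a path shaped like `<publisher>/models/<name>/<version>.md` (the example path below is hypothetical) and that the file opens with a `# Module <publisher>/<name>/<version>` heading, as in the diff above. The helper names `handle_from_heading`, `handle_from_path`, and `casing_matches` are made up for illustration.

```python
import re
from pathlib import PurePosixPath

# Matches a heading such as "# Module callmemehdi/arabert/1".
HEADING_RE = re.compile(r"^#\s+Module\s+(\S+)\s*$", re.MULTILINE)


def handle_from_heading(markdown: str) -> str:
    """Return the handle declared in the '# Module ...' heading."""
    match = HEADING_RE.search(markdown)
    if match is None:
        raise ValueError("no '# Module <handle>' heading found")
    return match.group(1)


def handle_from_path(path: str) -> str:
    """Derive '<publisher>/<name>/<version>' from the file path.

    Assumed layout (hypothetical): .../<publisher>/models/<name>/<version>.md
    """
    parts = PurePosixPath(path).parts
    idx = parts.index("models")
    publisher = parts[idx - 1]
    name = parts[idx + 1]
    version = PurePosixPath(parts[idx + 2]).stem
    return f"{publisher}/{name}/{version}"


def casing_matches(markdown: str, path: str) -> bool:
    """True when heading and path agree exactly, including letter case."""
    return handle_from_heading(markdown) == handle_from_path(path)


doc = "# Module callmemehdi/arabert/1\n\nAraBERT is an Arabic pretrained language model.\n"
ok = casing_matches(doc, "assets/docs/callmemehdi/models/arabert/1.md")
mismatch = casing_matches(doc, "assets/docs/callmemehdi/models/AraBERT/1.md")
```

The comparison is deliberately case-sensitive, so `arabert` in the heading and `AraBERT` in the folder name are reported as a mismatch.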