Support for diacritics? Is UTF-8 being used consistently? #23
Comments
I will try it. Can you give me the item URL? I have looked at the link. It is in version 4.0 beta. We are using 3.6.2. |
Some further observations on this issue: I believe that Dataverse may be interpreting a two-byte UTF-8 character as two separate single-byte characters. (I have seen reference to this issue in 3.6.x elsewhere in GitHub threads.) An interesting wrinkle is that when you are entering cataloguing for the study, I think certain actions (such as adding another instance of a repeating field) cause DV to save automatically, and it then re-interprets the two characters it previously put in place of the single UTF-8 character all over again, adding further characters. As a result, over time é becomes © and then �Ã�©, then �Â�Ã�©, then Ã�Â�Ã�©, etc. |
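The growth described above is consistent with UTF-8 bytes being decoded with a single-byte encoding on every save. Here is a minimal sketch in Python, assuming the wrong decoder is Latin-1 (ISO-8859-1). The thread does not confirm which encoding Dataverse 3.6.2 actually applied, and the function names are hypothetical.

```python
def misdecode_once(text: str) -> str:
    """Encode as UTF-8, then wrongly decode the bytes as Latin-1 (assumed)."""
    return text.encode("utf-8").decode("latin-1")


def simulate_saves(text: str, saves: int) -> list:
    """Return the text after 0, 1, ..., `saves` faulty save cycles."""
    history = [text]
    for _ in range(saves):
        history.append(misdecode_once(history[-1]))
    return history


def repair_once(text: str) -> str:
    """Undo one faulty cycle by reversing the two steps."""
    return text.encode("latin-1").decode("utf-8")


if __name__ == "__main__":
    for step, value in enumerate(simulate_saves("é", 3)):
        # repr() keeps invisible control characters visible in the output
        print(step, len(value), repr(value))
```

Under this assumption the string doubles in length on each save (1, 2, 4, 8 characters), which matches the pattern of extra characters accumulating on "save". Some of the intermediate characters are non-printing control codes, which may be why they are hard to read in a browser.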
I am still working on DOI problem. If you have found discussion on the Thanks, Piyapong. On Tue, Mar 24, 2015 at 4:15 PM, johnhuck notifications@github.com wrote:
|
Thanks, Piyapong. I'll see if I can find some specific discussion threads for you (the one reference I saw was a passing comment someone made). |
One of my sources is the Dataverse User Group. Piyapong. On Wed, Mar 25, 2015 at 11:05 AM, johnhuck notifications@github.com wrote:
|
I tried adding the character encoding to the startup script, but it did not seem to work. I will try to find another solution. |
I have fixed the problem and tested it. It seems to be working. Here is a sample study: https://hibernian.library.ualberta.ca/dvn/faces/study/StudyPage.xhtml?globalId=doi:10.5072/FK2/10076&versionNumber=2. I entered some French, Chinese, Japanese and Thai. Dataverse saved the text and returned the same characters. Please try again and let me know. |
Hi Piyapong, unfortunately, the problem persists in the dataverse study I showed you the other day. I just edited the record via my laptop (mac) and also via my desktop (pc), and got the same result both times. I also observed the extra character being added on "save". I didn't look at your example, but I notice it's on a different server. Could that be why? |
I have fixed it on the development server. If everything is working as expected, I will deploy the application on Thanks, Piyapong. On Wed, Apr 1, 2015 at 3:16 PM, johnhuck notifications@github.com wrote:
|
You might not be able to try on your laptop with Wifi connection. You Piyapong. On Wed, Apr 1, 2015 at 3:25 PM, Piyapong Charoenwattana <
|
I was able to log on to the Hibernian dev server. I created a test dataverse and a test study with simple diacritics and it looks fine, so I can confirm the behaviour that you have observed. Looks like it's fixed. Thanks! Was it simple to solve? (In case someone else runs into the same problem.) |
I will package it and deploy on production server. I will let you know when Piyapong. On Wed, Apr 1, 2015 at 3:40 PM, johnhuck notifications@github.com wrote:
|
Hi, Piyapong ... I'm told by the development server that this is a Charles (Chuck) Humphrey On Wed, Apr 1, 2015 at 2:54 PM, Piyapong Charoenwattana <
|
Chuck: I would like to see the problem. It might be a firewall or browser certificate issue. Can you show me when you have a chance? If it is a firewall problem, I will ask the sysadmin to open it for you. Thanks, |
Piyapong, I also got a restricted message when I clicked your link (using my staff station), https://hibernian.library.ualberta.ca/dvn/faces/study/StudyPage.xhtml?globalId=doi:10.5072/FK2/10076&versionNumber=2, but I was somehow able to navigate to a login screen, where I was able to log in (ignoring a certificate warning message from my browser). Once I was in the dev instance of Dataverse, I was able to visually identify your test study (using the DOI) and navigate to it. But the link wouldn't take me directly to it. I didn't mention all of this because it seemed tangential to the original task of checking the diacritics. |
I think I was testing when I logged in. Chuck: You might want to point your browser to http://hibernian.library.ualberta.ca Thanks, Piyapong. On Thu, Apr 2, 2015 at 11:46 AM, johnhuck notifications@github.com wrote:
|
Chuck: I have released the study. Can you try this URL, Thanks, Piyapong. On Thu, Apr 2, 2015 at 11:53 AM, Piyapong Charoenwattana <
|
I still get an unsafe certificate warning, but if I click through, the link works now. |
The development server does not have a trusted certificate installed. You In the sample, I know that Thai is working properly. I think it should be Piyapong. On Thu, Apr 2, 2015 at 12:16 PM, johnhuck notifications@github.com wrote:
|
Excellent! Thanks, Chuck Charles (Chuck) Humphrey On Thu, Apr 2, 2015 at 12:03 PM, Piyapong Charoenwattana <
|
I have asked Henry to deploy the application on the production server. I will let you know when it is live. |
The original encoding problem persists on the production instance. |
I have checked the version of development server and production server. The Thanks, Piyapong. On Wed, Apr 15, 2015 at 10:14 AM, johnhuck notifications@github.com wrote:
|
I think Henry is away today. I will check with him again tomorrow. Thanks, Piyapong. On Wed, Apr 15, 2015 at 11:42 AM, Piyapong Charoenwattana <
|
what would you like me to do? regards On Wed, Apr 15, 2015 at 11:44 AM, Piyapong Charoenwattana <
|
Henry: Can you deploy the dataverse war file from Hibernian to production server? Thanks, Piyapong. On Wed, Apr 15, 2015 at 12:01 PM, henryzhang87 notifications@github.com
|
Hi, Piyapong ... Sorry, I'm on the wrong side of the firewall to check this. I'll be back in the office Monday. Thanks, Chuck
|
I see that this is an older message that was stacked on the latest exchanges about special character sets. It popped into sight and I responded. But now that I think about it, I already checked this earlier in the month.
|
Deployed. Please verify. Regards Henry On Wed, Apr 15, 2015 at 12:06 PM, pcharoen notifications@github.com wrote:
|
Thank you, Henry. It is now the latest version. John: Can you try again? Thanks, Piyapong. On Wed, Apr 15, 2015 at 12:18 PM, henryzhang87 notifications@github.com
|
Success! Diacritics are saving properly, so it looks like it is fixed. Thanks, Piyapong and Henry. |
Henry: You can close the ticket. Thanks, Piyapong. On Wed, Apr 15, 2015 at 2:17 PM, johnhuck notifications@github.com wrote:
|
fixed |
I was entering cataloguing information for an item in Dataverse, and it seems that diacritics are not being captured properly. I wonder if it is related to this issue at all:
IQSS/dataverse#834
For an example, look at the title for study doi:10.7939/DVN/10269, if you can access it through an admin interface (It hasn't been released yet). I don't know if this is a known issue for us already. Thank you!