Discussion:
Problem to be examined by those who must use UTF8 with Gambas
(too old to reply)
Fernando Cabral
2017-07-01 17:01:33 UTC
Permalink
I've been testing my new toy with every large and small text I can put my
hands on. The fact is, I have had surprise after surprise. I've just found
that when I read a text that contains "À" (that's an A with a grave accent
(`A) -- or a backward acute accent for those who are not familiar with
diacritics).

Well, I can't display it if I load it into a TextArea.text. Nothing shows.
I mean, NOTHING. That is, I have a blank screen. If I bring it to lowercase
(à or `a) it will show up (in lowercase) with the rest of the text.

So, "`A" (coded as "\xC3\x80") will prevent the displaying of any string.
Nevertheless, if I just try to display "Some text plus À and some text
more" it will work.

Now, if I take the original text and do:

* RawText = RegExp.Replace(RawText, "À", "à")*

yep! it does work.
But if I translate it back with

* RawText = RegExp.Replace(RawText, "à", "À")*

It does not work anymore.

But if I do

*RawText = RawText & "À"*

And RawText did not contain "À" before, then it will be displayed.

Amazing.

Any ideas about why this happens?

Regards

- fernando
--
Fernando Cabral
Blogue: http://fernandocabral.org
Twitter: http://twitter.com/fjcabral
e-mail: ***@gmail.com
Facebook: ***@fcabral.com.br
Telegram: +55 (37) 99988-8868
Wickr ID: fernandocabral
WhatsApp: +55 (37) 99988-8868
Skype: fernandojosecabral
Telefone fixo: +55 (37) 3521-2183
Telefone celular: +55 (37) 99988-8868

Enquanto houver no mundo uma só pessoa sem casa ou sem alimentos,
nenhum político ou cientista poderá se gabar de nada.
Jussi Lahtinen
2017-07-01 19:15:45 UTC
Permalink
I don't quite get your description of the problem. It seems self
contradictory. I mean this part:




*'So, "`A" (coded as "\xC3\x80") will prevent the displaying of any
string. Nevertheless, if I just try to display "Some text plus À and some
text more" it will work.'*
So, what exactly does not work then?
Are you using ascii functions for UTF-8 at some point? I think we need to
see your code to understand the issue.



Jussi



On Sat, Jul 1, 2017 at 8:01 PM, Fernando Cabral <
Post by Fernando Cabral
I've been testing my new toy with every large and small text I can put my
hands on. The fact is, I have had surprise after surprise. I've just found
that when I read a text that contains "À" (that's an A with a grave accent
(`A) -- or a backward acute accent for those who are not familiar with
diacritics).
Well, I can't display it if I load it into a TextArea.text. Nothing shows.
I mean, NOTHING. That is, I have a blank screen. If I bring it to lowercase
(à or `a) it will show up (in lowercase) with the rest of the text.
So, "`A" (coded as "\xC3\x80") will prevent the displaying of any string.
Nevertheless, if I just try to display "Some text plus À and some text
more" it will work.
* RawText = RegExp.Replace(RawText, "À", "à")*
yep! it does work.
But if I translate it back with
* RawText = RegExp.Replace(RawText, "à", "À")*
It does not work anymore.
But if I do
*RawText = RawText & "À"*
And RawText did not contain "À" before, then it will be displayed.
Amazing.
Any ideas about why this happens?
Regards
- fernando
--
Fernando Cabral
Blogue: http://fernandocabral.org
Twitter: http://twitter.com/fjcabral
Telegram: +55 (37) 99988-8868
Wickr ID: fernandocabral
WhatsApp: +55 (37) 99988-8868
Skype: fernandojosecabral
Telefone fixo: +55 (37) 3521-2183
Telefone celular: +55 (37) 99988-8868
Enquanto houver no mundo uma só pessoa sem casa ou sem alimentos,
nenhum político ou cientista poderá se gabar de nada.
------------------------------------------------------------
------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Gambas-user mailing list
https://lists.sourceforge.net/lists/listinfo/gambas-user
Jussi Lahtinen
2017-07-01 22:59:53 UTC
Permalink
---------- Forwarded message ----------
From: Benoît Minisini <***@users.sourceforge.net>
Date: Sun, Jul 2, 2017 at 1:14 AM
Subject: Re: [Gambas-user] Problem to be examined by those who must use
UTF8 with Gambas
To: Jussi Lahtinen <***@gmail.com>, ***@fcabral.com.br


Please send that to the mailing-list.

Regards,
OK, this seems to be bug in GTK+ component!
Go to; Project --> Properties --> Components, and change gb.gui to
gb.gui.qt, then everything will work as expected. Or you need to wait for
fix.
Jussi
On Sat, Jul 1, 2017 at 11:34 PM, Fernando Cabral <
I don't quite get your description of the problem. It seems self
/
/
Perhaps It should sound contradictory. That's because if I write
directly to a TextArea.text from
inside the program, it will show "À" correctly. Nevertheless, if I
read the same string from a file read into a variable and then try
to display it... it does not work.
Post by Jussi Lahtinen
Are you using ascii functions for UTF-8 at some point?
I only use ascii functions where there are no UTF-8 functions. For
instance, split().
Post by Jussi Lahtinen
I think we need to see your code to understand the issue.
I have attached a text file (ODT) as well as the code itself. Note
that, in order to run the code you will have to have unoconv installed.
If you comment the two lines bellow, the code will not work.
* RawText = RegExp.Replace(RawText, "À", "à")
RawText = RegExp.Replace(RawText, "Ó", "ó")
*
This happens because for some mysterious reason, the QT library
seems to get confused
with *À* and *Ó*. But not always!*
*
Jussi
On Sat, Jul 1, 2017 at 8:01 PM, Fernando Cabral
I've been testing my new toy with every large and small text I can put my
hands on. The fact is, I have had surprise after surprise. I've just found
that when I read a text that contains "À" (that's an A with a grave accent
(`A) -- or a backward acute accent for those who are not familiar with
diacritics).
Well, I can't display it if I load it into a TextArea.text. Nothing shows.
I mean, NOTHING. That is, I have a blank screen. If I bring it to lowercase
(à or `a) it will show up (in lowercase) with the rest of the text.
So, "`A" (coded as "\xC3\x80") will prevent the displaying of any string.
Nevertheless, if I just try to display "Some text plus À and some text
more" it will work.
* RawText = RegExp.Replace(RawText, "À", "à")*
yep! it does work.
But if I translate it back with
* RawText = RegExp.Replace(RawText, "à", "À")*
It does not work anymore.
But if I do
*RawText = RawText & "À"*
And RawText did not contain "À" before, then it will be displayed.
Amazing.
Any ideas about why this happens?
Regards
- fernando
--
Fernando Cabral
Blogue: http://fernandocabral.org
Twitter: http://twitter.com/fjcabral
Telegram: +55 (37) 99988-8868 <tel:%2B55%20%2837%29%2099988-8868>
Wickr ID: fernandocabral
WhatsApp: +55 (37) 99988-8868 <tel:%2B55%20%2837%29%2099988-8868>
Skype: fernandojosecabral
Telefone fixo: +55 (37) 3521-2183 <tel:%2B55%20%2837%29%203521-2
183>
Telefone celular: +55 (37) 99988-8868
<tel:%2B55%20%2837%29%2099988-8868>
Enquanto houver no mundo uma só pessoa sem casa ou sem alimentos,
nenhum político ou cientista poderá se gabar de nada.
------------------------------------------------------------
------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Gambas-user mailing list
https://lists.sourceforge.net/lists/listinfo/gambas-user
<https://lists.sourceforge.net/lists/listinfo/gambas-user>
-- Fernando Cabral
Blogue: http://fernandocabral.org
Twitter: http://twitter.com/fjcabral
Telegram: +55 (37) 99988-8868 <tel:+55%2037%2099988-8868>
Wickr ID: fernandocabral
WhatsApp: +55 (37) 99988-8868 <tel:+55%2037%2099988-8868>
Skype: fernandojosecabral
Telefone fixo: +55 (37) 3521-2183 <tel:+55%2037%203521-2183>
Telefone celular: +55 (37) 99988-8868 <tel:+55%2037%2099988-8868>
Enquanto houver no mundo uma só pessoa sem casa ou sem alimentos,
nenhum político ou cientista poderá se gabar de nada.
--
Benoît Minisini
PICCORO McKAY Lenz
2017-07-02 11:37:19 UTC
Permalink
Post by Jussi Lahtinen
Please send that to the mailing-list.
in a forum its easy to the users, only click to the "reoly" button..

but due gambas-users its a mail list, its a compĺication, need to edit the
subject, the Remitent, etc.. so this king of errors/mistakes its a
common...

and its bed indexed by google so find a solution in the web without have a
subscription to the mail list its a pain..
Post by Jussi Lahtinen
Regards,
OK, this seems to be bug in GTK+ component!
Go to; Project --> Properties --> Components, and change gb.gui to
gb.gui.qt, then everything will work as expected. Or you need to wait for
fix.
Jussi
On Sat, Jul 1, 2017 at 11:34 PM, Fernando Cabral <
I don't quite get your description of the problem. It seems self
/
/
Perhaps It should sound contradictory. That's because if I write
directly to a TextArea.text from
inside the program, it will show "À" correctly. Nevertheless, if I
read the same string from a file read into a variable and then try
to display it... it does not work.
Post by Jussi Lahtinen
Are you using ascii functions for UTF-8 at some point?
I only use ascii functions where there are no UTF-8 functions. For
instance, split().
Post by Jussi Lahtinen
I think we need to see your code to understand the issue.
I have attached a text file (ODT) as well as the code itself. Note
that, in order to run the code you will have to have unoconv
installed.
If you comment the two lines bellow, the code will not work.
* RawText = RegExp.Replace(RawText, "À", "à")
RawText = RegExp.Replace(RawText, "Ó", "ó")
*
This happens because for some mysterious reason, the QT library
seems to get confused
with *À* and *Ó*. But not always!*
*
Jussi
On Sat, Jul 1, 2017 at 8:01 PM, Fernando Cabral
I've been testing my new toy with every large and small text I
can put my
hands on. The fact is, I have had surprise after surprise. I've
just found
that when I read a text that contains "À" (that's an A with a
grave accent
(`A) -- or a backward acute accent for those who are not familiar with
diacritics).
Well, I can't display it if I load it into a TextArea.text.
Nothing shows.
I mean, NOTHING. That is, I have a blank screen. If I bring it
to lowercase
(à or `a) it will show up (in lowercase) with the rest of the
text.
So, "`A" (coded as "\xC3\x80") will prevent the displaying of
any string.
Nevertheless, if I just try to display "Some text plus À and
some text
more" it will work.
* RawText = RegExp.Replace(RawText, "À", "à")*
yep! it does work.
But if I translate it back with
* RawText = RegExp.Replace(RawText, "à", "À")*
It does not work anymore.
But if I do
*RawText = RawText & "À"*
And RawText did not contain "À" before, then it will be
displayed.
Amazing.
Any ideas about why this happens?
Regards
- fernando
--
Fernando Cabral
Blogue: http://fernandocabral.org
Twitter: http://twitter.com/fjcabral
gmail.com>
Telegram: +55 (37) 99988-8868 <tel:%2B55%20%2837%29%2099988-
8868>
Wickr ID: fernandocabral
WhatsApp: +55 (37) 99988-8868 <tel:%2B55%20%2837%29%2099988-
8868>
Skype: fernandojosecabral
Telefone fixo: +55 (37) 3521-2183 <tel:%2B55%20%2837%29%203521-2
183>
Telefone celular: +55 (37) 99988-8868
<tel:%2B55%20%2837%29%2099988-8868>
Enquanto houver no mundo uma só pessoa sem casa ou sem alimentos,
nenhum político ou cientista poderá se gabar de nada.
------------------------------------------------------------
------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Gambas-user mailing list
https://lists.sourceforge.net/lists/listinfo/gambas-user
<https://lists.sourceforge.net/lists/listinfo/gambas-user>
-- Fernando Cabral
Blogue: http://fernandocabral.org
Twitter: http://twitter.com/fjcabral
Telegram: +55 (37) 99988-8868 <tel:+55%2037%2099988-8868>
Wickr ID: fernandocabral
WhatsApp: +55 (37) 99988-8868 <tel:+55%2037%2099988-8868>
Skype: fernandojosecabral
Telefone fixo: +55 (37) 3521-2183 <tel:+55%2037%203521-2183>
Telefone celular: +55 (37) 99988-8868 <tel:+55%2037%2099988-8868>
Enquanto houver no mundo uma só pessoa sem casa ou sem alimentos,
nenhum político ou cientista poderá se gabar de nada.
--
Benoît Minisini
------------------------------------------------------------
------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Gambas-user mailing list
https://lists.sourceforge.net/lists/listinfo/gambas-user
Fernando Cabral
2017-07-01 23:12:24 UTC
Permalink
---------- Forwarded message ----------
From: Fernando Cabral <***@gmail.com>
Date: 2017-07-01 17:34 GMT-03:00
Subject: Re: [Gambas-user] Problem to be examined by those who must use
UTF8 with Gambas
Post by Jussi Lahtinen
I don't quite get your description of the problem. It seems self
Perhaps It should sound contradictory. That's because if I write directly
to a TextArea.text from
inside the program, it will show "À" correctly. Nevertheless, if I read the
same string from a file read into a variable and then try to display it...
it does not work.
Post by Jussi Lahtinen
Are you using ascii functions for UTF-8 at some point?
I only use ascii functions where there are no UTF-8 functions. For
instance, split().
Post by Jussi Lahtinen
I think we need to see your code to understand the issue.
I have attached a text file (ODT) as well as the code itself. Note that, in
order to run the code you will have to have unoconv installed.

If you comment the two lines bellow, the code will not work.



* RawText = RegExp.Replace(RawText, "À", "à") RawText =
RegExp.Replace(RawText, "Ó", "ó")*

This happens because for some mysterious reason, the QT library seems to
get confused
with *À* and *Ó*. But not always!

Jussi



On Sat, Jul 1, 2017 at 8:01 PM, Fernando Cabral <
Post by Jussi Lahtinen
I've been testing my new toy with every large and small text I can put my
hands on. The fact is, I have had surprise after surprise. I've just found
that when I read a text that contains "À" (that's an A with a grave accent
(`A) -- or a backward acute accent for those who are not familiar with
diacritics).
Well, I can't display it if I load it into a TextArea.text. Nothing shows.
I mean, NOTHING. That is, I have a blank screen. If I bring it to lowercase
(à or `a) it will show up (in lowercase) with the rest of the text.
So, "`A" (coded as "\xC3\x80") will prevent the displaying of any string.
Nevertheless, if I just try to display "Some text plus À and some text
more" it will work.
* RawText = RegExp.Replace(RawText, "À", "à")*
yep! it does work.
But if I translate it back with
* RawText = RegExp.Replace(RawText, "à", "À")*
It does not work anymore.
But if I do
*RawText = RawText & "À"*
And RawText did not contain "À" before, then it will be displayed.
Amazing.
Any ideas about why this happens?
Regards
- fernando
--
Fernando Cabral
Blogue: http://fernandocabral.org
Twitter: http://twitter.com/fjcabral
Telegram: +55 (37) 99988-8868
Wickr ID: fernandocabral
WhatsApp: +55 (37) 99988-8868
Skype: fernandojosecabral
Telefone fixo: +55 (37) 3521-2183
Telefone celular: +55 (37) 99988-8868
Enquanto houver no mundo uma só pessoa sem casa ou sem alimentos,
nenhum político ou cientista poderá se gabar de nada.
------------------------------------------------------------
------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Gambas-user mailing list
https://lists.sourceforge.net/lists/listinfo/gambas-user
--
Fernando Cabral
Blogue: http://fernandocabral.org
Twitter: http://twitter.com/fjcabral
e-mail: ***@gmail.com
Facebook: ***@fcabral.com.br
Telegram: +55 (37) 99988-8868 <(37)%2099988-8868>
Wickr ID: fernandocabral
WhatsApp: +55 (37) 99988-8868 <(37)%2099988-8868>
Skype: fernandojosecabral
Telefone fixo: +55 (37) 3521-2183 <(37)%203521-2183>
Telefone celular: +55 (37) 99988-8868 <(37)%2099988-8868>

Enquanto houver no mundo uma só pessoa sem casa ou sem alimentos,
nenhum político ou cientista poderá se gabar de nada.
--
Fernando Cabral
Blogue: http://fernandocabral.org
Twitter: http://twitter.com/fjcabral
e-mail: ***@gmail.com
Facebook: ***@fcabral.com.br
Telegram: +55 (37) 99988-8868
Wickr ID: fernandocabral
WhatsApp: +55 (37) 99988-8868
Skype: fernandojosecabral
Telefone fixo: +55 (37) 3521-2183
Telefone celular: +55 (37) 99988-8868

Enquanto houver no mundo uma só pessoa sem casa ou sem alimentos,
nenhum político ou cientista poderá se gabar de nada.
Continue reading on narkive:
Loading...