Re: [OpenStack-I18n] stackalytics translation stats API is broken????
Hmm... It seems a problem is limited to "stable-mitaka vs stable/mitaka".
It seems 'master' version of translation stats also has a problem.
okay, good information. thanks.
I translated more than 2000 words in 'master' version, but stackalytics says I translated only 251 words.
To understand the situation more exactly, which do you mean 2000 "Words" or "Messages" at Zanata?
Is it better that I stop translation?
If exact stats is important for you, you should stop translation. Otherwise, translation process itself works fine. Also, I'm not sure, but we will get proper stats after stackalytics works fine. Regards, KATO Tomoyuki
2016-05-15 16:10 GMT+09:00 Kato, Tomoyuki <kato.tomoyuki@jp.fujitsu.com>:
Hmm... It seems a problem is limited to "stable-mitaka vs stable/mitaka".
It seems 'master' version of translation stats also has a problem.
okay, good information. thanks.
I translated more than 2000 words in 'master' version, but stackalytics says I translated only 251 words.
To understand the situation more exactly, which do you mean 2000 "Words" or "Messages" at Zanata?
What is not clear? As I wrote above, I translated more than 2000 'words'.
Is it better that I stop translation?
If exact stats is important for you, you should stop translation. Otherwise, translation process itself works fine. Also, I'm not sure, but we will get proper stats after stackalytics works fine.
It is an irony :-) Once translation statistics is available publicly, the 'number' should be correct. Do you plan to determine ATC status based on Stackalytics stats? If so, I think it is a problem. My e-mail just wants to warn you all. Akihiro
Regards, KATO Tomoyuki
_______________________________________________ OpenStack-I18n mailing list OpenStack-I18n@lists.openstack.org http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-i18n
Hmm... It seems a problem is limited to "stable-mitaka vs stable/mitaka".
It seems 'master' version of translation stats also has a problem.
okay, good information. thanks.
I translated more than 2000 words in 'master' version, but stackalytics says I translated only 251 words.
To understand the situation more exactly, which do you mean 2000 "Words" or "Messages" at Zanata?
What is not clear? As I wrote above, I translated more than 2000 'words'.
IIRC, Zanata returns "Messages" (i.e. units, not "Words") counts as stats from API 'stats/user/%s/%s..%s' % (zanata_user_id,start_date, end_date))'. Carlos, Alex, is it correct?
Is it better that I stop translation?
If exact stats is important for you, you should stop translation. Otherwise, translation process itself works fine. Also, I'm not sure, but we will get proper stats after stackalytics works fine.
It is an irony :-)
Once translation statistics is available publicly, the 'number' should be correct. Do you plan to determine ATC status based on Stackalytics stats? If so, I think it is a problem. My e-mail just wants to warn you all.
Yes. Good finding. We need to investigate and clarify it. KATO Tomoyuki
2016年5月15日日曜日、Kato, Tomoyuki<kato.tomoyuki@jp.fujitsu.com>さんは書きました:
Hmm... It seems a problem is limited to "stable-mitaka vs stable/mitaka".
It seems 'master' version of translation stats also has a problem.
okay, good information. thanks.
I translated more than 2000 words in 'master' version, but stackalytics says I translated only 251 words.
To understand the situation more exactly, which do you mean 2000 "Words" or "Messages" at Zanata?
What is not clear? As I wrote above, I translated more than 2000 'words'.
IIRC, Zanata returns "Messages" (i.e. units, not "Words") counts as stats from API 'stats/user/%s/%s..%s' % (zanata_user_id,start_date, end_date))'.
To clarify the situation more, my statistics from Zanata is based on Zanata personal dashboard. That is all what I can know as an usual user. On the other hand, Stackalitcs uses "terms". In my understandng, "terms" means "words". Thanks, Akihiro
Carlos, Alex, is it correct?
Is it better that I stop translation?
If exact stats is important for you, you should stop translation. Otherwise, translation process itself works fine. Also, I'm not sure, but we will get proper stats after stackalytics works fine.
It is an irony :-)
Once translation statistics is available publicly, the 'number' should be correct. Do you plan to determine ATC status based on Stackalytics stats? If so, I think it is a problem. My e-mail just wants to warn you all.
Yes. Good finding. We need to investigate and clarify it.
KATO Tomoyuki
_______________________________________________ OpenStack-I18n mailing list OpenStack-I18n@lists.openstack.org <javascript:;> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-i18n
Hello, On 5/15/16 9:26 PM, Akihiro Motoki wrote:
2016年5月15日日曜日、Kato, Tomoyuki<kato.tomoyuki@jp.fujitsu.com <mailto:kato.tomoyuki@jp.fujitsu.com>>さんは書きました:
> >> Hmm... It seems a problem is limited to "stable-mitaka vs stable/mitaka". > >> > >> It seems 'master' version of translation stats also has a problem. > > > > okay, good information. thanks. > > > >> I translated more than 2000 words in 'master' version, but > >> stackalytics says I translated only 251 words. > > > > To understand the situation more exactly, > > which do you mean 2000 "Words" or "Messages" at Zanata? > > What is not clear? > As I wrote above, I translated more than 2000 'words'.
IIRC, Zanata returns "Messages" (i.e. units, not "Words") counts as stats from API 'stats/user/%s/%s..%s' % (zanata_user_id,start_date, end_date))'.
To clarify the situation more, my statistics from Zanata is based on Zanata personal dashboard. That is all what I can know as an usual user.
On the other hand, Stackalitcs uses "terms". In my understandng, "terms" means "words".
Stackalytics 'terms' means 'words' also in my understanding. To find why Stackalytics generates wrong statistics, one of easiest but so manual ways is to compare Zanata stat counts to Stackalytics results.. I manually compared Stackalytics translation counts and JSON results on that Zanata API only for master branch in my account, but the two numbers are quite diffrerent. I made sums with possible different metrics on the Zanata API results, but I cannot make the same number in Stackalytics... T.T Any recommendations or approaches on finding the problem(s)? Zanata API results have more counts, so IMO Zanata numbers are more correct than Stackalytics calculates currently.
Thanks, Akihiro
Carlos, Alex, is it correct?
> >> Is it better that I stop translation? > > > > If exact stats is important for you, you should stop translation. > > Otherwise, translation process itself works fine. > > Also, I'm not sure, but we will get proper stats > > after stackalytics works fine. > > It is an irony :-) > > Once translation statistics is available publicly, the 'number' should > be correct. > Do you plan to determine ATC status based on Stackalytics stats? > If so, I think it is a problem.
I have a question, although it is not mainly related to wrong Stackalytics stats. I have found that API 'stats/user/%s/%s..%s' and API 'stats/project/%s/version/%s/contributor/%s/%s..%s' generate different results. For example, from Apr 1 2016 to today, my count is 34320 with the first API, but the send API generates 29490 for the same period. I do know why the two APIs generate different numbers. Can statistics numbers on user API and the sum of results from each project API be different? With many thanks, /Ian
> My e-mail just wants to warn you all.
Yes. Good finding. We need to investigate and clarify it.
KATO Tomoyuki
_______________________________________________ OpenStack-I18n mailing list OpenStack-I18n@lists.openstack.org <javascript:;> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-i18n
_______________________________________________ OpenStack-I18n mailing list OpenStack-I18n@lists.openstack.org http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-i18n
On Sun, May 15, 2016 at 10:42 PM, Ian Y. Choi <ianyrchoi@gmail.com> wrote:
I have a question, although it is not mainly related to wrong Stackalytics stats.
I have found that API 'stats/user/%s/%s..%s' and API 'stats/project/%s/version/%s/contributor/%s/%s..%s' generate different results.
For example, from Apr 1 2016 to today, my count is 34320 with the first API, but the send API generates 29490 for the same period.
I do know why the two APIs generate different numbers. Can statistics numbers on user API and the sum of results from each project API be different?
Hi all, Previously Stackalytics uses API 'stats/project/%s/version/%s/contributor/%s/%s..%s'. But I change it to 'stats/user/%s/%s..%s' in https://review.openstack.org/#/c/288871/ I may find the BUG but I have no time to fix it these days. So I hope someone to help. Stackalytics will get the results every week. And 'stats/user/%s/%s..%s' will return everyday's count. However Stackalytics seems not to sum a week's work. To fix it, I think we can change the way how Stackalytics uses Zanata's API. Based on 'stats/user/%s/%s..%s' we can get 'savedDate'. So just use the 'savedDate' to display on Stackalytics. I haven't read the code carefully but I think the change is possible. What's more we can debug the wrong results on Stackalytics more conveniently. But there's another question. How can we fix the old results on Stackalytics? -- Best regards, Zhu Shengjing
Hello, I might have found the problem. The problem is that there can be several 'Translated' results within a week, however, those results are saved with the same primary key. I have just proposed https://review.openstack.org/316460 . I am not sure whether 'date' field is used on other Stackalytics sources, but at least, generating the primary key for translation records on record_processor.py does not use 'savedDate' field, as Shengjing mentioned. If 'date' field is used on other Stackalytics sources, please tell me, then I will change the review :) With many thanks, /Ian SJ Zhu wrote on 5/16/2016 12:14 AM:
On Sun, May 15, 2016 at 10:42 PM, Ian Y. Choi <ianyrchoi@gmail.com> wrote:
I have a question, although it is not mainly related to wrong Stackalytics stats.
I have found that API 'stats/user/%s/%s..%s' and API 'stats/project/%s/version/%s/contributor/%s/%s..%s' generate different results.
For example, from Apr 1 2016 to today, my count is 34320 with the first API, but the send API generates 29490 for the same period.
I do know why the two APIs generate different numbers. Can statistics numbers on user API and the sum of results from each project API be different? Hi all,
Previously Stackalytics uses API 'stats/project/%s/version/%s/contributor/%s/%s..%s'. But I change it to 'stats/user/%s/%s..%s' in https://review.openstack.org/#/c/288871/
I may find the BUG but I have no time to fix it these days. So I hope someone to help.
Stackalytics will get the results every week. And 'stats/user/%s/%s..%s' will return everyday's count. However Stackalytics seems not to sum a week's work.
To fix it, I think we can change the way how Stackalytics uses Zanata's API. Based on 'stats/user/%s/%s..%s' we can get 'savedDate'. So just use the 'savedDate' to display on Stackalytics. I haven't read the code carefully but I think the change is possible. What's more we can debug the wrong results on Stackalytics more conveniently.
But there's another question. How can we fix the old results on Stackalytics?
Hi, All fixes in stackalytics zanata processor has been merged. Does anyone know when Stackalytics is updated? BR, Akihiro 2016-05-16 1:56 GMT+09:00 Ian Y. Choi <ianyrchoi@gmail.com>:
Hello,
I might have found the problem.
The problem is that there can be several 'Translated' results within a week, however, those results are saved with the same primary key.
I have just proposed https://review.openstack.org/316460 .
I am not sure whether 'date' field is used on other Stackalytics sources, but at least, generating the primary key for translation records on record_processor.py does not use 'savedDate' field, as Shengjing mentioned. If 'date' field is used on other Stackalytics sources, please tell me, then I will change the review :)
With many thanks,
/Ian
SJ Zhu wrote on 5/16/2016 12:14 AM:
On Sun, May 15, 2016 at 10:42 PM, Ian Y. Choi <ianyrchoi@gmail.com> wrote:
I have a question, although it is not mainly related to wrong Stackalytics stats.
I have found that API 'stats/user/%s/%s..%s' and API 'stats/project/%s/version/%s/contributor/%s/%s..%s' generate different results.
For example, from Apr 1 2016 to today, my count is 34320 with the first API, but the send API generates 29490 for the same period.
I do know why the two APIs generate different numbers. Can statistics numbers on user API and the sum of results from each project API be different?
Hi all,
Previously Stackalytics uses API 'stats/project/%s/version/%s/contributor/%s/%s..%s'. But I change it to 'stats/user/%s/%s..%s' in https://review.openstack.org/#/c/288871/
I may find the BUG but I have no time to fix it these days. So I hope someone to help.
Stackalytics will get the results every week. And 'stats/user/%s/%s..%s' will return everyday's count. However Stackalytics seems not to sum a week's work.
To fix it, I think we can change the way how Stackalytics uses Zanata's API. Based on 'stats/user/%s/%s..%s' we can get 'savedDate'. So just use the 'savedDate' to display on Stackalytics. I haven't read the code carefully but I think the change is possible. What's more we can debug the wrong results on Stackalytics more conveniently.
But there's another question. How can we fix the old results on Stackalytics?
_______________________________________________ OpenStack-I18n mailing list OpenStack-I18n@lists.openstack.org http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-i18n
On 2016-05-20 18:45:05 +0900 (+0900), Akihiro Motoki wrote: [...]
Does anyone know when Stackalytics is updated?
You might want to reach out to members of the stackalytics-core team: https://review.openstack.org/#/admin/groups/183,members -- Jeremy Stanley
/Status update/ I contacted the stackalytics team. They plan to upgrade Stackalytics this week. 2016-05-20 23:24 GMT+09:00 Jeremy Stanley <fungi@yuggoth.org>:
On 2016-05-20 18:45:05 +0900 (+0900), Akihiro Motoki wrote: [...]
Does anyone know when Stackalytics is updated?
You might want to reach out to members of the stackalytics-core team: https://review.openstack.org/#/admin/groups/183,members -- Jeremy Stanley
_______________________________________________ OpenStack-I18n mailing list OpenStack-I18n@lists.openstack.org http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-i18n
If you wish to see your statistics in Zanata, https://translate.openstack.org/profile/view/{your username} --------------------------------------------- Alex Eng Senior Software Engineer Globalisation Tools Engineering DID: +61 3514 8262 <callto:+61+3514+8262> Mobile: +614 2335 3457 <callto:+614+2335+3457> Red Hat, Asia-Pacific Pty Ltd Level 1, 193 North Quay Brisbane 4000 Office: +61 7 3514 8100 <callto:+61+7+3514+8100> Fax: +61 7 3514 8199 <callto:+61+7+3514+8199> Website: www.redhat.com On Sun, May 15, 2016 at 5:10 PM, Kato, Tomoyuki < kato.tomoyuki@jp.fujitsu.com> wrote:
Hmm... It seems a problem is limited to "stable-mitaka vs stable/mitaka".
It seems 'master' version of translation stats also has a problem.
okay, good information. thanks.
I translated more than 2000 words in 'master' version, but stackalytics says I translated only 251 words.
To understand the situation more exactly, which do you mean 2000 "Words" or "Messages" at Zanata?
Is it better that I stop translation?
If exact stats is important for you, you should stop translation. Otherwise, translation process itself works fine. Also, I'm not sure, but we will get proper stats after stackalytics works fine.
Regards, KATO Tomoyuki
_______________________________________________ OpenStack-I18n mailing list OpenStack-I18n@lists.openstack.org http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-i18n
Hi all, Just wanted to explain a bit more on the Zanata side. http://translate.openstack.org/profile/view/{username} will give you a stats for a period of time. It uses the REST api /stats/user/{username}/{fromdate}...{todate} The result is word based. So 100 means 100 words. There are other stats API in zanata and the semantics and algorithm may be different. e.g. stats/project/%s/version/%s/contributor/%s/%s..%s It uses a different algorithm for different purpose. For example: - User A translated 100 words. Next day user A revise his translation. Do we count user A's stats as 100 or 200? - User A translated 100 words. Next day user B modify his translation. But after discussion they agreed to revert to user A's original translation. Do we count user A's stats as 100 or 200. What about user B's? So different stats API may return different result as they are designed differently. Hope this helps. On Mon, May 16, 2016 at 7:51 AM, Alex Eng <aeng@redhat.com> wrote:
If you wish to see your statistics in Zanata,
https://translate.openstack.org/profile/view/{your username}
---------------------------------------------
Alex Eng Senior Software Engineer Globalisation Tools Engineering DID: +61 3514 8262 <callto:+61+3514+8262> Mobile: +614 2335 3457 <callto:+614+2335+3457>
Red Hat, Asia-Pacific Pty Ltd Level 1, 193 North Quay Brisbane 4000 Office: +61 7 3514 8100 <callto:+61+7+3514+8100> Fax: +61 7 3514 8199 <callto:+61+7+3514+8199> Website: www.redhat.com
On Sun, May 15, 2016 at 5:10 PM, Kato, Tomoyuki < kato.tomoyuki@jp.fujitsu.com> wrote:
Hmm... It seems a problem is limited to "stable-mitaka vs stable/mitaka".
It seems 'master' version of translation stats also has a problem.
okay, good information. thanks.
I translated more than 2000 words in 'master' version, but stackalytics says I translated only 251 words.
To understand the situation more exactly, which do you mean 2000 "Words" or "Messages" at Zanata?
Is it better that I stop translation?
If exact stats is important for you, you should stop translation. Otherwise, translation process itself works fine. Also, I'm not sure, but we will get proper stats after stackalytics works fine.
Regards, KATO Tomoyuki
_______________________________________________ OpenStack-I18n mailing list OpenStack-I18n@lists.openstack.org http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-i18n
_______________________________________________ OpenStack-I18n mailing list OpenStack-I18n@lists.openstack.org http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-i18n
-- Patrick Huang Senior Software Engineer Engineering - Internationalisation Red Hat, Asia-Pacific Pty Ltd Level 1, 193 North Quay Brisbane 4000 Office: +61 7 3514 8278 Fax: +61 7 3514 8199 IRC: pahuang github: github.com/huangp Website: www.redhat.com
If you wish to see your statistics in Zanata,
https://translate.openstack.org/profile/view/{your username}
From my stats, Zanata says May 1 nova 94 words, May 1 designate 61 words May 4 openstack-manuals 182 words May 5 openstack-manuals 45 words On the other hands, Stackalytics says May 2 nova 94 words, Apr 25 designate 61 words Apr 25 openstack-manuals 45 words So, Stackalytics shows the last day stats in a week, I guess. We need deeper investigation... KATO Tomoyuki
Thanks, Remo
On May 15, 2016, at 18:17, Kato, Tomoyuki <kato.tomoyuki@jp.fujitsu.com> wrote:
If you wish to see your statistics in Zanata,
https://translate.openstack.org/profile/view/{your username}
From my stats, Zanata says
May 1 nova 94 words, May 1 designate 61 words May 4 openstack-manuals 182 words May 5 openstack-manuals 45 words
On the other hands, Stackalytics says May 2 nova 94 words, Apr 25 designate 61 words Apr 25 openstack-manuals 45 words
So, Stackalytics shows the last day stats in a week, I guess. We need deeper investigation...
KATO Tomoyuki _______________________________________________ OpenStack-I18n mailing list OpenStack-I18n@lists.openstack.org http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-i18n
!DSPAM:57391fdb6031194441273!
Kato, So, Stackalytics shows the last day stats in a week, I guess. Seems like that. The statistics you got from Zanata can confirmed that as it's daily, but Stackalytics aggregate it into weekly. On Mon, May 16, 2016 at 11:17 AM, Kato, Tomoyuki < kato.tomoyuki@jp.fujitsu.com> wrote:
If you wish to see your statistics in Zanata,
https://translate.openstack.org/profile/view/{your username}
From my stats, Zanata says
May 1 nova 94 words, May 1 designate 61 words May 4 openstack-manuals 182 words May 5 openstack-manuals 45 words
On the other hands, Stackalytics says May 2 nova 94 words, Apr 25 designate 61 words Apr 25 openstack-manuals 45 words
So, Stackalytics shows the last day stats in a week, I guess. We need deeper investigation...
KATO Tomoyuki _______________________________________________ OpenStack-I18n mailing list OpenStack-I18n@lists.openstack.org http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-i18n
-- Alex Eng
participants (9)
-
Akihiro Motoki
-
Alex Eng
-
Alex Eng
-
Ian Y. Choi
-
Jeremy Stanley
-
Kato, Tomoyuki
-
Patrick Huang
-
Remo Mattei
-
SJ Zhu