Search Results
28122. Optimal reference translation of English-Czech WMT2020
- Creator:
- Kloudová, Věra, Mraček, David, Bojar, Ondřej, and Popel, Martin
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- translational equivalence, reference translation, optimal reference translation, and WMT
- Language:
- Czech and English
- Description:
- We define "optimal reference translation" as a translation thought to be the best possible that can be achieved by a team of human translators. Optimal reference translations can be used in assessments of excellent machine translations. We selected 50 documents (online news articles, with 579 paragraphs in total) from the 130 English documents included in the WMT2020 news test (http://www.statmt.org/wmt20/), aiming to preserve the diversity (style, genre, etc.) of the selection. In addition to the official Czech reference translation provided by the WMT organizers (P1), we hired two additional translators (P2 and P3, native Czech speakers) via a professional translation agency, resulting in three independent translations. The main contribution of this dataset is two additional translations (i.e. optimal reference translations N1 and N2), done jointly by two translators-cum-theoreticians with extreme care for various aspects of translation quality, while taking into account the translations P1-P3. We also publish internal comments (in Czech) for some of the segments. Translation N1 should be closer to the English original (with regard to meaning and linguistic structure), and female surnames use the Czech feminine suffix (e.g. "Mai" is translated as "Maiová"). Translation N2 is freer, aiming to be more creative, idiomatic and entertaining for readers and following the typical style used in Czech media, while still preserving the rules of functional equivalence. Translation N2 is missing for the segments where it was not deemed necessary to provide two alternative translations. For applications/analyses needing translations of all segments, this should be interpreted as if N2 is the same as N1 for a given segment. We provide the dataset in two formats: OpenDocument spreadsheet (odt) and plain text (one file for each translation and the English original).
Some words were highlighted using different colors during the creation of optimal reference translations; this highlighting and comments are present only in the odt format (some comments refer to row numbers in the odt file). Documents are separated by empty lines and each document starts with a special line containing the document name (e.g. "# upi.205735"), which allows alignment with the original WMT2020 news test. For the segments where N2 translations are missing in the odt format, the respective N1 segments are used instead in the plain-text format.
- Rights:
- Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0), http://creativecommons.org/licenses/by-nc-sa/4.0/, and PUB
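The plain-text layout described for record 28122 (documents separated by empty lines, each opening with a name line such as "# upi.205735") could be read with a small parser like the sketch below. It assumes one segment per line, which the description does not state explicitly; the function names, the sample text and the second document name are hypothetical and are not part of the dataset.

```python
from typing import Dict, List, Optional, Tuple


def parse_documents(text: str) -> Dict[str, List[str]]:
    """Split plain-text content into {document name: [segments]}.

    Assumptions (not confirmed by the record): one segment per line,
    documents separated by empty lines, and a leading "# <name>" line
    at the start of every document.
    """
    docs: Dict[str, List[str]] = {}
    current: Optional[str] = None
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            current = None  # an empty line closes the current document
        elif current is None and line.startswith("# "):
            current = line[2:]  # name line, e.g. "upi.205735"
            docs[current] = []
        elif current is not None:
            docs[current].append(line)
    return docs


def align(source: Dict[str, List[str]],
          target: Dict[str, List[str]]) -> Dict[str, List[Tuple[str, str]]]:
    """Pair segments of documents that share a name and a segment count."""
    return {
        name: list(zip(segments, target[name]))
        for name, segments in source.items()
        if name in target and len(target[name]) == len(segments)
    }


# Hypothetical sample; real files would be read from disk instead.
sample_en = "# upi.205735\nFirst segment.\nSecond segment.\n\n# example.1\nOnly segment.\n"
sample_cs = "# upi.205735\nPrvní segment.\nDruhý segment.\n\n# example.1\nJediný segment.\n"

english = parse_documents(sample_en)
czech = parse_documents(sample_cs)
pairs = align(english, czech)
```

Because the plain-text files already substitute N1 for missing N2 segments, a reader working only with these files should not need a separate fallback step for N2.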
28123. Optimal Reference Translations from English to Czech
- Creator:
- Zouhar, Vilém, Kloudová, Věra, Popel, Martin, and Bojar, Ondřej
- Publisher:
- Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
- Type:
- text and corpus
- Subject:
- translation, evaluation, and optimal reference translation
- Language:
- English and Czech
- Description:
- This corpus contains annotations of translation quality from English to Czech in seven categories at both the segment and the document level. There are 20 documents in total, each with 4 translations (evaluated by each annotator in parallel) of 8 segments (a segment can be longer than one sentence). Apart from the evaluation, the annotators also proposed their own, improved versions of the translations. There were 11 annotators in total, with expertise levels ranging from non-experts to professional translators.
- Rights:
- Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB
28124. Optimal sequential multiple hypothesis testing in presence of control variables
- Creator:
- Novikov, Andrey
- Format:
- unmediated and volume
- Type:
- model:article and TEXT
- Subject:
- sequential analysis, sequential hypothesis testing, multiple hypotheses, control variable, independent observations, optimal stopping, optimal control, optimal decision, optimal sequential testing procedure, Bayes, and sequential probability ratio test
- Language:
- English
- Description:
- Suppose that at any stage of a statistical experiment a control variable X that affects the distribution of the observed data Y at this stage can be used. The distribution of Y depends on some unknown parameter θ, and we consider the problem of testing multiple hypotheses H1:θ=θ1, H2:θ=θ2,…, Hk:θ=θk allowing the data to be controlled by X, in the following sequential context. The experiment starts with assigning a value X1 to the control variable and observing Y1 as a response. After some analysis, another value X2 for the control variable is chosen, and Y2 as a response is observed, etc. It is supposed that the experiment eventually stops, and at that moment a final decision in favor of one of the hypotheses H1,…, Hk is to be taken. In this article, our aim is to characterize the structure of optimal sequential testing procedures based on data obtained from an experiment of this type in the case when the observations Y1,Y2,…,Yn are independent, given controls X1,X2,…,Xn, n=1,2,….
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
28125. Optimal sequential multiple hypothesis tests
- Creator:
- Novikov, Andrey
- Format:
- unmediated and volume
- Type:
- model:article and TEXT
- Subject:
- sequential analysis, hypothesis testing, multiple hypotheses, discrete-time stochastic process, dependent observations, optimal sequential test, and Bayes sequential test
- Language:
- English
- Description:
- This work deals with a general problem of testing multiple hypotheses about the distribution of a discrete-time stochastic process. Both the Bayesian and the conditional settings are considered. The structure of optimal sequential tests is characterized.
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
28126. Optimal sequential procedures with Bayes decision rules
- Creator:
- Novikov, Andrey
- Format:
- unmediated and volume
- Type:
- model:article and TEXT
- Subject:
- sequential analysis, discrete-time stochastic process, dependent observations, statistical decision problem, Bayes decision, randomized stopping time, optimal stopping rule, and existence and uniqueness of optimal sequential decision procedure
- Language:
- English
- Description:
- In this article, a general problem of sequential statistical inference for general discrete-time stochastic processes is considered. The problem is to minimize an average sample number given that Bayesian risk due to incorrect decision does not exceed some given bound. We characterize the form of optimal sequential stopping rules in this problem. In particular, we have a characterization of the form of optimal sequential decision procedures when the Bayesian risk includes both the loss due to incorrect decision and the cost of observations.
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
28127. Optimal sublinear inequalities involving geometric and power means
- Creator:
- Wen, Jiajin, Cheng, Sui Sun, and Gao, Chaobang
- Format:
- unmediated and volume
- Type:
- model:article and TEXT
- Subject:
- geometric mean, power mean, Hermitian matrix, permanent of a complex, simplex, and arithmetic-geometric inequality
- Language:
- English
- Description:
- There are many relations involving the geometric means $G_n(x)$ and power means $[A_n(x^{\gamma})]^{1/\gamma}$ for positive $n$-vectors $x$. Some of them assume the form of inequalities involving parameters. There then is the question of sharpness, which is quite difficult in general. In this paper we are concerned with inequalities of the form $(1-\lambda)G_n^{\gamma}(x) + \lambda A_n^{\gamma}(x) \ge A_n(x^{\gamma})$ and $(1-\lambda)G_n^{\gamma}(x) + \lambda A_n^{\gamma}(x) \le A_n(x^{\gamma})$ with parameters $\lambda \in \mathbb{R}$ and $\gamma \in (0, 1)$. We obtain a necessary and sufficient condition for the former inequality, and a sharp condition for the latter. Several applications of our results are also demonstrated.
- Rights:
- http://creativecommons.org/publicdomain/mark/1.0/ and policy:public
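For orientation, the quantities in record 28127 can be written out as below for a positive n-vector x = (x_1, ..., x_n). These are the standard definitions of the geometric and arithmetic means, assumed here rather than quoted from the paper, and the last line is only a familiar special case, not one of the paper's results.

```latex
\begin{align*}
G_n(x) &= \Bigl(\prod_{i=1}^{n} x_i\Bigr)^{1/n}, \qquad
A_n(x) = \frac{1}{n}\sum_{i=1}^{n} x_i, \qquad
A_n(x^{\gamma}) = \frac{1}{n}\sum_{i=1}^{n} x_i^{\gamma}, \\
% The first inequality studied in the paper, with G_n^{\gamma}(x) = [G_n(x)]^{\gamma}:
(1-\lambda)\,G_n^{\gamma}(x) + \lambda\,A_n^{\gamma}(x) &\ge A_n(x^{\gamma}), \\
% Special case \lambda = 1: Jensen's inequality for the concave map t \mapsto t^{\gamma}, 0 < \gamma < 1:
A_n^{\gamma}(x) &\ge A_n(x^{\gamma}).
\end{align*}
```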
28128. Optimal taxation with risky human capital
- Creator:
- Kapička, Marek and Neira, Julian
- Publisher:
- CERGE-EI
- Format:
- electronic, unmediated, volume, and 36 pages : illustrations.
- Type:
- model:monograph and TEXT
- Subject:
- public finance, tax policy, personal income tax, human capital, 336.22.02, 351.72, 331.108.23, (048.8), 4, and 336.1/.5
- Language:
- English and Czech
- Description:
- Marek Kapička, Julian Neira. Includes bibliography and bibliographical references; summaries in Czech and English
- Rights:
- http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public
28129. Optimal tuning of the two-degree-of-freedom system
- Creator:
- Bach, Pavel
- Format:
- unmediated and volume
- Type:
- model:article and TEXT
- Subject:
- machine tool, chatter, stability, performance, and optimization
- Language:
- English
- Description:
- The performance of any machine tool is, under certain technological conditions, limited by chatter, which occurs during machining. The boundary between machining with and without chatter is called the limit of stability. It is expressed by the so-called stable depth of cut, which is defined under certain conditions. The article investigates optimal modal parameters for machine tool models. The criterion for this optimum is the highest limit of stability.
- Rights:
- http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public
28130. Optimal wire myography normalization for the rat dorsal penile, internal pudendal and internal iliac arteries
- Creator:
- Azeez, Tooyib A., Andrade, Manuella R., and La Favor, Justin D.
- Format:
- computer and online resource
- Type:
- model:article and TEXT
- Subject:
- normalization procedure, optimal initial tension, myograph, pre-penile arteries, and erectile dysfunction
- Language:
- English
- Description:
- In functional arterial studies using wire myography, the determination of a vessel’s standardized normalization factor (factor k) is an essential step to ensure optimal contraction and relaxation by the arteries when stimulated with their respective vasoactive agents and to obtain reproducible results. The optimal factor k has been determined for several arteries; however, the optimal initial tension and factor k for the arteries involved in erection remain unknown. Hence, in the present study we set out to determine the optimal factor k for the internal iliac artery, proximal and distal internal pudendal artery (IPA), and dorsal penile artery. After isolating, harvesting, and mounting the arteries from male Sprague-Dawley rats on a multi-wire myograph, we tested arterial responsivity to high K+-stimulation when the factor k was set at 0.7, 0.8, 0.85, 0.9, 0.95, 1.0, 1.1, and 1.2 to determine the factor k setting that results in the greatest K+-induced active force production for each vessel type. The data showed that the optimal factor k is 0.90-0.95 for the dorsal penile, distal internal pudendal and internal iliac arteries, while it is 0.85-0.90 for the proximal internal pudendal artery. These optimal values corresponded to initial passive tension settings of 1.10±0.16 - 1.46±0.23, 1.28±0.20 - 1.69±0.34, 1.03±0.27 - 1.33±0.31, and 1.33±0.31 - 1.77±0.43 mN/mm for the dorsal penile, distal IP, proximal IP, and internal iliac arteries, respectively.
- Rights:
- http://creativecommons.org/licenses/by-nc-sa/4.0/ and policy:public