finish basic_text_classification translation #98
Conversation
@TobiasLee That works.
@leviding Claiming the proofreading.
The translation is great! There are a few formatting issues, and in some places the English source text is still left in. Please take a look.
"在这个任务中,我们将把电影评论分为**积极**和**消极**两种,即是一个**二分类**任务,这是一个非常重要并且已经被广泛应用的机器学习问题。\n",
"我们将使用 [IMDB 数据集](https://www.tensorflow.org/api_docs/python/tf/keras/datasets/imdb),其中包括了 50000 条来自 [Internet Movie Database](https://www.imdb.com/) 的电影评论。这些评论被等分成两份分别用于训练和测试,并且,训练集和测试集的样本是**平衡**的,也就是说,积极和消极的评论数目相同。\n",
「这些评论被等分成两份分别用于训练和测试」=>「这些评论被等分成两份,分别用于训练和测试」(insert a comma after 两份 to separate the clauses)
"我们将使用 [IMDB 数据集](https://www.tensorflow.org/api_docs/python/tf/keras/datasets/imdb),其中包括了 50000 条来自 [Internet Movie Database](https://www.imdb.com/) 的电影评论。这些评论被等分成两份分别用于训练和测试,并且,训练集和测试集的样本是**平衡**的,也就是说,积极和消极的评论数目相同。\n",
"接下来的代码中,我们会使用一个用于创建和训练 TensorFlow 模型的高级 API —— [tf.keras](https://www.tensorflow.org/guide/keras)。如果你希望查看进阶版的文本分类教程,请查看 [MLCC Text Classification Guide](https://developers.google.com/machine-learning/guides/text-classification/)。"
「如果你希望查看进阶版的文本分类教程」=>「如果你希望查看 tf.keras 进阶版的文本分类教程」(insert "tf.keras" before 进阶版)
"## 下载 IMDB 数据集\n",
"IMDB 数据集随 TensorFlow 附带,并且已经被预处理过:单词序列已经被转换成证书序列,并且每个整数对应字典中特定的一个单词。\n",
「单词序列已经被转换成证书序列」=>「单词序列已经被转换成整数序列」(「证书」, "certificate", is a typo for 「整数」, "integer")
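The fix above matters to readers: in the preprocessed IMDB data each review is a sequence of integers, with each integer standing for one word. A plain-Python sketch of the idea, using a made-up word index rather than the real IMDB vocabulary (which comes from `tf.keras.datasets.imdb.get_word_index()`):

```python
# Toy word index; the real one is returned by
# tf.keras.datasets.imdb.get_word_index().
word_index = {"the": 1, "movie": 2, "was": 3, "great": 4, "terrible": 5}

def encode(review_words, index):
    """Map each word to its integer id (0 for words not in the index)."""
    return [index.get(w, 0) for w in review_words]

encoded = encode(["the", "movie", "was", "great"], word_index)
print(encoded)  # [1, 2, 3, 4]
```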
"## 探索数据\n",
"让我们先来看看数据的格式。数据集已经被预处理过了,其中:每个电影评论样本(一连串的单词)由一个整数数组代表,每个评论的标签是一个 0 或者 1 的整数,其中 0 代表消极的评论,1 代表积极的评论。"
「其中:每个电影评论样本(一连串的单词)由一个整数数组代表,」=>「其中:每个电影评论样本(一连串的单词)由一个整数数组代表,其中每个整数表示一个单词。」(append "where each integer represents a word")
"## 准备数据\n",
"The reviews—the arrays of integers—must be converted to tensors before fed into the neural network. This conversion can be done a couple of ways:\n",
The English here has already been translated, but the original English text wasn't deleted. Was that intentional?
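For context, the sentence under review says the integer arrays must be tensorized in one of a couple of ways. The tutorial itself pads sequences to equal length; a plain-Python sketch of the other option it names, multi-hot encoding, looks like this (the dimension here is tiny and illustrative, not the tutorial's real vocabulary size):

```python
def multi_hot(sequences, dimension=10):
    """Turn each integer sequence into a fixed-size vector:
    position i is 1.0 if word id i appears in the review."""
    results = [[0.0] * dimension for _ in sequences]
    for row, seq in zip(results, sequences):
        for idx in seq:
            row[idx] = 1.0
    return results

vectors = multi_hot([[1, 3], [2, 2, 5]], dimension=6)
print(vectors[0])  # [0.0, 1.0, 0.0, 1.0, 0.0, 0.0]
```

The resulting equal-length float vectors can be fed to a dense network directly, which is exactly why the conversion step the sentence describes is needed.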
"### 隐藏单元\n",
"The above model has two intermediate or \"hidden\" layers, between the input and output. The number of outputs (units, nodes, or neurons) is the dimension of the representational space for the layer. In other words, the amount of freedom the network is allowed when learning an internal representation.\n",
Same as a passage above: the English original wasn't removed after it was translated.
"The above model has two intermediate or \"hidden\" layers, between the input and output. The number of outputs (units, nodes, or neurons) is the dimension of the representational space for the layer. In other words, the amount of freedom the network is allowed when learning an internal representation.\n",
"上面的模型在输入和输出之间有两层隐藏层。输出向量的维度(单位,节点或神经元)是网络层的表示空间的维度。 换句话说,是网络在学习内部表示时所具有的自由度。\n",
「上面的模型在输入和输出之间有两层隐藏层」=>「上面的模型在输入和输出之间有两个中间层,或者叫“隐藏”层」(i.e. "two intermediate, or 'hidden', layers", matching the English "intermediate or hidden layers")
"上面的模型在输入和输出之间有两层隐藏层。输出向量的维度(单位,节点或神经元)是网络层的表示空间的维度。 换句话说,是网络在学习内部表示时所具有的自由度。\n",
"If a model has more hidden units (a higher-dimensional representation space), and/or more layers, then the network can learn more complex representations. However, it makes the network more computationally expensive and may lead to learning unwanted patterns—patterns that improve performance on training data but not on the test data. This is called *overfitting*, and we'll explore it later.\n",
The English original wasn't deleted here either.
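The passage being quoted ties hidden-unit count to model capacity. A quick back-of-envelope check makes that concrete: the parameter count of a fully connected layer grows linearly with its unit count. This is a plain-Python sketch with illustrative sizes, not the tutorial's actual model:

```python
def dense_params(in_dim, units):
    """Parameters of one fully connected layer: weights plus biases."""
    return in_dim * units + units

# Capacity grows with hidden units: compare a 16-unit and a 64-unit
# hidden layer on a 16-dimensional input, each followed by a 1-unit
# output layer (sizes are illustrative).
small = dense_params(16, 16) + dense_params(16, 1)
large = dense_params(16, 64) + dense_params(64, 1)
print(small, large)  # 289 1153
```

More parameters mean more representational freedom, which is exactly the overfitting trade-off the quoted paragraph goes on to describe.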
"## 评估模型\n",
"让我们看看模型最终表现的怎么样,我们将得到两个指标:loss(代表模型的错误,越低越好)以及准确率。"
「loss(代表模型的错误,越低越好)以及准确率。」=>「Loss(代表模型的错误,值越低越好)以及准确率。」(capitalize "Loss" and write 值越低越好, "the lower the value, the better")
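The sentence being corrected describes the two numbers `model.evaluate` returns for this tutorial: binary cross-entropy loss and accuracy. A plain-Python sketch of what each metric actually computes (the labels and predicted probabilities below are made up):

```python
import math

def binary_crossentropy(y_true, y_pred):
    """Mean log loss over examples; lower is better, as the text says."""
    eps = 1e-7  # avoid log(0)
    return -sum(t * math.log(p + eps) + (1 - t) * math.log(1 - p + eps)
                for t, p in zip(y_true, y_pred)) / len(y_true)

def accuracy(y_true, y_pred, threshold=0.5):
    """Fraction of examples whose thresholded prediction matches the label."""
    hits = sum(int(p > threshold) == t for t, p in zip(y_true, y_pred))
    return hits / len(y_true)

labels = [1, 0, 1, 1]
probs = [0.9, 0.2, 0.6, 0.4]  # hypothetical model outputs
print(accuracy(labels, probs))  # 0.75
```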
@leviding @TobiasLee Proofreading complete.
@TobiasLee You can make the changes now.
@leviding Changes done.
@TobiasLee Please take a look: the .ipynb file renders differently from the English original preview https://github.com/xitu/tensorflow-docs/blob/v1.10/tutorials/keras/basic_text_classification.ipynb See what the problem is.
@leviding Could you take another look? It seems one cell had the wrong type before.
@TobiasLee Still different. Compare the two side by side; it should be obvious enough that no screenshot is needed, right? The numbers in the leading In [ ]: prompts don't show, and there is an extra code block at the end of the file.
@leviding The leading execution numbers don't show because I cleared the run history, so the notebook is in an unrun state. Jupyter Notebooks published online are generally like this, so I suggest keeping it as is. The extra blank line at the end has been removed.
@TobiasLee OK, thanks for your work~
Thanks, everyone 👍
resolve: #96
I don't know how to update the Colab Notebook link in the .md file... please check it, @leviding ~ Also, I edited the .ipynb directly in Jupyter Notebook, so I'm not sure whether that introduced any problems. Let me know any time if something is wrong. Thanks!