1引言¶

TensorFlow2.0版本已经发布，虽然不是正式版，但预览版都发布了，正式版还会远吗？相比于1.X，2.0版的TensorFlow修改的不是一点半点，这些修改极大的弥补了1.X版本的反人类设计，提升了框架的整体易用性，绝对好评！

赶紧来学习一波吧，做最先吃螃蟹的那一批人！先从TensorFlow的基本数据结构——张量（tensor）开始。

2 创建¶

2.1 constant()方法¶

import tensorflow as tf

tf.constant(1)  # 创建一个整型张量

<tf.Tensor: id=0, shape=(), dtype=int32, numpy=1>

tf.constant(1.)  # 创建一个浮点型张量

<tf.Tensor: id=2, shape=(), dtype=float32, numpy=1.0>

tf.constant(2., dtype=tf.double)  # 创建的同时指定数据类型

<tf.Tensor: id=4, shape=(), dtype=float64, numpy=2.0>

tf.constant([[1.,2.,3.],[4.,5.,6.]])  # 通过传入一个list参数创建

<tf.Tensor: id=6, shape=(2, 3), dtype=float32, numpy=  array([[1., 2., 3.],         [4., 5., 6.]], dtype=float32)>

如果输入的数据与指定的数据类型不相符，会产生以下异常：
TypeError: Cannot convert provided value to EagerTensor. Provided value: 2.1 Requested dtype: int32

2.2 convert_to_tensor()方法¶

import numpy as np

tf.convert_to_tensor(np.ones([2, 3]))

<tf.Tensor: id=9, shape=(2, 3), dtype=float64, numpy=  array([[1., 1., 1.],         [1., 1., 1.]])>

tf.convert_to_tensor(np.ones([2, 3]))

<tf.Tensor: id=11, shape=(2, 3), dtype=float64, numpy=  array([[1., 1., 1.],         [1., 1., 1.]])>

tf.convert_to_tensor([[2.,3.],[3., 4.]])

<tf.Tensor: id=13, shape=(2, 2), dtype=float32, numpy=  array([[2., 3.],         [3., 4.]], dtype=float32)>

2.3 创建元素为指定值的tensor¶

如果你熟悉numpy创建数组的方法，你一定见过zeros()、ones()等方法，TensorFlow中也有这些方法。

（1）zeros()与ones()

a = tf.zeros([2, 3, 3])  # 创建一个元素全为0，形状为[2, 3, 3]的tensor

a

<tf.Tensor: id=46, shape=(2, 3, 3), dtype=float32, numpy=  array([[[0., 0., 0.],          [0., 0., 0.],          [0., 0., 0.]],           [[0., 0., 0.],          [0., 0., 0.],          [0., 0., 0.]]], dtype=float32)>

b = tf.ones([2, 3])  #  创建一个元素全为1，形状为[2, 3]的tensor

b

<tf.Tensor: id=50, shape=(2, 3), dtype=float32, numpy=  array([[1., 1., 1.],         [1., 1., 1.]], dtype=float32)>

（2）zeros_like()与ones_like

tf.zeros_like(b)  # 仿照b的shape创建一个全为0的tensor

<tf.Tensor: id=52, shape=(2, 3), dtype=float32, numpy=  array([[0., 0., 0.],         [0., 0., 0.]], dtype=float32)>

tf.ones_like(a)  # 仿照b的shape创建一个全为1的tensor

<tf.Tensor: id=56, shape=(2, 3, 3), dtype=float32, numpy=  array([[[1., 1., 1.],          [1., 1., 1.],          [1., 1., 1.]],           [[1., 1., 1.],          [1., 1., 1.],          [1., 1., 1.]]], dtype=float32)>

（3）fill()

tf.fill([2,3],5)  # 创建元素全为5，形状为[2,3]的tensor

<tf.Tensor: id=38, shape=(2, 3), dtype=int32, numpy=  array([[5, 5, 5],         [5, 5, 5]])>

2.4 随机初始化¶

在实际应用中，经常需要随机初始化元素服从某种分布的tensor，TensorFlow中也提供了这种功能。

（1）从指定正态分布中随机取值：tf.random.normal()。例如，随机初始化一个元素服从均值为1，方差为1的正态分布且形状为[2, 3]的tensor：

tf.random.normal([2, 3], mean=1, stddev=1)

<tf.Tensor: id=63, shape=(2, 3), dtype=float32, numpy=  array([[ 1.7034731 ,  0.4979009 ,  1.4266468 ],         [-0.33414853,  0.2618034 ,  0.3966313 ]], dtype=float32)>

（2）从指定的截断正态分布中随机取值：truncated_normal()。意思是从指定的正太分布中取值，但是取值范围在两个标准差范围内，也就是：[ mean – 2 stddev, mean + 2 stddev ]

tf.random.truncated_normal([2, 3], mean=1, stddev=1)

<tf.Tensor: id=70, shape=(2, 3), dtype=float32, numpy=  array([[0.71736836, 1.7930655 , 0.47575486],         [0.83504593, 0.7969478 , 0.6002228 ]], dtype=float32)>

（3）从指定均匀分布中随机取值：tf.random.uniform()。

tf.random.uniform([2, 3], minval=1, maxval=2) # 在1~2之间均匀分布

<tf.Tensor: id=78, shape=(2, 3), dtype=float32, numpy=  array([[1.7117869, 1.2625391, 1.6652637],         [1.3810604, 1.0297629, 1.1268978]], dtype=float32)>

3 索引¶

a = tf.convert_to_tensor(np.arange(80).reshape(2,2,4,5))

a

<tf.Tensor: id=80, shape=(2, 2, 4, 5), dtype=int32, numpy=  array([[[[ 0,  1,  2,  3,  4],           [ 5,  6,  7,  8,  9],           [10, 11, 12, 13, 14],           [15, 16, 17, 18, 19]],            [[20, 21, 22, 23, 24],           [25, 26, 27, 28, 29],           [30, 31, 32, 33, 34],           [35, 36, 37, 38, 39]]],             [[[40, 41, 42, 43, 44],           [45, 46, 47, 48, 49],           [50, 51, 52, 53, 54],           [55, 56, 57, 58, 59]],            [[60, 61, 62, 63, 64],           [65, 66, 67, 68, 69],           [70, 71, 72, 73, 74],           [75, 76, 77, 78, 79]]]])>

3.1 基础索引¶

TensorFlow支持Python原生的基础索引方式，即多个方括号逐步索引取值：[idx][idx][idx]，每个方括号对应一个维度。

a[0]  # 取第一个维度

<tf.Tensor: id=85, shape=(2, 4, 5), dtype=int32, numpy=  array([[[ 0,  1,  2,  3,  4],          [ 5,  6,  7,  8,  9],          [10, 11, 12, 13, 14],          [15, 16, 17, 18, 19]],           [[20, 21, 22, 23, 24],          [25, 26, 27, 28, 29],          [30, 31, 32, 33, 34],          [35, 36, 37, 38, 39]]])>

a[0][1]  # 同时筛选两个维度

<tf.Tensor: id=94, shape=(4, 5), dtype=int32, numpy=  array([[20, 21, 22, 23, 24],         [25, 26, 27, 28, 29],         [30, 31, 32, 33, 34],         [35, 36, 37, 38, 39]])>

a[0][1][3][3]  # 同时对4个维度进行筛选

<tf.Tensor: id=111, shape=(), dtype=int32, numpy=38>

这种索引数据的方法简单，易于理解，但是可读性差，只能按维度依次索引数据，也不能索引列。

3.2 numpy索引¶

TensorFlow也继承了numpy中的部分索引方式，如果对numpy索引方式不熟悉，可以查看我的前几篇博客。
（1）[idx1, idx2, idx3]
这种索引方式是在一个方括号内写下所有的索引，每个索引序号之间用逗号隔开。

a[1]  # 筛选第一维度，这跟基础索引一样

<tf.Tensor: id=116, shape=(2, 4, 5), dtype=int32, numpy=  array([[[40, 41, 42, 43, 44],          [45, 46, 47, 48, 49],          [50, 51, 52, 53, 54],          [55, 56, 57, 58, 59]],           [[60, 61, 62, 63, 64],          [65, 66, 67, 68, 69],          [70, 71, 72, 73, 74],          [75, 76, 77, 78, 79]]])>

a[1,1, 3]  # 同时帅选3个维度

<tf.Tensor: id=121, shape=(5,), dtype=int32, numpy=array([75, 76, 77, 78, 79])>

（2）冒号切片与步长：[start:end:step]

这种索引方式在Python原生的list类型中也是常见的，而且使用方法也是一样的。

a[1,:,0:2] # 对第1维度选第二块数据，对第二维度选所有数据，对第三维度选前两行

<tf.Tensor: id=126, shape=(2, 2, 5), dtype=int32, numpy=  array([[[40, 41, 42, 43, 44],          [45, 46, 47, 48, 49]],           [[60, 61, 62, 63, 64],          [65, 66, 67, 68, 69]]])>

a[1,:,0:2,0:4] # 继续上面的例子，对第4维度筛选去前4列

<tf.Tensor: id=131, shape=(2, 2, 4), dtype=int32, numpy=  array([[[40, 41, 42, 43],          [45, 46, 47, 48]],           [[60, 61, 62, 63],          [65, 66, 67, 68]]])>

a[1,:,0:2,0:4:2] # 对第4维度加上步长，每隔一个数据取一次

<tf.Tensor: id=136, shape=(2, 2, 2), dtype=int32, numpy=  array([[[40, 42],          [45, 47]],           [[60, 62],          [65, 67]]])>

也可以使用负值步长表示逆序索引，但要注意，负数步长时，原本的[start : end : step]也要跟着编程[end : start : step]：

a[1,:,0:2,4:0:-1]

<tf.Tensor: id=141, shape=(2, 2, 4), dtype=int32, numpy=  array([[[44, 43, 42, 41],          [49, 48, 47, 46]],           [[64, 63, 62, 61],          [69, 68, 67, 66]]])>

a[1,:,0:2,4:0:-2]

<tf.Tensor: id=146, shape=(2, 2, 2), dtype=int32, numpy=  array([[[44, 42],          [49, 47]],           [[64, 62],          [69, 67]]])>

在numpy和TensorFlow中还有“…”（三个英文句号）的使用，“…”用于表示连续多个维度全选：

a[1,...,0:4] # 等同于a[1, : , : ,0:4]

<tf.Tensor: id=151, shape=(2, 4, 4), dtype=int32, numpy=  array([[[40, 41, 42, 43],          [45, 46, 47, 48],          [50, 51, 52, 53],          [55, 56, 57, 58]],           [[60, 61, 62, 63],          [65, 66, 67, 68],          [70, 71, 72, 73],          [75, 76, 77, 78]]])>

a[0,0,...] # 等同于a[0,0,:,:]

<tf.Tensor: id=156, shape=(4, 5), dtype=int32, numpy=  array([[ 0,  1,  2,  3,  4],         [ 5,  6,  7,  8,  9],         [10, 11, 12, 13, 14],         [15, 16, 17, 18, 19]])>

3.3 gather与gather_nd¶

gather与gather_nd是指TensorFlow通过gather()方法和gather_nd()方法提供的两种索引方式。在numpy中，可以通过嵌套list的方式来指定无规则的索引：

b = np.arange(20).reshape(4,5)

b[1, [0,3,4]] # 选取第2行的第1列、第4列、第5列

array([5, 8, 9])

但是在TensorFlow中，这种索引方式并没有从numpy中继承下来,所以如果在Tensor中使用这种方式，会抛出以下异常：
TypeError: Only integers, slices (:), ellipsis (...), tf.newaxis (None) and scalar tf.int32/tf.int64 tensors are valid indices, got [0, 3, 4]

还好的是，在TensorFlow中通过gather()方法和gather_nd()方法提供了这种索引方法。

（1）gather()方法

tf.gather(b, axis=0, indices=[0, 2, 3]) # 选取第1行，第3行，第4行

<tf.Tensor: id=163, shape=(3, 5), dtype=int32, numpy=  array([[ 0,  1,  2,  3,  4],         [10, 11, 12, 13, 14],         [15, 16, 17, 18, 19]])>

tf.gather(b, axis=1, indices=[0, 2, 3]) # 选取第1列，第3列，第4列

<tf.Tensor: id=168, shape=(4, 3), dtype=int32, numpy=  array([[ 0,  2,  3],         [ 5,  7,  8],         [10, 12, 13],         [15, 17, 18]])>

仔细观察上面gather()方法例子，可以发现，第一个参数时数据源，还有两个参数中，axis指的是将要的维度，indices指的是需要选取的序号。

（2）gather_nd()

gather()方法一次只能对一个维度进行索引，gather_nd()方法可以同时对多个维度进行索引。

tf.gather_nd(b, [[0, 2],[3, 3]]) # 选取第1行第3列的那个数据，和第4行第4列的数据

<tf.Tensor: id=172, shape=(2,), dtype=int32, numpy=array([ 2, 18])>

4 维度变换¶

4.1 reshape()¶

numpy中的ndarray数组有个一reshape()方法，用来改变数组的shape，TensorFlow中的reshape()方法，功能也是一样的，不过TensorFlow中的reshape()没有绑定到tensor中：

a = tf.ones([2,3,4])

a.shape

TensorShape([2, 3, 4])

a

<tf.Tensor: id=176, shape=(2, 3, 4), dtype=float32, numpy=  array([[[1., 1., 1., 1.],          [1., 1., 1., 1.],          [1., 1., 1., 1.]],           [[1., 1., 1., 1.],          [1., 1., 1., 1.],          [1., 1., 1., 1.]]], dtype=float32)>

b = tf.reshape(a, [2, 2, 6])

b.shape

TensorShape([2, 2, 6])

b

<tf.Tensor: id=179, shape=(2, 2, 6), dtype=float32, numpy=  array([[[1., 1., 1., 1., 1., 1.],          [1., 1., 1., 1., 1., 1.]],           [[1., 1., 1., 1., 1., 1.],          [1., 1., 1., 1., 1., 1.]]], dtype=float32)>

c = tf.reshape(a, [3, 2, 4])

c

<tf.Tensor: id=183, shape=(3, 2, 4), dtype=float32, numpy=  array([[[1., 1., 1., 1.],          [1., 1., 1., 1.]],           [[1., 1., 1., 1.],          [1., 1., 1., 1.]],           [[1., 1., 1., 1.],          [1., 1., 1., 1.]]], dtype=float32)>

可以看到，在上面的例子中，通过reshape()方法可以很方便的改变tensor的形状，得到一个新的tensor，需要注意的是在进行维度变换时，数据的重量是不变的，上面的例子无论是[2,3,4]， [2, 2, 6]还是[3, 2, 4]都对应总量24，如果对应不上，就会产生异常。

4.2 转置：transpose()¶

transpose()方法提供了一种类似于装置的操作：

a = tf.constant([[1,2,3],[4,5,6]])

a.shape

TensorShape([2, 3])

b = tf.transpose(a)

b.shape

TensorShape([3, 2])

b

<tf.Tensor: id=192, shape=(3, 2), dtype=int32, numpy=  array([[1, 4],         [2, 5],         [3, 6]])>

在默认情况下，transpose()方法会将所有维度按逆序方式完全转置，当然也可以通过perm参数执行需要转置的维度：

a=tf.constant([[[1,2,3],[4,5,6]],[[7,8,9],[10,11,12]]])

a

<tf.Tensor: id=194, shape=(2, 2, 3), dtype=int32, numpy=  array([[[ 1,  2,  3],          [ 4,  5,  6]],           [[ 7,  8,  9],          [10, 11, 12]]])>

b = tf.transpose(a) # 不指定perm参数时，相当于tf.transpose(a, perm=[2, 1, 0])

b

<tf.Tensor: id=197, shape=(3, 2, 2), dtype=int32, numpy=  array([[[ 1,  7],          [ 4, 10]],           [[ 2,  8],          [ 5, 11]],           [[ 3,  9],          [ 6, 12]]])>

c = tf.transpose(a, perm=[2, 1, 0])

c

<tf.Tensor: id=200, shape=(3, 2, 2), dtype=int32, numpy=  array([[[ 1,  7],          [ 4, 10]],           [[ 2,  8],          [ 5, 11]],           [[ 3,  9],          [ 6, 12]]])>

d = tf.transpose(a, perm=[0, 2, 1]) # 第一个维度不做变换，对第二、第三维度进行转置

d

<tf.Tensor: id=203, shape=(2, 3, 2), dtype=int32, numpy=  array([[[ 1,  4],          [ 2,  5],          [ 3,  6]],           [[ 7, 10],          [ 8, 11],          [ 9, 12]]])>

4.3 添加维度：expand_dims()¶

a=tf.constant([[1,2,3],[4,5,6]])

a

<tf.Tensor: id=205, shape=(2, 3), dtype=int32, numpy=  array([[1, 2, 3],         [4, 5, 6]])>

tf.expand_dims(a, axis=0)

<tf.Tensor: id=208, shape=(1, 2, 3), dtype=int32, numpy=  array([[[1, 2, 3],          [4, 5, 6]]])>

tf.expand_dims(a, axis=1)

<tf.Tensor: id=211, shape=(2, 1, 3), dtype=int32, numpy=  array([[[1, 2, 3]],           [[4, 5, 6]]])>

tf.expand_dims(a, axis=-1)

<tf.Tensor: id=214, shape=(2, 3, 1), dtype=int32, numpy=  array([[[1],          [2],          [3]],           [[4],          [5],          [6]]])>

tf.expand_dims(a, axis=2)

<tf.Tensor: id=217, shape=(2, 3, 1), dtype=int32, numpy=  array([[[1],          [2],          [3]],           [[4],          [5],          [6]]])>

expand_dims()方法添加维度时，通过axis参数指定添加维度的位置，正数表示从前往后数，负数表示从后往前数。

4.4 压缩维度：squeeze()¶

squeeze()方法与expand_dims()方法作用刚好相反，其作用是删除张量中dim为1的维度：

a = tf.ones([1,3,1,2])

a

<tf.Tensor: id=221, shape=(1, 3, 1, 2), dtype=float32, numpy=  array([[[[1., 1.]],            [[1., 1.]],            [[1., 1.]]]], dtype=float32)>

tf.squeeze(a)

<tf.Tensor: id=223, shape=(3, 2), dtype=float32, numpy=  array([[1., 1.],         [1., 1.],         [1., 1.]], dtype=float32)>

TensorFlow2.0（一）：基本数据结构——张量

1引言¶

2 创建¶

2.1 constant()方法¶

2.2 convert_to_tensor()方法¶

2.3 创建元素为指定值的tensor¶

2.4 随机初始化¶

3 索引¶

3.1 基础索引¶

3.2 numpy索引¶

3.3 gather与gather_nd¶

4 维度变换¶

4.1 reshape()¶

4.2 转置：transpose()¶

4.3 添加维度：expand_dims()¶

4.4 压缩维度：squeeze()¶

VirMach 便宜 VPS

QNews

TensorFlow2.0（一）：基本数据结构——张量

1引言¶

2 创建¶

2.1 constant()方法¶

2.2 convert_to_tensor()方法¶

2.3 创建元素为指定值的tensor¶

2.4 随机初始化¶

3 索引¶

3.1 基础索引¶

3.2 numpy索引¶

3.3 gather与gather_nd¶

4 维度变换¶

4.1 reshape()¶

4.2 转置：transpose()¶

4.3 添加维度：expand_dims()¶

4.4 压缩维度：squeeze()¶

分享此文：

Related Posts

rpm包管理基础

任何人都需要知道的「世界时间系统」构成原理，尤其开发人员

语音信号的梅尔频率倒谱系数(MFCC)的原理讲解及python实现

微信支付中分账功能 填坑指南V1

VirMach 便宜 VPS

QNews

热门文章

热门搜寻

微信支付中分账功能填坑指南V1