在本章中，我们将使用基本系列/索引来学习字符串操作。在随后的章节中，将学习如何将这些字符串函数应用于数据帧(DataFrame)。

Pandas提供了一组字符串函数，可以方便地对字符串数据进行操作。最重要的是，这些函数忽略(或排除)丢失/ NaN 值。

几乎这些方法都使用Python字符串函数。因此，将 Series 对象转换为 String 对象，然后执行该操作。

字符串函数操作的执行和说明

下面来看看每个操作的执行和说明。

编号	函数	描述
1	`lower()`	将`Series/Index`中的字符串转换为小写。
2	`upper()`	将`Series/Index`中的字符串转换为大写。
3	`len()`	计算字符串长度。
4	`strip()`	帮助从两侧的系列/索引中的每个字符串中删除空格(包括换行符)。
5	`split(' ')`	用给定的模式拆分每个字符串。
6	`cat(sep=' ')`	使用给定的分隔符连接系列/索引元素。
7	`get_dummies()`	返回具有单热编码值的数据帧(DataFrame)。
8	`contains(pattern)`	如果元素中包含子字符串，则返回每个元素的布尔值`True`，否则为`False`。
9	`replace(a,b)`	将值`a`替换为值`b`。
10	`repeat(value)`	重复每个元素指定的次数。
11	`count(pattern)`	返回模式中每个元素的出现总数。
12	`startswith(pattern)`	如果系列/索引中的元素以模式开始，则返回`true`。
13	`endswith(pattern)`	如果系列/索引中的元素以模式结束，则返回`true`。
14	`find(pattern)`	返回模式第一次出现的位置。
15	`findall(pattern)`	返回模式的所有出现的列表。
16	`swapcase`	变换字母大小写。
17	`islower()`	检查系列/索引中每个字符串中的所有字符是否小写，返回布尔值
18	`isupper()`	检查系列/索引中每个字符串中的所有字符是否大写，返回布尔值
19	`isnumeric()`	检查系列/索引中每个字符串中的所有字符是否为数字，返回布尔值。

现在创建一个系列，看看上述所有函数是如何工作的。

In [2]:

import pandas as pd
import numpy as np

s = pd.Series(['Tom', 'William Rick', 'John', 'Alber@t', np.nan, '1234','SteveMinsu'])
s

Out[2]:

0             Tom
1    William Rick
2            John
3         Alber@t
4             NaN
5            1234
6      SteveMinsu
dtype: object

lower()函数示例

In [3]:

s.str.lower()

Out[3]:

0             tom
1    william rick
2            john
3         alber@t
4             NaN
5            1234
6      steveminsu
dtype: object

执行上面示例代码，得到上面运行结果。

`upper()` 函数示例

In [4]:

s.str.upper()

Out[4]:

0             TOM
1    WILLIAM RICK
2            JOHN
3         ALBER@T
4             NaN
5            1234
6      STEVEMINSU
dtype: object

执行上面示例代码，得到上面运行结果。

`len()` 函数示例

In [5]:

s.str.len()

Out[5]:

0     3.0
1    12.0
2     4.0
3     7.0
4     NaN
5     4.0
6    10.0
dtype: float64

执行上面示例代码，得到上面运行结果。

`strip()` 函数示例

In [6]:

s = pd.Series(['Tom ', ' William Rick', 'John', 'Alber@t'])
s

Out[6]:

0             Tom 
1     William Rick
2             John
3          Alber@t
dtype: object

In [7]:

s.str.strip()

Out[7]:

0             Tom
1    William Rick
2            John
3         Alber@t
dtype: object

执行上面示例代码，得到上面运行结果。

`split(pattern)` 函数示例

In [8]:

s.str.split(' ')

Out[8]:

0              [Tom, ]
1    [, William, Rick]
2               [John]
3            [Alber@t]
dtype: object

`cat(sep=pattern)` 函数示例

In [9]:

s.str.cat(sep=' <=> ')

Out[9]:

'Tom  <=>  William Rick <=> John <=> Alber@t'

`get_dummies()` 函数示例

In [10]:

s.str.get_dummies()

Out[10]:

	William Rick	Alber@t	John	Tom
0	0	0	0	1
1	1	0	0	0
2	0	0	1	0
3	0	1	0	0

`contains()` 函数示例

In [11]:

s.str.contains(' ')

Out[11]:

0     True
1     True
2    False
3    False
dtype: bool

`replace(a,b)` 函数示例

In [12]:

s.str.replace('@','$')

Out[12]:

0             Tom 
1     William Rick
2             John
3          Alber$t
dtype: object

`repeat(value)` 函数示例

In [13]:

s.str.repeat(2)

Out[13]:

0                      Tom Tom 
1     William Rick William Rick
2                      JohnJohn
3                Alber@tAlber@t
dtype: object

`count(pattern)` 函数示例

In [14]:

s.str.count('m')

Out[14]:

0    1
1    1
2    0
3    0
dtype: int64

`startswith(pattern)` 函数示例

In [15]:

s.str. startswith ('T')

Out[15]:

0     True
1    False
2    False
3    False
dtype: bool

`endswith(pattern)` 函数示例

In [16]:

s.str.endswith('t')

Out[16]:

0    False
1    False
2    False
3     True
dtype: bool

执行上面示例代码，得到上面运行的结果。

`find(pattern)` 函数示例

In [17]:

s.str.find('e')

Out[17]:

0   -1
1   -1
2   -1
3    3
dtype: int64

注意：-1表示元素中没有这样的模式可用。

`findall(pattern)` 函数示例

In [18]:

s.str.findall('e')

Out[18]:

0     []
1     []
2     []
3    [e]
dtype: object

空列表( [] )表示元素中没有这样的模式可用。

`swapcase()` 函数示例

In [19]:

s.str.swapcase()

Out[19]:

0             tOM 
1     wILLIAM rICK
2             jOHN
3          aLBER@T
dtype: object

`islower()` 函数示例

In [20]:

s.str.islower()

Out[20]:

0    False
1    False
2    False
3    False
dtype: bool

`isupper()` 函数示例

In [21]:

s.str.isupper()

Out[21]:

0    False
1    False
2    False
3    False
dtype: bool

`isnumeric()` 函数示例

In [22]:

s.str.isnumeric()

Out[22]:

0    False
1    False
2    False
3    False
dtype: bool

字符串函数操作的执行和说明

lower()函数示例

`upper()` 函数示例

`len()` 函数示例

`strip()` 函数示例

`split(pattern)` 函数示例

`cat(sep=pattern)` 函数示例

`get_dummies()` 函数示例

`contains()` 函数示例

`replace(a,b)` 函数示例

`repeat(value)` 函数示例

`count(pattern)` 函数示例

`startswith(pattern)` 函数示例

`endswith(pattern)` 函数示例

`find(pattern)` 函数示例

`findall(pattern)` 函数示例

`swapcase()` 函数示例

`islower()` 函数示例

`isupper()` 函数示例

`isnumeric()` 函数示例

① 阅读使用手册

② 注册用户账号

介绍

平台内核

注意事项

字符串函数操作的执行和说明

lower()函数示例

upper() 函数示例

len() 函数示例

strip() 函数示例

split(pattern) 函数示例

cat(sep=pattern) 函数示例

get_dummies() 函数示例

contains() 函数示例

replace(a,b) 函数示例

repeat(value) 函数示例

count(pattern) 函数示例

startswith(pattern) 函数示例

endswith(pattern) 函数示例

find(pattern) 函数示例

findall(pattern) 函数示例

swapcase() 函数示例

islower() 函数示例

isupper() 函数示例

isnumeric() 函数示例

① 阅读使用手册

② 注册用户账号

③ 登陆

Python基础

Python进阶

标准类库

专题工具

图像处理

科学计算

自然语言

开源GIS

R 编程语言

Julia编程语言

介绍

平台内核

注意事项

`upper()` 函数示例

`len()` 函数示例

`strip()` 函数示例

`split(pattern)` 函数示例

`cat(sep=pattern)` 函数示例

`get_dummies()` 函数示例

`contains()` 函数示例

`replace(a,b)` 函数示例

`repeat(value)` 函数示例

`count(pattern)` 函数示例

`startswith(pattern)` 函数示例

`endswith(pattern)` 函数示例

`find(pattern)` 函数示例

`findall(pattern)` 函数示例

`swapcase()` 函数示例

`islower()` 函数示例

`isupper()` 函数示例

`isnumeric()` 函数示例