python文件操作的基本流程

本文介绍: 这些资料，对于【软件测试】的朋友来说应该是最全面最完整的备战仓库，这个仓库也陪伴上万个测试工程师们走过最艰难的路程，希望也能帮助到你！

程序运行过程中产生的数据会保存到内存中，如果想要永久保存下来，就必须将数据存放在硬盘上，应用程序如果想要操作计算机的硬件就必须通过操作系统，文件就是操作系统提供给应用程序来操作硬盘的虚拟概念，应用程序操作文件就是向操作系统发送调用，由操作系统完成对硬盘的操作。

比如想打开电脑桌面上一个word文档进行操作，步骤应该是：1、双击打开文档； 2、进行某些操作，比如读文件、修改文件等；3、保存后关闭文件。

使用python实现对文件的操作也遵循这三个步骤：

# open()方法常用方式是接收三个参数:文件绝对/相对路径,操作文件的方式,编码格式
# 1.打开文件，应用程序向操作系统发送调用，操作系统打开文件(硬盘上的一块空间)，返回一个文件对象赋值给变量file
file = open(r'D:文件.txt', 'r', encoding='utf-8')  # 以读模式打开文件，打开文件的字符编码是utf-8
# 注意：在python中有特殊意义，当路径是绝对路径时，需要在路径字符串前加r进行转义；文件路径如果是文件名的话，python会在当前程序文件夹所在路径下去找该文件。

# 2、对文件进行操作，比如读文件---调用文件对象的读方法
data = file.read()

# 3、关闭文件，向操作系统发送关闭文件的请求
file.close()

使用python打开一个文件之后产生了两部分的内存空间占用，一部分是文件打开后占用的内存空间，另一部分是打开后产生的文件对象，文件操作完成之后需要回收这两部分的内存空间.

file.close()  # 删除文件打开后占用的内存
del file  # 删除文件对象的内存

由于python垃圾回收机制，我们无需考虑删除文件对象这一步，但是在操作完文件之后必须要关闭文件，就是f.close()，否则在电脑上不停的打开文件而不关闭，电脑的内存迟早会被用尽，尽管如此，可能还会有粗心的小伙伴忘记关闭文件，python为了防止这一情况，提供了with关键字来帮助我们管理从打开到关闭整个上下文的流程，因此with关键字也称为with上下文管理。

# 执行完with下的子代码块之后会自动执行f.close()的操作，再也不用担心忘记关闭文件了。
with open('文件.txt', 'r') as f:  # 打开文件，将文件对象赋值给变量f
    pass   # pass是什么都不做，可以用来占位

# 可以用with同时打开多个文件，用逗号分隔
with open('file1.txt', 'r') as f1, open('file2.txt', 'r') as f2:
    data1 = f1.read()
    data2 = f2.read()

由于使用python打开文件的时候是通过操作系统完成的，如果打开的文件是文本文件，会涉及到字符编码的问题，如果在打开文件时没有指定字符编码，操作系统就会使用自己默认的编码打开文件(windows下是gbk，在linux下是utf-8)，如果要保证不乱码，文件以什么编码格式存的就要以什么格式打开。

with open('a.txt', mode='r', encoding='utf-8') as f:
    data = f.read()

with open('a.txt', 'w', encoding='utf-8') as f:
    f.write('hello worldn')  # n表示换行
    f.write('my name is python')

with open('a.txt', 'a', encoding='utf-8') as f:
    f.write('追加的1n')
    f.write('追加的2n')

# 如果打开文件时指定打开模式为r/w/a，其实默认就是rt/wt/at
with open('a.txt', 'wt', encoding='utf-8') as f:
    f.write('haha')  # 写入的数据必须也是字符串格式

with open('a.txt', 'wb') as f:
    info = 'name'
    # 需要将写入文件的数据转成二进制格式，使用encode()方法指定编码格式可以实现
    res = info.encode('utf-8')
    f.write(res)

f.read()  # 一次性读取所有内容，如果文件过大，会导致内存不足
f.readline()  # 每次读取文件的一行内容，读完一行后，光标移至第二行行首
f.readlines()  # 一次性读取所有内容，将每一行内容存放于列表中

# 方式一
with open('a.txt',mode='rt',encoding='utf-8') as f:
    for line in f:
        print(line) # 同一时刻只读入一行内容到内存中
# 方式二
with open('好汉歌.mp3', mode='rb') as f:
    while True:
        data=f.read(1024) # 同一时刻只读入1024个Bytes到内存中
        if len(data) == 0:
            break
        print(data)

f.write('hellonworldn')  # 针对文本模式的写,需要自己写换行符
f.write('1111n222n'.encode('utf-8'))  # 针对b模式的写,需要自己写换行符
f.writelines(['333n','444n'])  # 文件模式
f.writelines([bytes('333n',encoding='utf-8'),'444n'.encode('utf-8')]) #b模式

with open('a.txt',mode='rt',encoding='utf-8') as f:
     data=f.read(3) # 读取3个字符
        
with open('a.txt',mode='rb') as f:
     data=f.read(3) # 读取3个Bytes

# 0模式
with open('a.txt',mode='rt',encoding='utf-8') as f:
    f.seek(3,0)     # 参照文件开头移动了3个字节
    print(f.tell())  # 查看当前文件指针相对于文件开头的位置
    
# 1模式
with open('a.txt',mode='rb') as f:
    f.seek(3,1) # 从当前位置往后移动3个字节，而此时的当前位置就是文件开头
    print(f.tell()) # 输出结果为：3
    
# 2模式
with open('a.txt',mode='rb') as f:
    f.seek(-3,2)     # 参照文件末尾往前移动了3个字节
    print(f.read().decode('utf-8')) # 输出结果为：好

'''
优点：文件修改过程中只有同一份数据
缺点：内存占用过多
'''
# 先读入内存
with open('db.txt',mode='rt',encoding='utf-8') as f:
    data=f.read()
# 修改内存中的数据
with open('db.txt',mode='wt',encoding='utf-8') as f:
    f.write(data.replace('xxx','python'))

'''
优点：不会占用过多的内存
缺点：需要借助临时文件
'''
import os  # 需要借助os模块进行删除文件和文件重命名操作

with open('old.txt',mode='rt',encoding='utf-8') as read_f,
        open('new.txt',mode='wt',encoding='utf-8') as wrife_f:
    for line in read_f:
        wrife_f.write(line.replace('python','xxx'))

os.remove('old.txt')   # 删除未修改的文件
os.rename('new.txt','old.txt')  # 将新文件重命名为和旧文件相同的名字

username = input('please input your username:').strip()
password = input('please input your password:').strip()
with open(r'F:FullStackPython_basedinfo', 'r', encoding='utf-8') as f:
    for line in f:
        info = line.strip('n')  # 去掉每一行后面的换行符，默认文件中每换一行，每行后面都会有换行符
        name, pwd = info.split(':')  # 解压赋值
        if username == name and password == pwd:
            print('successful!')
    else:  # 要循环完成所有信息之后才知道用户名和密码是否正确
        print('invalid username or password!')

username = input('please input your username:').strip()
password = input('please input your password:').strip()
with open(r'F:FullStackPython_basedinfo', 'a', encoding='utf-8') as f:
    info = f'{username}:{password}n'
    f.write(info)

src_path = input('Please enter the path where you want to copy the file:')
target_path = input('please enter the target path:')
with open(r'{}'.format(src_path), mode='rb') as f,
        open('{}'.format(target_path), mode='wb') as f1:
    for line in f:
        f1.write(line)