파이썬은 텍스트 파일을 연결

Programing

파이썬은 텍스트 파일을 연결

lottogame 2020. 6. 14. 10:21

파이썬은 텍스트 파일을 연결

와 같은 20 개의 파일 이름 목록이 ['file1.txt', 'file2.txt', ...]있습니다. 이 파일을 새 파일로 연결하는 Python 스크립트를 작성하고 싶습니다. 으로 각 파일을 f = open(...)열고을 호출하여 한 줄씩 읽고 f.readline()새 줄에 각 줄을 쓸 수 있습니다. 그것은 나에게 매우 "우아한"것처럼 보이지 않습니다. 특히 한 줄씩 읽거나 써야하는 부분입니다.

파이썬에서 이것을하는 더 "우아한"방법이 있습니까?

이거해야 해

큰 파일의 경우 :

filenames = ['file1.txt', 'file2.txt', ...]
with open('path/to/output/file', 'w') as outfile:
    for fname in filenames:
        with open(fname) as infile:
            for line in infile:
                outfile.write(line)

작은 파일의 경우 :

filenames = ['file1.txt', 'file2.txt', ...]
with open('path/to/output/file', 'w') as outfile:
    for fname in filenames:
        with open(fname) as infile:
            outfile.write(infile.read())

… 그리고 내가 생각한 또 다른 흥미로운 것 :

filenames = ['file1.txt', 'file2.txt', ...]
with open('path/to/output/file', 'w') as outfile:
    for line in itertools.chain.from_iterable(itertools.imap(open, filnames)):
        outfile.write(line)

안타깝게도이 마지막 방법은 GC가 처리해야하는 열린 파일 디스크립터를 남겨 둡니다. 난 그냥 재미 있다고 생각

사용하십시오 shutil.copyfileobj.

그것은 당신을 위해 청크별로 입력 파일을 자동으로 읽습니다. 더 효율적이고 입력 파일을 읽는 것은 입력 파일 중 일부가 너무 커서 메모리에 맞지 않아도 작동합니다.

with open('output_file.txt','wb') as wfd:
    for f in ['seg1.txt','seg2.txt','seg3.txt']:
        with open(f,'rb') as fd:
            shutil.copyfileobj(fd, wfd)

즉 무엇을 정확히 fileinput 함수 입니다 :

import fileinput
with open(outfilename, 'w') as fout, fileinput.input(filenames) as fin:
    for line in fin:
        fout.write(line)

이 유스 케이스의 경우 파일을 수동으로 반복하는 것보다 훨씬 간단하지 않지만 다른 경우에는 단일 파일처럼 모든 파일을 반복하는 단일 반복자를 갖는 것이 매우 편리합니다. (또한 fileinput각 파일이 완료 되 자마자 닫히게 된다는 사실은 필요 with하거나 각 파일이 필요하지 않다는 것을 의미 close하지만 이는 한 번의 비용 절감 일 뿐이며 큰 거래는 아닙니다.)

fileinput각 줄을 필터링하는 것만으로 파일을 적절하게 수정하는 기능과 같은 다른 유용한 기능 이 있습니다.

코멘트에 언급, 다른에서 설명하고있는 바와 같이 게시 , fileinput표시 파이썬 2.7에 대해 작동하지 않습니다. 코드를 파이썬 2.7과 호환되도록 약간 수정했습니다.

with open('outfilename', 'w') as fout:
    fin = fileinput.input(filenames)
    for line in fin:
        fout.write(line)
    fin.close()

나는 우아함에 대해 모른다. 그러나 이것은 효과가있다.

    import glob
    import os
    for f in glob.glob("file*.txt"):
         os.system("cat "+f+" >> OutFile.txt")

UNIX 명령의 문제점은 무엇입니까? (Windows에서 작업하지 않는 경우) :

ls | xargs cat | tee output.txt 작업을 수행합니다 (원하는 경우 하위 프로세스로 파이썬에서 호출 할 수 있음)

File 객체의 .read () 메소드를 확인하십시오.

http://docs.python.org/2/tutorial/inputoutput.html#methods-of-file-objects

당신은 다음과 같은 것을 할 수 있습니다 :

concat = ""
for file in files:
    concat += open(file).read()

또는보다 '우아한'파이썬 방식 :

concat = ''.join([open(f).read() for f in files])

which, according to this article: http://www.skymind.com/~ocrow/python_string/ would also be the fastest.

An alternative to @inspectorG4dget answer (best answer to date 29-03-2016). I tested with 3 files of 436MB.

@inspectorG4dget solution: 162 seconds

The following solution : 125 seconds

from subprocess import Popen
filenames = ['file1.txt', 'file2.txt', 'file3.txt']
fbatch = open('batch.bat','w')
str ="type "
for f in filenames:
    str+= f + " "
fbatch.write(str + " > file4results.txt")
fbatch.close()
p = Popen("batch.bat", cwd=r"Drive:\Path\to\folder")
stdout, stderr = p.communicate()

The idea is to create a batch file and execute it, taking advantage of "old good technology". Its semi-python but works faster. Works for windows.

If you have a lot of files in the directory then glob2 might be a better option to generate a list of filenames rather than writing them by hand.

import glob2

filenames = glob2.glob('*.txt')  # list of all .txt files in the directory

with open('outfile.txt', 'w') as f:
    for file in filenames:
        with open(file) as infile:
            f.write(infile.read()+'\n')

outfile.write(infile.read()) # time: 2.1085190773010254s
shutil.copyfileobj(fd, wfd, 1024*1024*10) # time: 0.60599684715271s

A simple benchmark shows that the shutil performs better.

If the files are not gigantic:

with open('newfile.txt','wb') as newf:
    for filename in list_of_files:
        with open(filename,'rb') as hf:
            newf.write(hf.read())
            # newf.write('\n\n\n')   if you want to introduce
            # some blank lines between the contents of the copied files

If the files are too big to be entirely read and held in RAM, the algorithm must be a little different to read each file to be copied in a loop by chunks of fixed length, using read(10000) for example.

def concatFiles():
    path = 'input/'
    files = os.listdir(path)
    for idx, infile in enumerate(files):
        print ("File #" + str(idx) + "  " + infile)
    concat = ''.join([open(path + f).read() for f in files])
    with open("output_concatFile.txt", "w") as fo:
        fo.write(path + concat)

if __name__ == "__main__":
    concatFiles()

참고URL : https://stackoverflow.com/questions/13613336/python-concatenate-text-files

'Programing' 카테고리의 다른 글

일반적인 CSS 미디어 쿼리 중단 점 (0)	2020.06.14
공분산 및 공분산 실제 사례 (0)	2020.06.14
OSX에서 zsh에서 bash로 전환했다가 다시? (0)	2020.06.14
Linux-redis-cli 만 설치 (0)	2020.06.14
iOS 10.0 런타임 충돌의 NSCameraUsageDescription? (0)	2020.06.14

현재글파이썬은 텍스트 파일을 연결

복권의 역사, 로또 정보와 IT 기술 등을 다루는 블로그입니다.

뮤지컬, c++, Spring3, c#, 자바, spring, 무비순위, JQuery, 볼거리, 가족나들이, 행사, 연극, 축제, Javascript, 극장순위, 놀거리, java, 관광, 여행, 공연,

Today :
Yesterday :

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

lottogame

파이썬은 텍스트 파일을 연결

파이썬은 텍스트 파일을 연결

'Programing' 카테고리의 다른 글

'Programing'의 다른글

티스토리툴바

단축키

내 블로그

블로그 게시글

모든 영역

2025. 06
일	월	화	수	목	금	토
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

파이썬은 텍스트 파일을 연결

파이썬은 텍스트 파일을 연결

'Programing' 카테고리의 다른 글

'Programing'의 다른글

관련글

티스토리툴바

단축키

내 블로그

블로그 게시글

모든 영역