https://github.com/rycont/hanja-grade-dataset
[GitHub - rycont/hanja-grade-dataset: 한국어문회 등급별 선정한자 CSV 데이터셋
한국어문회 등급별 선정한자 CSV 데이터셋.
Contribute to rycont/hanja-grade-dataset development by creating an account on GitHub.
[github.com](https://github.com/rycont/hanja-grade-dataset)
csv로 누군가 정리해서 깃헙에 올려줌.
pandas로 데이터프레임으로 읽어서 불필요한 컬럼을 날리고, anki의 입력파일로 쓸 csv를 만듬
import numpy as np
import pandas as pd
import os
path = "./hanja-grade-dataset-main/by-level/"
file_list = os.listdir(path)
file_list_py = [file for file in file_list if file.endswith(".csv")]
def gen_csv(path):
# print(path)
df = pd.read_csv(path)
df2 = df[['hanja','meaning']].replace({'\[':''}, regex=True).replace({'\]':''}, regex=True).replace({'\'':''}, regex=True).copy()
print(path+".output.csv")
df2.to_csv(path+".output.csv",sep=';',header=False, index=False)
path = "./hanja-grade-dataset-main/by-level/"
for filename in file_list_py:
print(path+filename)
gen_csv(path+filename)
./hanja-grade-dataset-main/by-level/4급.csv
./hanja-grade-dataset-main/by-level/4급.csv.output.csv
./hanja-grade-dataset-main/by-level/3급Ⅱ.csv
./hanja-grade-dataset-main/by-level/3급Ⅱ.csv.output.csv
./hanja-grade-dataset-main/by-level/3급.csv
./hanja-grade-dataset-main/by-level/3급.csv.output.csv
./hanja-grade-dataset-main/by-level/5급Ⅱ.csv
./hanja-grade-dataset-main/by-level/5급Ⅱ.csv.output.csv
./hanja-grade-dataset-main/by-level/특급.csv
./hanja-grade-dataset-main/by-level/특급.csv.output.csv
./hanja-grade-dataset-main/by-level/8급.csv
./hanja-grade-dataset-main/by-level/8급.csv.output.csv
./hanja-grade-dataset-main/by-level/6급Ⅱ.csv
./hanja-grade-dataset-main/by-level/6급Ⅱ.csv.output.csv
./hanja-grade-dataset-main/by-level/5급.csv
./hanja-grade-dataset-main/by-level/5급.csv.output.csv
./hanja-grade-dataset-main/by-level/4급Ⅱ.csv
./hanja-grade-dataset-main/by-level/4급Ⅱ.csv.output.csv
./hanja-grade-dataset-main/by-level/6급.csv
./hanja-grade-dataset-main/by-level/6급.csv.output.csv
./hanja-grade-dataset-main/by-level/2급.csv
./hanja-grade-dataset-main/by-level/2급.csv.output.csv
./hanja-grade-dataset-main/by-level/7급Ⅱ.csv
./hanja-grade-dataset-main/by-level/7급Ⅱ.csv.output.csv
./hanja-grade-dataset-main/by-level/7급.csv
./hanja-grade-dataset-main/by-level/7급.csv.output.csv
./hanja-grade-dataset-main/by-level/1급.csv
./hanja-grade-dataset-main/by-level/1급.csv.output.csv
./hanja-grade-dataset-main/by-level/특급Ⅱ.csv
./hanja-grade-dataset-main/by-level/특급Ⅱ.csv.output.csv'읽고보고해봄' 카테고리의 다른 글
| Matter는 무엇인가? (0) | 2025.06.14 |
|---|---|
| TR067 nvim 소스 설치 (0) | 2025.04.10 |
| Tally Counter (안드로이드 앱) (0) | 2025.03.07 |
| TR065 Anki - Highlight Code (0) | 2025.03.07 |
| TR064 C++ 14 test code용 Makefile (0) | 2025.02.17 |