gimi9 Pandas Profiling

Dataset statistics

Number of variables	6
Number of observations	57
Missing cells	0
Missing cells (%)	0.0%
Duplicate rows	0
Duplicate rows (%)	0.0%
Total size in memory	2.8 KiB
Average record size in memory	50.3 B

Variable types

Text	6

Dataset

Description	국가철도공단에서 관리하는 전국 고속철도역사의 한글, 영문, 로마자, 일본어, 중국어(간체, 번체) 등의 정보를 제공합니다.
Author	국가철도공단
URL	https://www.data.go.kr/data/15096780/fileData.do

Alerts

역명(중국어 간체) has unique values Unique

Reproduction

Analysis started	2023-12-12 21:59:45.560681
Analysis finished	2023-12-12 21:59:46.197661
Duration	0.64 seconds
Software version	ydata-profiling vv4.5.1
Download configuration	config.json

역명
Text

Distinct	56
Distinct (%)	98.2%
Missing	0
Missing (%)	0.0%
Memory size	588.0 B

Length

Max length	7
Median length	2
Mean length	2.4035088
Min length	2

Characters and Unicode

Total characters	137
Distinct characters	74
Distinct categories	3 ?
Distinct scripts	2 ?
Distinct blocks	2 ?

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique	55 ?
Unique (%)	96.5%

Sample

1st row	김제
2nd row	광주송정
3rd row	공주
4th row	계룡
5th row	정읍

Value	Count	Frequency (%)
오송	2	3.5%
김제	1	1.8%
평창	1	1.8%
횡성	1	1.8%
창원	1	1.8%
진영	1	1.8%
마산	1	1.8%
순천	1	1.8%
구례구	1	1.8%
곡성	1	1.8%
Other values (46)	46	80.7%

Most occurring characters

Value	Count	Frequency (%)
산	9	6.6%
주	6	4.4%
구	5	3.6%
포	5	3.6%
진	4	2.9%
원	4	2.9%
동	4	2.9%
천	4	2.9%
대	4	2.9%
오	3	2.2%
Other values (64)	89	65.0%

Most occurring categories

Value	Count	Frequency (%)
Other Letter	135	98.5%
Close Punctuation	1	0.7%
Open Punctuation	1	0.7%

Most frequent character per category

Other Letter

Value	Count	Frequency (%)
산	9	6.7%
주	6	4.4%
구	5	3.7%
포	5	3.7%
진	4	3.0%
원	4	3.0%
동	4	3.0%
천	4	3.0%
대	4	3.0%
오	3	2.2%
Other values (62)	87	64.4%

Close Punctuation

Value	Count	Frequency (%)
)	1	100.0%

Open Punctuation

Value	Count	Frequency (%)
(	1	100.0%

Most occurring scripts

Value	Count	Frequency (%)
Hangul	135	98.5%
Common	2	1.5%

Most frequent character per script

Hangul

Value	Count	Frequency (%)
산	9	6.7%
주	6	4.4%
구	5	3.7%
포	5	3.7%
진	4	3.0%
원	4	3.0%
동	4	3.0%
천	4	3.0%
대	4	3.0%
오	3	2.2%
Other values (62)	87	64.4%

Common

Value	Count	Frequency (%)
)	1	50.0%
(	1	50.0%

Most occurring blocks

Value	Count	Frequency (%)
Hangul	135	98.5%
ASCII	2	1.5%

Most frequent character per block

Hangul

Value	Count	Frequency (%)
산	9	6.7%
주	6	4.4%
구	5	3.7%
포	5	3.7%
진	4	3.0%
원	4	3.0%
동	4	3.0%
천	4	3.0%
대	4	3.0%
오	3	2.2%
Other values (62)	87	64.4%

ASCII

Value	Count	Frequency (%)
)	1	50.0%
(	1	50.0%

역명(영문)
Text

Distinct	56
Distinct (%)	98.2%
Missing	0
Missing (%)	0.0%
Memory size	588.0 B

Length

Max length	16
Median length	13
Mean length	7.9298246
Min length	4

Characters and Unicode

Total characters	452
Distinct characters	42
Distinct categories	6 ?
Distinct scripts	2 ?
Distinct blocks	1 ?

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique	55 ?
Unique (%)	96.5%

Sample

1st row	Gimje
2nd row	Gwangjusongjeong
3rd row	Gongju
4th row	Gyeryong
5th row	Jeongeup

Value	Count	Frequency (%)
osong	2	3.5%
gimje	1	1.8%
pyeongchang	1	1.8%
hoengseong	1	1.8%
changwon	1	1.8%
jinyeong	1	1.8%
masan	1	1.8%
suncheon	1	1.8%
guryegu	1	1.8%
gokseong	1	1.8%
Other values (46)	46	80.7%

Most occurring characters

Value	Count	Frequency (%)
n	80	17.7%
o	52	11.5%
g	47	10.4%
e	41	9.1%
a	35	7.7%
u	24	5.3%
s	19	4.2%
j	14	3.1%
G	12	2.7%
i	12	2.7%
Other values (32)	116	25.7%

Most occurring categories

Value	Count	Frequency (%)
Lowercase Letter	385	85.2%
Uppercase Letter	60	13.3%
Open Punctuation	2	0.4%
Close Punctuation	2	0.4%
Dash Punctuation	2	0.4%
Space Separator	1	0.2%

Most frequent character per category

Lowercase Letter

Value	Count	Frequency (%)
n	80	20.8%
o	52	13.5%
g	47	12.2%
e	41	10.6%
a	35	9.1%
u	24	6.2%
s	19	4.9%
j	14	3.6%
i	12	3.1%
y	11	2.9%
Other values (12)	50	13.0%

Uppercase Letter

Value	Count	Frequency (%)
G	12	20.0%
J	8	13.3%
S	6	10.0%
D	5	8.3%
M	5	8.3%
Y	5	8.3%
C	4	6.7%
O	3	5.0%
N	3	5.0%
P	2	3.3%
Other values (6)	7	11.7%

Open Punctuation

Value	Count	Frequency (%)
(	2	100.0%

Close Punctuation

Value	Count	Frequency (%)
)	2	100.0%

Dash Punctuation

Value	Count	Frequency (%)
-	2	100.0%

Space Separator

Value	Count	Frequency (%)
	1	100.0%

Most occurring scripts

Value	Count	Frequency (%)
Latin	445	98.5%
Common	7	1.5%

Most frequent character per script

Latin

Value	Count	Frequency (%)
n	80	18.0%
o	52	11.7%
g	47	10.6%
e	41	9.2%
a	35	7.9%
u	24	5.4%
s	19	4.3%
j	14	3.1%
G	12	2.7%
i	12	2.7%
Other values (28)	109	24.5%

Common

Value	Count	Frequency (%)
(	2	28.6%
)	2	28.6%
-	2	28.6%
	1	14.3%

Most occurring blocks

Value	Count	Frequency (%)
ASCII	452	100.0%

Most frequent character per block

ASCII

Value	Count	Frequency (%)
n	80	17.7%
o	52	11.5%
g	47	10.4%
e	41	9.1%
a	35	7.7%
u	24	5.3%
s	19	4.2%
j	14	3.1%
G	12	2.7%
i	12	2.7%
Other values (32)	116	25.7%

역명(로마자)
Text

Distinct	56
Distinct (%)	98.2%
Missing	0
Missing (%)	0.0%
Memory size	588.0 B

Length

Max length	16
Median length	13
Mean length	7.9298246
Min length	4

Characters and Unicode

Total characters	452
Distinct characters	42
Distinct categories	6 ?
Distinct scripts	2 ?
Distinct blocks	1 ?

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique	55 ?
Unique (%)	96.5%

Sample

1st row	Gimje
2nd row	Gwangjusongjeong
3rd row	Gongju
4th row	Gyeryong
5th row	Jeongeup

Value	Count	Frequency (%)
osong	2	3.5%
gimje	1	1.8%
pyeongchang	1	1.8%
hoengseong	1	1.8%
changwon	1	1.8%
jinyeong	1	1.8%
masan	1	1.8%
suncheon	1	1.8%
guryegu	1	1.8%
gokseong	1	1.8%
Other values (46)	46	80.7%

Most occurring characters

Value	Count	Frequency (%)
n	80	17.7%
o	52	11.5%
g	47	10.4%
e	41	9.1%
a	35	7.7%
u	24	5.3%
s	19	4.2%
j	14	3.1%
G	12	2.7%
i	12	2.7%
Other values (32)	116	25.7%

Most occurring categories

Value	Count	Frequency (%)
Lowercase Letter	385	85.2%
Uppercase Letter	60	13.3%
Open Punctuation	2	0.4%
Close Punctuation	2	0.4%
Dash Punctuation	2	0.4%
Space Separator	1	0.2%

Most frequent character per category

Lowercase Letter

Value	Count	Frequency (%)
n	80	20.8%
o	52	13.5%
g	47	12.2%
e	41	10.6%
a	35	9.1%
u	24	6.2%
s	19	4.9%
j	14	3.6%
i	12	3.1%
y	11	2.9%
Other values (12)	50	13.0%

Uppercase Letter

Value	Count	Frequency (%)
G	12	20.0%
J	8	13.3%
S	6	10.0%
D	5	8.3%
M	5	8.3%
Y	5	8.3%
C	4	6.7%
O	3	5.0%
N	3	5.0%
P	2	3.3%
Other values (6)	7	11.7%

Open Punctuation

Value	Count	Frequency (%)
(	2	100.0%

Close Punctuation

Value	Count	Frequency (%)
)	2	100.0%

Dash Punctuation

Value	Count	Frequency (%)
-	2	100.0%

Space Separator

Value	Count	Frequency (%)
	1	100.0%

Most occurring scripts

Value	Count	Frequency (%)
Latin	445	98.5%
Common	7	1.5%

Most frequent character per script

Latin

Value	Count	Frequency (%)
n	80	18.0%
o	52	11.7%
g	47	10.6%
e	41	9.2%
a	35	7.9%
u	24	5.4%
s	19	4.3%
j	14	3.1%
G	12	2.7%
i	12	2.7%
Other values (28)	109	24.5%

Common

Value	Count	Frequency (%)
(	2	28.6%
)	2	28.6%
-	2	28.6%
	1	14.3%

Most occurring blocks

Value	Count	Frequency (%)
ASCII	452	100.0%

Most frequent character per block

ASCII

Value	Count	Frequency (%)
n	80	17.7%
o	52	11.5%
g	47	10.4%
e	41	9.1%
a	35	7.7%
u	24	5.3%
s	19	4.2%
j	14	3.1%
G	12	2.7%
i	12	2.7%
Other values (32)	116	25.7%

역명(일본어)
Text

Distinct	56
Distinct (%)	98.2%
Missing	0
Missing (%)	0.0%
Memory size	588.0 B

Length

Max length	12
Median length	11
Mean length	4.7192982
Min length	2

Characters and Unicode

Total characters	269
Distinct characters	57
Distinct categories	4 ?
Distinct scripts	3 ?
Distinct blocks	3 ?

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique	55 ?
Unique (%)	96.5%

Sample

1st row	キムジェ
2nd row	クァンジュソンジョン
3rd row	コンジュ
4th row	ケリョン
5th row	チョンウプ

Value	Count	Frequency (%)
オソン	2	3.4%
チャンウォン	2	3.4%
スソ	1	1.7%
チョンドンジン	1	1.7%
ジュンアン	1	1.7%
ピョンチャン	1	1.7%
フェンソン	1	1.7%
チニョン	1	1.7%
マサン	1	1.7%
スンチョン	1	1.7%
Other values (47)	47	79.7%

Most occurring characters

Value	Count	Frequency (%)
ン	73	27.1%
ョ	19	7.1%
チ	16	5.9%
ジ	14	5.2%
サ	9	3.3%
ソ	9	3.3%
ク	7	2.6%
ュ	7	2.6%
ウ	6	2.2%
ャ	6	2.2%
Other values (47)	103	38.3%

Most occurring categories

Value	Count	Frequency (%)
Other Letter	261	97.0%
Space Separator	4	1.5%
Close Punctuation	2	0.7%
Open Punctuation	2	0.7%

Most frequent character per category

Other Letter

Value	Count	Frequency (%)
ン	73	28.0%
ョ	19	7.3%
チ	16	6.1%
ジ	14	5.4%
サ	9	3.4%
ソ	9	3.4%
ク	7	2.7%
ュ	7	2.7%
ウ	6	2.3%
ャ	6	2.3%
Other values (44)	95	36.4%

Space Separator

Value	Count	Frequency (%)
	4	100.0%

Close Punctuation

Value	Count	Frequency (%)
)	2	100.0%

Open Punctuation

Value	Count	Frequency (%)
(	2	100.0%

Most occurring scripts

Value	Count	Frequency (%)
Katakana	259	96.3%
Common	8	3.0%
Han	2	0.7%

Most frequent character per script

Katakana

Value	Count	Frequency (%)
ン	73	28.2%
ョ	19	7.3%
チ	16	6.2%
ジ	14	5.4%
サ	9	3.5%
ソ	9	3.5%
ク	7	2.7%
ュ	7	2.7%
ウ	6	2.3%
ャ	6	2.3%
Other values (42)	93	35.9%

Common

Value	Count	Frequency (%)
	4	50.0%
)	2	25.0%
(	2	25.0%

Han

Value	Count	Frequency (%)
山	1	50.0%
釜	1	50.0%

Most occurring blocks

Value	Count	Frequency (%)
Katakana	259	96.3%
ASCII	8	3.0%
CJK	2	0.7%

Most frequent character per block

Katakana

Value	Count	Frequency (%)
ン	73	28.2%
ョ	19	7.3%
チ	16	6.2%
ジ	14	5.4%
サ	9	3.5%
ソ	9	3.5%
ク	7	2.7%
ュ	7	2.7%
ウ	6	2.3%
ャ	6	2.3%
Other values (42)	93	35.9%

ASCII

Value	Count	Frequency (%)
	4	50.0%
)	2	25.0%
(	2	25.0%

CJK

Value	Count	Frequency (%)
山	1	50.0%
釜	1	50.0%

역명(중국어 간체)
Text

UNIQUE

Distinct	57
Distinct (%)	100.0%
Missing	0
Missing (%)	0.0%
Memory size	588.0 B

Length

Max length	7
Median length	2
Mean length	2.4385965
Min length	2

Characters and Unicode

Total characters	139
Distinct characters	93
Distinct categories	4 ?
Distinct scripts	2 ?
Distinct blocks	3 ?

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique	57 ?
Unique (%)	100.0%

Sample

1st row	鸡龙
2nd row	公州
3rd row	光州松汀
4th row	金堤
5th row	罗州

Value	Count	Frequency (%)
鸡龙	1	1.8%
马山	1	1.8%
晋州	1	1.8%
昌原	1	1.8%
昌原中央	1	1.8%
谷城	1	1.8%
求礼口	1	1.8%
南原	1	1.8%
顺天	1	1.8%
丽水世博会	1	1.8%
Other values (47)	47	82.5%

Most occurring characters

Value	Count	Frequency (%)
山	9	6.5%
州	6	4.3%
原	4	2.9%
浦	4	2.9%
东	4	2.9%
水	3	2.2%
昌	3	2.2%
大	3	2.2%
城	3	2.2%
龟	2	1.4%
Other values (83)	98	70.5%

Most occurring categories

Value	Count	Frequency (%)
Other Letter	133	95.7%
Open Punctuation	2	1.4%
Close Punctuation	2	1.4%
Other Punctuation	2	1.4%

Most frequent character per category

Other Letter

Value	Count	Frequency (%)
山	9	6.8%
州	6	4.5%
原	4	3.0%
浦	4	3.0%
东	4	3.0%
水	3	2.3%
昌	3	2.3%
大	3	2.3%
城	3	2.3%
龟	2	1.5%
Other values (80)	92	69.2%

Open Punctuation

Value	Count	Frequency (%)
(	2	100.0%

Close Punctuation

Value	Count	Frequency (%)
)	2	100.0%

Other Punctuation

Value	Count	Frequency (%)
?	2	100.0%

Most occurring scripts

Value	Count	Frequency (%)
Han	133	95.7%
Common	6	4.3%

Most frequent character per script

Han

Value	Count	Frequency (%)
山	9	6.8%
州	6	4.5%
原	4	3.0%
浦	4	3.0%
东	4	3.0%
水	3	2.3%
昌	3	2.3%
大	3	2.3%
城	3	2.3%
龟	2	1.5%
Other values (80)	92	69.2%

Common

Value	Count	Frequency (%)
(	2	33.3%
)	2	33.3%
?	2	33.3%

Most occurring blocks

Value	Count	Frequency (%)
CJK	131	94.2%
ASCII	6	4.3%
CJK Compat Ideographs	2	1.4%

Most frequent character per block

CJK

Value	Count	Frequency (%)
山	9	6.9%
州	6	4.6%
原	4	3.1%
浦	4	3.1%
东	4	3.1%
水	3	2.3%
昌	3	2.3%
大	3	2.3%
城	3	2.3%
龟	2	1.5%
Other values (79)	90	68.7%

ASCII

Value	Count	Frequency (%)
(	2	33.3%
)	2	33.3%
?	2	33.3%

CJK Compat Ideographs

Value	Count	Frequency (%)
金	2	100.0%

역명(중국어 번체)
Text

Distinct	56
Distinct (%)	98.2%
Missing	0
Missing (%)	0.0%
Memory size	588.0 B

Length

Max length	10
Median length	2
Mean length	2.5438596
Min length	2

Characters and Unicode

Total characters	145
Distinct characters	94
Distinct categories	3 ?
Distinct scripts	3 ?
Distinct blocks	4 ?

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique	55 ?
Unique (%)	96.5%

Sample

1st row	金堤
2nd row	光州松汀
3rd row	公州
4th row	鷄龍
5th row	井邑

Value	Count	Frequency (%)
五松	2	3.5%
金堤	1	1.8%
平昌	1	1.8%
橫城	1	1.8%
昌原	1	1.8%
進永	1	1.8%
馬山	1	1.8%
順天	1	1.8%
求禮口	1	1.8%
谷城	1	1.8%
Other values (46)	46	80.7%

Most occurring characters

Value	Count	Frequency (%)
山	9	6.2%
州	6	4.1%
浦	4	2.8%
原	4	2.8%
東	4	2.8%
五	3	2.1%
(	3	2.1%
昌	3	2.1%
松	3	2.1%
水	3	2.1%
Other values (84)	103	71.0%

Most occurring categories

Value	Count	Frequency (%)
Other Letter	139	95.9%
Open Punctuation	3	2.1%
Close Punctuation	3	2.1%

Most frequent character per category

Other Letter

Value	Count	Frequency (%)
山	9	6.5%
州	6	4.3%
浦	4	2.9%
原	4	2.9%
東	4	2.9%
五	3	2.2%
昌	3	2.2%
松	3	2.2%
水	3	2.2%
城	3	2.2%
Other values (82)	97	69.8%

Open Punctuation

Value	Count	Frequency (%)
(	3	100.0%

Close Punctuation

Value	Count	Frequency (%)
)	3	100.0%

Most occurring scripts

Value	Count	Frequency (%)
Han	136	93.8%
Common	6	4.1%
Hangul	3	2.1%

Most frequent character per script

Han

Value	Count	Frequency (%)
山	9	6.6%
州	6	4.4%
浦	4	2.9%
原	4	2.9%
東	4	2.9%
五	3	2.2%
昌	3	2.2%
松	3	2.2%
水	3	2.2%
城	3	2.2%
Other values (79)	94	69.1%

Hangul

Value	Count	Frequency (%)
포	1	33.3%
스	1	33.3%
엑	1	33.3%

Common

Value	Count	Frequency (%)
(	3	50.0%
)	3	50.0%

Most occurring blocks

Value	Count	Frequency (%)
CJK	129	89.0%
CJK Compat Ideographs	7	4.8%
ASCII	6	4.1%
Hangul	3	2.1%

Most frequent character per block

CJK

Value	Count	Frequency (%)
山	9	7.0%
州	6	4.7%
浦	4	3.1%
原	4	3.1%
東	4	3.1%
五	3	2.3%
昌	3	2.3%
松	3	2.3%
水	3	2.3%
城	3	2.3%
Other values (74)	87	67.4%

ASCII

Value	Count	Frequency (%)
(	3	50.0%
)	3	50.0%

CJK Compat Ideographs

Value	Count	Frequency (%)
麗	2	28.6%
金	2	28.6%
論	1	14.3%
羅	1	14.3%
龍	1	14.3%

Hangul

Value	Count	Frequency (%)
포	1	33.3%
스	1	33.3%
엑	1	33.3%

Phik (φk)

Heatmap
Table

	역명	역명(영문)	역명(로마자)	역명(일본어)	역명(중국어 간체)	역명(중국어 번체)
역명	1.000	1.000	1.000	1.000	1.000	1.000
역명(영문)	1.000	1.000	1.000	1.000	1.000	1.000
역명(로마자)	1.000	1.000	1.000	1.000	1.000	1.000
역명(일본어)	1.000	1.000	1.000	1.000	1.000	1.000
역명(중국어 간체)	1.000	1.000	1.000	1.000	1.000	1.000
역명(중국어 번체)	1.000	1.000	1.000	1.000	1.000	1.000

Count
Matrix

A simple visualization of nullity by column.

Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

First rows
Last rows

	역명	역명(영문)	역명(로마자)	역명(일본어)	역명(중국어 간체)	역명(중국어 번체)
0	김제	Gimje	Gimje	キムジェ	鸡龙	金堤
1	광주송정	Gwangjusongjeong	Gwangjusongjeong	クァンジュソンジョン	公州	光州松汀
2	공주	Gongju	Gongju	コンジュ	光州松汀	公州
3	계룡	Gyeryong	Gyeryong	ケリョン	金堤	鷄龍
4	정읍	Jeongeup	Jeongeup	チョンウプ	罗州	井邑
5	나주	Naju	Naju	ナジュ	论山	羅州
6	익산	Iksan	Iksan	イクサン	木浦	益山
7	오송	Osong	Osong	オソン	西大田	五松
8	서대전	Seodaejeon	Seodaejeon	ソデジョン	益山	西大田
9	목포	Mokpo	Mokpo	モクポ	长城	木浦

	역명	역명(영문)	역명(로마자)	역명(일본어)	역명(중국어 간체)	역명(중국어 번체)
47	청량리	Cheongnyangni	Cheongnyangni	チョンニャンニ	万钟	淸凉里
48	강릉	Gangneung	Gangneung	カンヌン	墨湖	江陵
49	동해	Donghae	Donghae	トンヘ	正东津	東海
50	둔내	Dunnae	Dunnae	トゥンネ	珍富(五台山)	屯內
51	만종	Manjong	Manjong	マンジョン	平昌	萬鍾
52	묵호	Mukho	Mukho	ムコ	横城	墨湖
53	정동진	Jeongdongjin	Jeongdongjin	チョンドンジン	水西	正東津
54	수서	Suseo	Suseo	スソ	芝制	水西
55	지제	Jije	Jije	チジェ	东滩	芝制
56	동탄	Dongtan	Dongtan	トンタン	??	東灘

Overview

Variables

Most occurring characters

Most occurring categories

Most frequent character per category

Other Letter

Close Punctuation

Open Punctuation

Most occurring scripts

Most frequent character per script

Hangul

Common

Most occurring blocks

Most frequent character per block

Hangul

ASCII

Most occurring characters

Most occurring categories

Most frequent character per category

Lowercase Letter

Uppercase Letter

Open Punctuation

Close Punctuation

Dash Punctuation

Space Separator

Most occurring scripts

Most frequent character per script

Latin

Common

Most occurring blocks

Most frequent character per block

ASCII

Most occurring characters

Most occurring categories

Most frequent character per category

Lowercase Letter

Uppercase Letter

Open Punctuation

Close Punctuation

Dash Punctuation

Space Separator

Most occurring scripts

Most frequent character per script

Latin

Common

Most occurring blocks

Most frequent character per block

ASCII

Most occurring characters

Most occurring categories

Most frequent character per category

Other Letter

Space Separator

Close Punctuation

Open Punctuation

Most occurring scripts

Most frequent character per script

Katakana

Common

Han

Most occurring blocks

Most frequent character per block

Katakana

ASCII

CJK

Most occurring characters

Most occurring categories

Most frequent character per category

Other Letter

Open Punctuation

Close Punctuation

Other Punctuation

Most occurring scripts

Most frequent character per script

Han

Common

Most occurring blocks

Most frequent character per block

CJK

ASCII