Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 57 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.0 KiB |
Average record size in memory | 35.3 B |
Variable types
Categorical | 1 |
---|---|
Text | 3 |
Dataset
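Description | Station names for Incheon Subway Line 1 and Line 2, operated by Incheon Transit Corporation (인천교통공사), given in Korean, Hanja (Chinese characters), and English. The fields are line (호선), station name (역사명), Hanja (한자), and English name (영문명). |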
---|---|
URL | https://www.data.go.kr/data/15043808/fileData.do |
한 글 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 14:32:42.367252 |
---|---|
Analysis finished | 2023-12-12 14:32:42.827637 |
Duration | 0.46 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
호선
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 588.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
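The report lists Distinct and Unique separately, which is easy to misread. A minimal sketch of the difference, assuming Distinct counts different values and Unique counts values that occur exactly once; the helper name `distinct_and_unique` and the reconstructed column are illustrative, not part of the report:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique) for a column.

    distinct: how many different values appear.
    unique:   how many values appear exactly once.
    """
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Stand-in for the 호선 column, rebuilt from the reported counts
# (30 rows of "1", 27 rows of "2"); storing them as strings is an assumption.
line_column = ["1"] * 30 + ["2"] * 27

distinct, unique = distinct_and_unique(line_column)
distinct_pct = round(100 * distinct / len(line_column), 1)
```

Both values repeat many times, so none of them is unique even though two are distinct.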
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 30 | 52.6% |
2 | 27 | 47.4% |
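The frequency column of a table like this one follows directly from the counts. A small sketch of that arithmetic, assuming each share is the count divided by the number of observations and rounded to one decimal; `value_frequencies` is a hypothetical helper, not a ydata-profiling function:

```python
from collections import Counter

def value_frequencies(values):
    """Return (value, count, percent) rows, most common value first."""
    counts = Counter(values)
    total = len(values)
    return [
        (value, count, round(100 * count / total, 1))
        for value, count in counts.most_common()
    ]

# Stand-in for the 호선 column, rebuilt from the counts reported above.
rows = value_frequencies(["1"] * 30 + ["2"] * 27)
for value, count, pct in rows:
    print(f"{value} | {count} | {pct}% |")
```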
한 글
Text
UNIQUE
Distinct | 57 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 588.0 B |
Value | Count | Frequency (%) |
인천시청 | 2 | 3.5% |
계양 | 1 | 1.8% |
국제업무지구 | 1 | 1.8% |
왕길 | 1 | 1.8% |
검단사거리 | 1 | 1.8% |
마전 | 1 | 1.8% |
완정 | 1 | 1.8% |
독정 | 1 | 1.8% |
검암 | 1 | 1.8% |
검바위 | 1 | 1.8% |
Other values (46) | 46 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 54 | 21.0% |
인 | 8 | 3.1% |
시 | 8 | 3.1% |
천 | 7 | 2.7% |
구 | 6 | 2.3% |
장 | 6 | 2.3% |
청 | 5 | 1.9% |
가 | 5 | 1.9% |
부 | 5 | 1.9% |
정 | 5 | 1.9% |
Other values (90) | 148 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 203 | 79.0% |
Space Separator | 54 | 21.0% |
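The category names in this table match Unicode general categories. A rough sketch of how such counts could be reproduced with Python's standard `unicodedata` module; the mapping table, the helper name, and the sample strings are assumptions for illustration, and the report's own implementation may differ:

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Pf": "Final Punctuation",
    "Nd": "Decimal Number",
}

def category_counts(strings):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in strings:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# Hypothetical sample values; the real column has 57 entries.
sample = ["인천 시청", "계양"]
counts = category_counts(sample)
```

Hangul syllables fall under "Other Letter" because Hangul has no upper or lower case.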
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
인 | 8 | 3.9% |
시 | 8 | 3.9% |
천 | 7 | 3.4% |
구 | 6 | 3.0% |
장 | 6 | 3.0% |
청 | 5 | 2.5% |
가 | 5 | 2.5% |
부 | 5 | 2.5% |
정 | 5 | 2.5% |
리 | 4 | 2.0% |
Other values (89) | 144 |
Space Separator
Value | Count | Frequency (%) |
(space) | 54 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 203 | 79.0% |
Common | 54 | 21.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
인 | 8 | 3.9% |
시 | 8 | 3.9% |
천 | 7 | 3.4% |
구 | 6 | 3.0% |
장 | 6 | 3.0% |
청 | 5 | 2.5% |
가 | 5 | 2.5% |
부 | 5 | 2.5% |
정 | 5 | 2.5% |
리 | 4 | 2.0% |
Other values (89) | 144 |
Common
Value | Count | Frequency (%) |
(space) | 54 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 203 | 79.0% |
ASCII | 54 | 21.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 54 | 100.0% |
Hangul
Value | Count | Frequency (%) |
인 | 8 | 3.9% |
시 | 8 | 3.9% |
천 | 7 | 3.4% |
구 | 6 | 3.0% |
장 | 6 | 3.0% |
청 | 5 | 2.5% |
가 | 5 | 2.5% |
부 | 5 | 2.5% |
정 | 5 | 2.5% |
리 | 4 | 2.0% |
Other values (89) | 144 |
漢 字
Text
Distinct | 51 |
---|---|
Distinct (%) | 89.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 588.0 B |
Value | Count | Frequency (%) |
仁川市廳 | 2 | 3.5% |
知識情報團地 | 1 | 1.8% |
거북市場 | 1 | 1.8% |
黔丹四거리 | 1 | 1.8% |
麻田 | 1 | 1.8% |
完井 | 1 | 1.8% |
篤亭 | 1 | 1.8% |
黔岩 | 1 | 1.8% |
아시아드競技場 | 1 | 1.8% |
公村四거리 | 1 | 1.8% |
Other values (46) | 46 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 12 | 5.1% |
仁 | 9 | 3.8% |
市 | 8 | 3.4% |
場 | 7 | 3.0% |
川 | 7 | 3.0% |
( | 7 | 3.0% |
) | 6 | 2.6% |
리 | 6 | 2.6% |
거 | 6 | 2.6% |
廳 | 5 | 2.1% |
Other values (105) | 162 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 209 | 88.9% |
Space Separator | 12 | 5.1% |
Open Punctuation | 7 | 3.0% |
Close Punctuation | 6 | 2.6% |
Uppercase Letter | 1 | 0.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
仁 | 9 | 4.3% |
市 | 8 | 3.8% |
場 | 7 | 3.3% |
川 | 7 | 3.3% |
리 | 6 | 2.9% |
거 | 6 | 2.9% |
廳 | 5 | 2.4% |
黔 | 4 | 1.9% |
區 | 4 | 1.9% |
地 | 4 | 1.9% |
Other values (101) | 149 |
Space Separator
Value | Count | Frequency (%) |
(space) | 12 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 7 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 6 | 100.0% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Han | 182 | 77.4% |
Hangul | 27 | 11.5% |
Common | 25 | 10.6% |
Latin | 1 | 0.4% |
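The script counts show that the Hanja column mixes Han with Hangul: names such as 黔丹四거리 keep native Korean words in Hangul. A heuristic sketch for flagging such mixed-script values follows. Python's standard library does not expose the Unicode Script property, so this guesses the script from the character name, which is only an approximation; `guess_script` and `scripts_in` are illustrative names:

```python
import unicodedata

def guess_script(ch):
    """Guess a script from the Unicode character name (heuristic only)."""
    name = unicodedata.name(ch, "")
    if name.startswith("CJK"):
        return "Han"
    if name.startswith("HANGUL"):
        return "Hangul"
    if name.startswith("LATIN"):
        return "Latin"
    # Spaces, parentheses, digits and similar characters.
    return "Common"

def scripts_in(text):
    """Return the set of guessed scripts used in a string."""
    return {guess_script(ch) for ch in text}

# Values taken from the word table of this column.
values = ["黔丹四거리", "麻田", "거북市場"]
mixed = [v for v in values if {"Han", "Hangul"} <= scripts_in(v)]
```

A check like this could help decide whether the Hangul portions are intentional or leftovers from an incomplete Hanja conversion.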
Most frequent character per script
Han
Value | Count | Frequency (%) |
仁 | 9 | 4.9% |
市 | 8 | 4.4% |
場 | 7 | 3.8% |
川 | 7 | 3.8% |
廳 | 5 | 2.7% |
黔 | 4 | 2.2% |
區 | 4 | 2.2% |
地 | 4 | 2.2% |
平 | 4 | 2.2% |
富 | 4 | 2.2% |
Other values (85) | 126 |
Hangul
Value | Count | Frequency (%) |
리 | 6 | 22.2% |
거 | 6 | 22.2% |
아 | 2 | 7.4% |
바 | 1 | 3.7% |
드 | 1 | 3.7% |
시 | 1 | 3.7% |
모 | 1 | 3.7% |
래 | 1 | 3.7% |
위 | 1 | 3.7% |
석 | 1 | 3.7% |
Other values (6) | 6 |
Common
Value | Count | Frequency (%) |
(space) | 12 | 48.0% |
( | 7 | 28.0% |
) | 6 | 24.0% |
Latin
Value | Count | Frequency (%) |
J | 1 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
CJK | 181 | 77.0% |
Hangul | 27 | 11.5% |
ASCII | 26 | 11.1% |
CJK Compat Ideographs | 1 | 0.4% |
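A single character from the CJK Compatibility Ideographs block usually means one Hanja was typed as a compatibility code point, which looks identical to its unified form but will not match it in string comparisons. A minimal sketch of detecting and normalizing such characters, assuming canonical normalization (NFC) is acceptable for this data; the code point U+F9F4 is a stand-in chosen for illustration, since the report shows only the glyph:

```python
import unicodedata

def is_compat_ideograph(ch):
    """True if ch lies in the CJK Compatibility Ideographs block (U+F900..U+FAFF)."""
    return 0xF900 <= ord(ch) <= 0xFAFF

def normalize_hanja(text):
    """Apply NFC, which maps canonically decomposable compatibility
    ideographs to their unified counterparts."""
    return unicodedata.normalize("NFC", text)

compat = "\uF9F4"  # assumed example of a compatibility ideograph
unified = normalize_hanja(compat)

print(f"U+{ord(compat):04X} -> U+{ord(unified):04X}")
```

A few code points in that block have no decomposition and are left unchanged by normalization, so the block test alone does not guarantee a replacement.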
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 12 | 46.2% |
( | 7 | 26.9% |
) | 6 | 23.1% |
J | 1 | 3.8% |
CJK
Value | Count | Frequency (%) |
仁 | 9 | 5.0% |
市 | 8 | 4.4% |
場 | 7 | 3.9% |
川 | 7 | 3.9% |
廳 | 5 | 2.8% |
黔 | 4 | 2.2% |
區 | 4 | 2.2% |
地 | 4 | 2.2% |
平 | 4 | 2.2% |
富 | 4 | 2.2% |
Other values (84) | 125 |
Hangul
Value | Count | Frequency (%) |
리 | 6 | 22.2% |
거 | 6 | 22.2% |
아 | 2 | 7.4% |
바 | 1 | 3.7% |
드 | 1 | 3.7% |
시 | 1 | 3.7% |
모 | 1 | 3.7% |
래 | 1 | 3.7% |
위 | 1 | 3.7% |
석 | 1 | 3.7% |
Other values (6) | 6 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
林 | 1 |
로마字
Text
Distinct | 56 |
---|---|
Distinct (%) | 98.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 588.0 B |
Value | Count | Frequency (%) |
incheon | 7 | 6.0% |
market | 5 | 4.3% |
office | 3 | 2.6% |
sageori | 3 | 2.6% |
geomdan | 3 | 2.6% |
city | 3 | 2.6% |
complex | 3 | 2.6% |
park | 3 | 2.6% |
gajeong | 2 | 1.7% |
univ | 2 | 1.7% |
Other values (74) | 82 |
Most occurring characters
Value | Count | Frequency (%) |
n | 83 | 10.0% |
e | 73 | 8.8% |
a | 67 | 8.1% |
o | 65 | 7.9% |
(space) | 59 | 7.1% |
i | 33 | 4.0% |
u | 32 | 3.9% |
g | 30 | 3.6% |
r | 29 | 3.5% |
t | 29 | 3.5% |
Other values (44) | 326 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 626 | 75.8% |
Uppercase Letter | 116 | 14.0% |
Space Separator | 59 | 7.1% |
Close Punctuation | 7 | 0.8% |
Open Punctuation | 7 | 0.8% |
Other Punctuation | 5 | 0.6% |
Dash Punctuation | 4 | 0.5% |
Decimal Number | 1 | 0.1% |
Final Punctuation | 1 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
n | 83 | 13.3% |
e | 73 | 11.7% |
a | 67 | 10.7% |
o | 65 | 10.4% |
i | 33 | 5.3% |
u | 32 | 5.1% |
g | 30 | 4.8% |
r | 29 | 4.6% |
t | 29 | 4.6% |
l | 23 | 3.7% |
Other values (15) | 162 |
Uppercase Letter
Value | Count | Frequency (%) |
G | 18 | 15.5% |
C | 15 | 12.9% |
S | 12 | 10.3% |
I | 12 | 10.3% |
M | 9 | 7.8% |
B | 8 | 6.9% |
J | 5 | 4.3% |
D | 5 | 4.3% |
W | 5 | 4.3% |
O | 4 | 3.4% |
Other values (10) | 23 |
Other Punctuation
Value | Count | Frequency (%) |
' | 3 | 60.0% |
& | 1 | 20.0% |
. | 1 | 20.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 59 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 7 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 7 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 | 100.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1 | 100.0% |
Final Punctuation
Value | Count | Frequency (%) |
’ | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 742 | 89.8% |
Common | 84 | 10.2% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
n | 83 | 11.2% |
e | 73 | 9.8% |
a | 67 | 9.0% |
o | 65 | 8.8% |
i | 33 | 4.4% |
u | 32 | 4.3% |
g | 30 | 4.0% |
r | 29 | 3.9% |
t | 29 | 3.9% |
l | 23 | 3.1% |
Other values (35) | 278 |
Common
Value | Count | Frequency (%) |
(space) | 59 | 70.2% |
) | 7 | 8.3% |
( | 7 | 8.3% |
- | 4 | 4.8% |
' | 3 | 3.6% |
& | 1 | 1.2% |
. | 1 | 1.2% |
1 | 1 | 1.2% |
’ | 1 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 825 | 99.9% |
Punctuation | 1 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
n | 83 | 10.1% |
e | 73 | 8.8% |
a | 67 | 8.1% |
o | 65 | 7.9% |
(space) | 59 | 7.2% |
i | 33 | 4.0% |
u | 32 | 3.9% |
g | 30 | 3.6% |
r | 29 | 3.5% |
t | 29 | 3.5% |
Other values (43) | 325 |
Punctuation
Value | Count | Frequency (%) |
’ | 1 | 100.0% |
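The lone non-ASCII character in the romanized names is a typographic apostrophe (’), while three other apostrophes are plain ASCII ('). A short sketch for locating and normalizing such characters, assuming the goal is ASCII-only romanization; the sample name is hypothetical and both helper names are illustrative:

```python
import unicodedata

def non_ascii_chars(text):
    """List (character, Unicode name) pairs for every non-ASCII character."""
    return [(ch, unicodedata.name(ch, "UNKNOWN")) for ch in text if ord(ch) > 127]

def ascii_apostrophes(text):
    """Replace the right single quotation mark (U+2019) with an ASCII apostrophe."""
    return text.replace("\u2019", "'")

name = "Citizens\u2019 Park"  # hypothetical value, not taken from the dataset
found = non_ascii_chars(name)
cleaned = ascii_apostrophes(name)
```

Running `non_ascii_chars` over the whole column would show which station name carries the curly apostrophe before any replacement is applied.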
Variable | 호선 | 한 글 | 漢 字 | 로마字 |
---|---|---|---|---|
호선 | 1.000 | 1.000 | 0.000 | 0.000 |
한 글 | 1.000 | 1.000 | 1.000 | 1.000 |
漢 字 | 0.000 | 1.000 | 1.000 | 1.000 |
로마字 | 0.000 | 1.000 | 1.000 | 1.000 |
Index | 호선 | 한 글 | 漢 字 | 로마字 |
---|---|---|---|---|
0 | 1 | 계양 | 桂陽 | Gyeyang |
1 | 1 | 귤현 | 橘峴 | Gyulhyeon |
2 | 1 | 박촌 | 朴村 | Bakchon |
3 | 1 | 임학 | 林鶴 | Imhak |
4 | 1 | 계산 | 桂山 | Gyesan |
5 | 1 | 경인교대 | 京仁敎大入口 | Gyeong-in Nat'l |
6 | 1 | 입구 | Univ. of Education | |
7 | 1 | 작전 | 鵲田 | Jakjeon |
8 | 1 | 갈산 | 葛山 | Galsan |
9 | 1 | 부평구청 | 富平區廳 | Bupyeong-gu Office |
Index | 호선 | 한 글 | 漢 字 | 로마字 |
---|---|---|---|---|
47 | 2 | 주안 | 朱安 | Juan |
48 | 2 | 시민공원 | 市民公園 (文化創作地帶) | Citizens Park (Culture Creation Zone) |
49 | 2 | 석바위시장 | 석바위市場 | Seokbawi Market |
50 | 2 | 인천시청 | 仁川市廳 | Incheon City Hall |
51 | 2 | 석천사거리 | 石泉四거리 | Seokcheon Sageori |
52 | 2 | 모래내시장 | 모래내市場 | Moraenae Market |
53 | 2 | 만수 | 萬壽 | Mansu |
54 | 2 | 남동구청 | 南洞區廳 | Namdong-gu Office |
55 | 2 | 인천대공원 | 仁川大公園 | Incheon Grand Park |
56 | 2 | 운연 | 云宴 (西昌) | Unyeon (Seochang) |