Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 300 |
Missing cells | 1 |
Missing cells (%) | 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 12.1 KiB |
Average record size in memory | 41.4 B |
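The percentage and per-record figures in this table follow from simple ratios. A minimal sketch, assuming the missing-cell percentage is missing cells over total cells (variables × observations) and the average record size is total memory over observations; the variable names are illustrative:

```python
# Figures copied from the Dataset statistics table.
n_variables = 5
n_observations = 300
missing_cells = 1
total_size_kib = 12.1  # already rounded in the report

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Bytes per record, recomputed from the rounded KiB total.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Recomputing from the rounded 12.1 KiB gives roughly 41.3 B, slightly below the reported 41.4 B, presumably because the report divides the unrounded byte count.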
Variable types
Numeric | 1 |
---|---|
Text | 4 |
Dataset
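Description | Hanja and English renderings of Korean railway station names. This dataset provides the following fields: number, Korean, English, Hanja (traditional characters), and address. |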
---|---|
Author | 한국철도공사 (Korea Railroad Corporation) |
URL | https://www.data.go.kr/data/15042115/fileData.do |
Reproduction
Analysis started | 2023-12-12 19:32:06.969838 |
---|---|
Analysis finished | 2023-12-12 19:32:07.810352 |
Duration | 0.84 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
번호
Real number (ℝ)
UNIQUE
 
Distinct | 300 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 150.5 |
Minimum | 1 |
---|---|
Maximum | 300 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.8 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 15.95 |
Q1 | 75.75 |
median | 150.5 |
Q3 | 225.25 |
95-th percentile | 285.05 |
Maximum | 300 |
Range | 299 |
Interquartile range (IQR) | 149.5 |
Descriptive statistics
Standard deviation | 86.746758 |
---|---|
Coefficient of variation (CV) | 0.57639042 |
Kurtosis | -1.2 |
Mean | 150.5 |
Median Absolute Deviation (MAD) | 75 |
Skewness | 0 |
Sum | 45150 |
Variance | 7525 |
Monotonicity | Strictly increasing |
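The quantile and descriptive statistics above can be cross-checked with the standard library alone. A minimal sketch, assuming 번호 holds exactly the integers 1 to 300 (consistent with the reported minimum, maximum, distinct count, and strict monotonicity) and that the report uses the sample variance and linear-interpolation quantiles:

```python
import statistics

# Assumption: the column is the sequence 1..300.
values = [float(i) for i in range(1, 301)]

total = sum(values)
mean = statistics.fmean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)
cv = std_dev / mean                     # coefficient of variation

# Linear-interpolation quantiles over the closed range [min, max].
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
ventiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]
iqr = q3 - q1

# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std_dev, 6), round(cv, 8))
print(p5, q1, median, q3, p95, iqr, mad)
```

Under these assumptions the values agree with the tables: the sample variance of 1..n is n(n+1)/12, which is 7525 for n = 300.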
Value | Count | Frequency (%) |
1 | 1 | 0.3% |
208 | 1 | 0.3% |
206 | 1 | 0.3% |
205 | 1 | 0.3% |
204 | 1 | 0.3% |
203 | 1 | 0.3% |
202 | 1 | 0.3% |
201 | 1 | 0.3% |
200 | 1 | 0.3% |
199 | 1 | 0.3% |
Other values (290) | 290 |
Value | Count | Frequency (%) |
1 | 1 | 0.3% |
2 | 1 | 0.3% |
3 | 1 | 0.3% |
4 | 1 | 0.3% |
5 | 1 | 0.3% |
6 | 1 | 0.3% |
7 | 1 | 0.3% |
8 | 1 | 0.3% |
9 | 1 | 0.3% |
10 | 1 | 0.3% |
Value | Count | Frequency (%) |
300 | 1 | 0.3% |
299 | 1 | 0.3% |
298 | 1 | 0.3% |
297 | 1 | 0.3% |
296 | 1 | 0.3% |
295 | 1 | 0.3% |
294 | 1 | 0.3% |
293 | 1 | 0.3% |
292 | 1 | 0.3% |
291 | 1 | 0.3% |
역명
Text
UNIQUE
 
Distinct | 300 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
Value | Count | Frequency (%) |
가남 | 1 | 0.3% |
원주 | 1 | 0.3% |
원동 | 1 | 0.3% |
웅천 | 1 | 0.3% |
울산 | 1 | 0.3% |
우보 | 1 | 0.3% |
용산 | 1 | 0.3% |
용동 | 1 | 0.3% |
용궁 | 1 | 0.3% |
왜관 | 1 | 0.3% |
Other values (290) | 290 |
Most occurring characters
Value | Count | Frequency (%) |
천 | 31 | 4.5% |
산 | 23 | 3.4% |
주 | 20 | 2.9% |
원 | 19 | 2.8% |
성 | 17 | 2.5% |
동 | 14 | 2.0% |
신 | 14 | 2.0% |
리 | 13 | 1.9% |
대 | 12 | 1.8% |
구 | 11 | 1.6% |
Other values (185) | 510 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 680 | |
Close Punctuation | 2 | 0.3% |
Open Punctuation | 2 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
천 | 31 | 4.6% |
산 | 23 | 3.4% |
주 | 20 | 2.9% |
원 | 19 | 2.8% |
성 | 17 | 2.5% |
동 | 14 | 2.1% |
신 | 14 | 2.1% |
리 | 13 | 1.9% |
대 | 12 | 1.8% |
구 | 11 | 1.6% |
Other values (183) | 506 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 680 | |
Common | 4 | 0.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
천 | 31 | 4.6% |
산 | 23 | 3.4% |
주 | 20 | 2.9% |
원 | 19 | 2.8% |
성 | 17 | 2.5% |
동 | 14 | 2.1% |
신 | 14 | 2.1% |
리 | 13 | 1.9% |
대 | 12 | 1.8% |
구 | 11 | 1.6% |
Other values (183) | 506 |
Common
Value | Count | Frequency (%) |
) | 2 | |
( | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 680 | |
ASCII | 4 | 0.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
천 | 31 | 4.6% |
산 | 23 | 3.4% |
주 | 20 | 2.9% |
원 | 19 | 2.8% |
성 | 17 | 2.5% |
동 | 14 | 2.1% |
신 | 14 | 2.1% |
리 | 13 | 1.9% |
대 | 12 | 1.8% |
구 | 11 | 1.6% |
Other values (183) | 506 |
ASCII
Value | Count | Frequency (%) |
) | 2 | |
( | 2 |
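The character and category tallies for 역명 can be approximated with `collections.Counter` and `unicodedata`. A minimal sketch over ten station names taken from the dataset's first rows; the `CATEGORY_NAMES` mapping is an illustrative helper, and this is not necessarily how ydata-profiling computes its tables:

```python
import unicodedata
from collections import Counter

# Ten station names copied from the first rows of the 역명 column.
names = ["가남", "가수원", "가야", "각계", "감곡",
         "감곡장호원", "강경", "강구", "강릉", "개운"]

# Long names for the general categories that appear in this section.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

text = "".join(names)
char_counts = Counter(text)
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for ch in text
)

# Most frequent characters with their share of all characters.
for ch, count in char_counts.most_common(3):
    print(ch, count, f"{100 * count / len(text):.1f}%")
print(dict(category_counts))
```

The standard library exposes general categories but not scripts or blocks, so the script and block tallies are not reproduced here.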
영문
Text
UNIQUE
 
Distinct | 300 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
Value | Count | Frequency (%) |
ganam | 1 | 0.3% |
yongdong | 1 | 0.3% |
wonju | 1 | 0.3% |
wolleung | 1 | 0.3% |
wondong | 1 | 0.3% |
ungcheon | 1 | 0.3% |
ulsan | 1 | 0.3% |
ubo | 1 | 0.3% |
yongsan | 1 | 0.3% |
yonggung | 1 | 0.3% |
Other values (294) | 294 |
Most occurring characters
Value | Count | Frequency (%) |
n | 357 | |
o | 260 | 11.5% |
g | 225 | 9.9% |
a | 207 | 9.1% |
e | 207 | 9.1% |
u | 99 | 4.4% |
h | 73 | 3.2% |
i | 68 | 3.0% |
s | 64 | 2.8% |
y | 55 | 2.4% |
Other values (36) | 654 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1935 | |
Uppercase Letter | 311 | 13.7% |
Dash Punctuation | 13 | 0.6% |
Space Separator | 5 | 0.2% |
Close Punctuation | 2 | 0.1% |
Open Punctuation | 2 | 0.1% |
Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
n | 357 | |
o | 260 | |
g | 225 | |
a | 207 | |
e | 207 | |
u | 99 | 5.1% |
h | 73 | 3.8% |
i | 68 | 3.5% |
s | 64 | 3.3% |
y | 55 | 2.8% |
Other values (12) | 320 |
Uppercase Letter
Value | Count | Frequency (%) |
G | 45 | |
S | 44 | |
J | 33 | |
B | 25 | |
H | 23 | |
Y | 20 | 6.4% |
D | 18 | 5.8% |
M | 17 | 5.5% |
C | 16 | 5.1% |
N | 14 | 4.5% |
Other values (9) | 56 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 13 |
Space Separator
Value | Count | Frequency (%) |
(space) | 5 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Other Punctuation
Value | Count | Frequency (%) |
' | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2246 | |
Common | 23 | 1.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
n | 357 | |
o | 260 | |
g | 225 | 10.0% |
a | 207 | 9.2% |
e | 207 | 9.2% |
u | 99 | 4.4% |
h | 73 | 3.3% |
i | 68 | 3.0% |
s | 64 | 2.8% |
y | 55 | 2.4% |
Other values (31) | 631 |
Common
Value | Count | Frequency (%) |
- | 13 | |
(space) | 5 | 21.7% |
) | 2 | 8.7% |
( | 2 | 8.7% |
' | 1 | 4.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2269 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
n | 357 | |
o | 260 | 11.5% |
g | 225 | 9.9% |
a | 207 | 9.1% |
e | 207 | 9.1% |
u | 99 | 4.4% |
h | 73 | 3.2% |
i | 68 | 3.0% |
s | 64 | 2.8% |
y | 55 | 2.4% |
Other values (36) | 654 |
한자
Text
Distinct | 298 |
---|---|
Distinct (%) | 99.7% |
Missing | 1 |
Missing (%) | 0.3% |
Memory size | 2.5 KiB |
Value | Count | Frequency (%) |
 | 2 | 0.7% |
玉山 | 1 | 0.3% |
元陵 | 1 | 0.3% |
院洞 | 1 | 0.3% |
熊川 | 1 | 0.3% |
蔚山 | 1 | 0.3% |
友保 | 1 | 0.3% |
龍山 | 1 | 0.3% |
龍宮 | 1 | 0.3% |
元竹 | 1 | 0.3% |
Other values (288) | 288 |
Most occurring characters
Value | Count | Frequency (%) |
山 | 22 | 3.2% |
川 | 19 | 2.8% |
州 | 16 | 2.3% |
新 | 13 | 1.9% |
里 | 13 | 1.9% |
城 | 13 | 1.9% |
東 | 10 | 1.5% |
大 | 10 | 1.5% |
泉 | 10 | 1.5% |
浦 | 9 | 1.3% |
Other values (304) | 548 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 675 | |
Close Punctuation | 3 | 0.4% |
Open Punctuation | 3 | 0.4% |
Dash Punctuation | 2 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
山 | 22 | 3.3% |
川 | 19 | 2.8% |
州 | 16 | 2.4% |
新 | 13 | 1.9% |
里 | 13 | 1.9% |
城 | 13 | 1.9% |
東 | 10 | 1.5% |
大 | 10 | 1.5% |
泉 | 10 | 1.5% |
浦 | 9 | 1.3% |
Other values (301) | 540 |
Close Punctuation
Value | Count | Frequency (%) |
) | 3 |
Open Punctuation
Value | Count | Frequency (%) |
( | 3 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Han | 672 | |
Common | 8 | 1.2% |
Hangul | 3 | 0.4% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
山 | 22 | 3.3% |
川 | 19 | 2.8% |
州 | 16 | 2.4% |
新 | 13 | 1.9% |
里 | 13 | 1.9% |
城 | 13 | 1.9% |
東 | 10 | 1.5% |
大 | 10 | 1.5% |
泉 | 10 | 1.5% |
浦 | 9 | 1.3% |
Other values (298) | 537 |
Common
Value | Count | Frequency (%) |
) | 3 | |
( | 3 | |
- | 2 |
Hangul
Value | Count | Frequency (%) |
엑 | 1 | |
스 | 1 | |
포 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
CJK | 643 | |
CJK Compat Ideographs | 29 | 4.2% |
ASCII | 8 | 1.2% |
Hangul | 3 | 0.4% |
Most frequent character per block
CJK
Value | Count | Frequency (%) |
山 | 22 | 3.4% |
川 | 19 | 3.0% |
州 | 16 | 2.5% |
新 | 13 | 2.0% |
里 | 13 | 2.0% |
城 | 13 | 2.0% |
東 | 10 | 1.6% |
大 | 10 | 1.6% |
泉 | 10 | 1.6% |
浦 | 9 | 1.4% |
Other values (276) | 508 |
ASCII
Value | Count | Frequency (%) |
) | 3 | |
( | 3 | |
- | 2 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
金 | 3 | 10.3% |
龍 | 3 | 10.3% |
禮 | 2 | 6.9% |
麗 | 2 | 6.9% |
羅 | 2 | 6.9% |
林 | 1 | 3.4% |
梨 | 1 | 3.4% |
栗 | 1 | 3.4% |
蓮 | 1 | 3.4% |
綾 | 1 | 3.4% |
Other values (12) | 12 |
Hangul
Value | Count | Frequency (%) |
엑 | 1 | |
스 | 1 | |
포 | 1 |
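The split between the CJK and CJK Compat Ideographs blocks suggests that some characters in 한자 are stored as compatibility code points (U+F900 to U+FAFF) rather than as unified ideographs. A minimal sketch of how such characters might be detected and folded with NFC normalization, using Python's standard `unicodedata` module; the helper names are hypothetical, the two sample code points are illustrative, and the report itself does not perform this step:

```python
import unicodedata

# Range of the CJK Compatibility Ideographs block.
COMPAT_START, COMPAT_END = 0xF900, 0xFAFF

def is_compat_ideograph(ch: str) -> bool:
    """Return True if ch lies in the CJK Compatibility Ideographs block."""
    return COMPAT_START <= ord(ch) <= COMPAT_END

def fold_compat(text: str) -> str:
    # Most compatibility ideographs carry a canonical singleton
    # decomposition, so NFC maps them to their unified counterparts.
    return unicodedata.normalize("NFC", text)

# Illustrative compatibility code points (they render like 金 and 龍).
sample = "\uf90a\uf9c4"
folded = fold_compat(sample)

print([f"U+{ord(c):04X}" for c in sample], "->",
      [f"U+{ord(c):04X}" for c in folded])
```

Folding before profiling would merge visually identical characters that are currently counted separately, at the cost of no longer matching the source file byte for byte.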
주소
Text
UNIQUE
 
Distinct | 300 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
Length
Max length | 34 |
---|---|
Median length | 26.5 |
Mean length | 18.576667 |
Min length | 12 |
Characters and Unicode
Total characters | 5573 |
---|---|
Distinct characters | 251 |
Distinct categories | 7 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 300 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 경기도 여주시 가남읍 태평리 |
---|---|
2nd row | 대전 서구 벌곡로 1324(가수원동) |
3rd row | 부산 부산진구 백양대로 91 |
4th row | 충북 영동군 심천면 각계길 55 |
5th row | 전북 정읍시 감곡면 호남철로 501 |
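The length statistics are plain character counts per value. A minimal sketch over the five sample rows, assuming length is measured in Unicode code points (what Python's `len` returns), followed by a check of the reported mean length from the column totals:

```python
import statistics

# The five sample rows of the 주소 column.
sample_rows = [
    "경기도 여주시 가남읍 태평리",
    "대전 서구 벌곡로 1324(가수원동)",
    "부산 부산진구 백양대로 91",
    "충북 영동군 심천면 각계길 55",
    "전북 정읍시 감곡면 호남철로 501",
]

lengths = [len(row) for row in sample_rows]
print("lengths:", lengths)
print("min:", min(lengths), "max:", max(lengths))
print("median:", statistics.median(lengths))

# Whole-column mean from the reported totals: characters / observations.
total_characters = 5573
n_observations = 300
mean_length = total_characters / n_observations
print("mean length:", round(mean_length, 6))
```

The whole-column mean recomputed this way matches the reported 18.576667.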
Value | Count | Frequency (%) |
경북 | 55 | 3.9% |
전남 | 36 | 2.5% |
충남 | 27 | 1.9% |
경남 | 26 | 1.8% |
충북 | 25 | 1.8% |
강원도 | 24 | 1.7% |
전북 | 21 | 1.5% |
경기도 | 20 | 1.4% |
강원 | 14 | 1.0% |
정선군 | 11 | 0.8% |
Other values (824) | 1160 |
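The word table above reads as a count of whitespace-separated tokens across all addresses. A minimal sketch over ten addresses from the dataset's first rows, assuming plain whitespace splitting; ydata-profiling's own tokenization may differ:

```python
from collections import Counter

# Ten addresses copied from the first rows of the 주소 column.
addresses = [
    "경기도 여주시 가남읍 태평리",
    "대전 서구 벌곡로 1324(가수원동)",
    "부산 부산진구 백양대로 91",
    "충북 영동군 심천면 각계길 55",
    "전북 정읍시 감곡면 호남철로 501",
    "충북 음성군 감곡면 왕장리 312-2",
    "충남 논산시 강경읍 대흥로 1",
    "경상북도 영덕군 강구면 강산로 67",
    "강원도 강릉시 용지로 176",
    "전남 순천시 서면 개운길 30",
]

# Split each address on whitespace and count every token.
tokens = [tok for addr in addresses for tok in addr.split()]
word_counts = Counter(tokens)

for word, count in word_counts.most_common(3):
    print(word, count, f"{100 * count / len(tokens):.1f}%")
```

Province abbreviations such as 경북 and 전남 lead the full table because nearly every address begins with one.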
Most occurring characters
Value | Count | Frequency (%) |
(space) | 1128 | 20.2% |
1 | 203 | 3.6% |
시 | 173 | 3.1% |
로 | 169 | 3.0% |
면 | 138 | 2.5% |
북 | 128 | 2.3% |
경 | 127 | 2.3% |
군 | 122 | 2.2% |
2 | 120 | 2.2% |
남 | 119 | 2.1% |
Other values (241) | 3146 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3397 | |
Space Separator | 1128 | 20.2% |
Decimal Number | 921 | 16.5% |
Dash Punctuation | 74 | 1.3% |
Close Punctuation | 25 | 0.4% |
Open Punctuation | 25 | 0.4% |
Other Punctuation | 3 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 173 | 5.1% |
로 | 169 | 5.0% |
면 | 138 | 4.1% |
북 | 128 | 3.8% |
경 | 127 | 3.7% |
군 | 122 | 3.6% |
남 | 119 | 3.5% |
전 | 86 | 2.5% |
원 | 84 | 2.5% |
길 | 82 | 2.4% |
Other values (226) | 2169 |
Decimal Number
Value | Count | Frequency (%) |
1 | 203 | |
2 | 120 | |
3 | 88 | |
5 | 86 | |
7 | 76 | 8.3% |
6 | 74 | 8.0% |
4 | 72 | 7.8% |
8 | 69 | 7.5% |
0 | 67 | 7.3% |
9 | 66 | 7.2% |
Space Separator
Value | Count | Frequency (%) |
(space) | 1128 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 74 |
Close Punctuation
Value | Count | Frequency (%) |
) | 25 |
Open Punctuation
Value | Count | Frequency (%) |
( | 25 |
Other Punctuation
Value | Count | Frequency (%) |
, | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 3397 | |
Common | 2176 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 173 | 5.1% |
로 | 169 | 5.0% |
면 | 138 | 4.1% |
북 | 128 | 3.8% |
경 | 127 | 3.7% |
군 | 122 | 3.6% |
남 | 119 | 3.5% |
전 | 86 | 2.5% |
원 | 84 | 2.5% |
길 | 82 | 2.4% |
Other values (226) | 2169 |
Common
Value | Count | Frequency (%) |
(space) | 1128 | |
1 | 203 | 9.3% |
2 | 120 | 5.5% |
3 | 88 | 4.0% |
5 | 86 | 4.0% |
7 | 76 | 3.5% |
- | 74 | 3.4% |
6 | 74 | 3.4% |
4 | 72 | 3.3% |
8 | 69 | 3.2% |
Other values (5) | 186 | 8.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 3397 | |
ASCII | 2176 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 1128 | |
1 | 203 | 9.3% |
2 | 120 | 5.5% |
3 | 88 | 4.0% |
5 | 86 | 4.0% |
7 | 76 | 3.5% |
- | 74 | 3.4% |
6 | 74 | 3.4% |
4 | 72 | 3.3% |
8 | 69 | 3.2% |
Other values (5) | 186 | 8.5% |
Hangul
Value | Count | Frequency (%) |
시 | 173 | 5.1% |
로 | 169 | 5.0% |
면 | 138 | 4.1% |
북 | 128 | 3.8% |
경 | 127 | 3.7% |
군 | 122 | 3.6% |
남 | 119 | 3.5% |
전 | 86 | 2.5% |
원 | 84 | 2.5% |
길 | 82 | 2.4% |
Other values (226) | 2169 |
 | 번호 | 역명 | 영문 | 한자 | 주소 |
---|---|---|---|---|---|
0 | 1 | 가남 | Ganam | 加南 | 경기도 여주시 가남읍 태평리 |
1 | 2 | 가수원 | Gasuwon | 佳水院 | 대전 서구 벌곡로 1324(가수원동) |
2 | 3 | 가야 | Gaya | 伽倻 | 부산 부산진구 백양대로 91 |
3 | 4 | 각계 | Gakgye | 覺溪 | 충북 영동군 심천면 각계길 55 |
4 | 5 | 감곡 | Gamgok | 甘谷 | 전북 정읍시 감곡면 호남철로 501 |
5 | 6 | 감곡장호원 | GangokJanghowon | 甘谷長湖院 | 충북 음성군 감곡면 왕장리 312-2 |
6 | 7 | 강경 | Ganggyeong | 江景 | 충남 논산시 강경읍 대흥로 1 |
7 | 8 | 강구 | Ganggu | 江口 | 경상북도 영덕군 강구면 강산로 67 |
8 | 9 | 강릉 | Gangneung | 江陵 | 강원도 강릉시 용지로 176 |
9 | 10 | 개운 | Gaeun | 開雲 | 전남 순천시 서면 개운길 30 |
 | 번호 | 역명 | 영문 | 한자 | 주소 |
---|---|---|---|---|---|
290 | 291 | 화명 | Hwamyeong | 華明 | 부산 북구 학사로 135(화명동) |
291 | 292 | 화본 | Hwabon | 花本 | 경북 군위군 산성면 산성가음로 711-9 |
292 | 293 | 화산 | Hwasan | 花山 | 경북 영천시 화산면 장수로 917-10 |
293 | 294 | 화순 | Hwasun | 和順 | 전남 화순군 화순읍 벽라리 507 |
294 | 295 | 화양 | Hwayang | 華陽 | 충남 홍성군 금마면 화양리 181 |
295 | 296 | 황간 | Hwanggan | 黃澗 | 충북 영동군 황간면 하옥포2길 14 |
296 | 297 | 횡성 | Hoengseong | 橫城 | 강원도 횡성군 횡성읍 덕고로 591 |
297 | 298 | 횡천 | Hoengcheon | 橫川 | 경남 하동군 횡천면 중마길 277 |
298 | 299 | 효자 | Hyoja | 孝子 | 경북 포항시 남구 새천년대로 289 |
299 | 300 | 효천 | Hyocheon | 孝泉 | 광주시 남구 효천길 5 |