Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 56 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.8 KiB |
Average record size in memory | 50.3 B |
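The overview figures above can be recomputed from the raw rows. The following is a minimal sketch in plain Python, not the profiler's own implementation: the helper name `dataset_statistics`, the use of `None` as the missing marker, and the definition of a duplicate as any repeat of an earlier row are all assumptions made for illustration.

```python
from collections import Counter


def dataset_statistics(rows):
    """Summarise a list of dict rows (hypothetical helper, not the profiler's API)."""
    n_obs = len(rows)
    n_vars = len(rows[0]) if rows else 0
    total_cells = n_obs * n_vars
    # Assumption: a cell counts as missing only when it is None.
    missing = sum(1 for row in rows for value in row.values() if value is None)
    # Assumption: every repeat of an already-seen row counts as one duplicate.
    seen = Counter(tuple(sorted(row.items(), key=lambda kv: kv[0])) for row in rows)
    duplicates = sum(count - 1 for count in seen.values())
    return {
        "variables": n_vars,
        "observations": n_obs,
        "missing_cells": missing,
        "missing_pct": 100.0 * missing / total_cells if total_cells else 0.0,
        "duplicate_rows": duplicates,
        "duplicate_pct": 100.0 * duplicates / n_obs if n_obs else 0.0,
    }


# Three rows and two columns taken from the sample shown in this report.
rows = [
    {"역명": "방화", "역명(영문)": "Banghwa"},
    {"역명": "개화산", "역명(영문)": "Gaehwasan"},
    {"역명": "김포공항", "역명(영문)": "Gimpo Int'l Airport"},
]
stats = dataset_statistics(rows)
```

On the full 56-row file this kind of count would be expected to agree with the zero missing cells and zero duplicate rows reported above.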
Variable types
Categorical | 6 |
---|---|
Dataset
Description | Provides station names in Korean (Hangul), English, romanized Korean, Japanese, Simplified Chinese, and Traditional Chinese |
---|---|
Author | 국가철도공단 (Korea National Railway) |
URL | https://www.data.go.kr/data/15064043/fileData.do |
Alerts
역명 has a high cardinality: 56 distinct values | High cardinality |
역명(영문) has a high cardinality: 56 distinct values | High cardinality |
역명(로마자) has a high cardinality: 56 distinct values | High cardinality |
역명(일본어) has a high cardinality: 56 distinct values | High cardinality |
역명(중국어 간체) has a high cardinality: 56 distinct values | High cardinality |
역명(중국어 번체) has a high cardinality: 56 distinct values | High cardinality |
역명(중국어 간체) is highly correlated with 역명 and 4 other fields | High correlation |
역명 is highly correlated with 역명(중국어 간체) and 4 other fields | High correlation |
역명(로마자) is highly correlated with 역명(중국어 간체) and 4 other fields | High correlation |
역명(중국어 번체) is highly correlated with 역명(중국어 간체) and 4 other fields | High correlation |
역명(일본어) is highly correlated with 역명(중국어 간체) and 4 other fields | High correlation |
역명(영문) is highly correlated with 역명(중국어 간체) and 4 other fields | High correlation |
역명 has unique values | Unique |
역명(영문) has unique values | Unique |
역명(로마자) has unique values | Unique |
역명(일본어) has unique values | Unique |
역명(중국어 간체) has unique values | Unique |
역명(중국어 번체) has unique values | Unique |
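The cardinality and uniqueness alerts above follow from simple distinct-value counts per column. Below is a hedged sketch of that check in plain Python. The function name `column_alerts` is hypothetical, and the default threshold of 50 is an assumption about the profiler's configuration rather than a value stated in this report.

```python
def column_alerts(rows, cardinality_threshold=50):
    """Return alert labels per column (illustrative only, not the profiler's code)."""
    alerts = {}
    for column in rows[0].keys():
        values = [row[column] for row in rows]
        distinct = len(set(values))
        labels = []
        # Assumption: "High cardinality" fires when distinct values exceed the threshold.
        if distinct > cardinality_threshold:
            labels.append("High cardinality")
        # "Unique" means no value occurs more than once.
        if distinct == len(values):
            labels.append("Unique")
        alerts[column] = labels
    return alerts


# Five rows from the sample shown in this report.
rows = [
    {"역명": "방화", "역명(영문)": "Banghwa"},
    {"역명": "개화산", "역명(영문)": "Gaehwasan"},
    {"역명": "김포공항", "역명(영문)": "Gimpo Int'l Airport"},
    {"역명": "송정", "역명(영문)": "Songjeong"},
    {"역명": "마곡", "역명(영문)": "Magok"},
]
alerts = column_alerts(rows)
```

With all 56 rows and 56 distinct values per column, both conditions would hold for every column, which matches the twelve alerts listed.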
Reproduction
Analysis started | 2023-02-18 08:47:57.244795 |
---|---|
Analysis finished | 2023-02-18 08:47:58.020980 |
Duration | 0.78 seconds |
Software version | pandas-profiling v3.2.0 |
Download configuration | config.json |
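A report like this one can be regenerated roughly as follows. This is a sketch under stated assumptions: the CSV path and encoding are hypothetical (the source file is not named in this report), the title is a placeholder, and `pandas` and `pandas-profiling` v3.2.0 are assumed to be installed. The small timing helper only re-derives the duration from the two timestamps above.

```python
from datetime import datetime


def build_report(csv_path, output_path="report.html", encoding="utf-8"):
    """Profile a CSV file (hypothetical helper; path and encoding are assumptions)."""
    # Imported lazily so the timing helper below works without these packages.
    import pandas as pd
    from pandas_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(
        df,
        title="Station names",  # placeholder title, not taken from the report
        dataset={
            "author": "국가철도공단",
            "url": "https://www.data.go.kr/data/15064043/fileData.do",
        },
    )
    report.to_file(output_path)
    return report


def run_duration_seconds(started, finished):
    """Seconds elapsed between two timestamps in the format shown above."""
    fmt = "%Y-%m-%d %H:%M:%S.%f"
    delta = datetime.strptime(finished, fmt) - datetime.strptime(started, fmt)
    return delta.total_seconds()


duration = run_duration_seconds(
    "2023-02-18 08:47:57.244795", "2023-02-18 08:47:58.020980"
)
```

Rounded to two decimals, `duration` agrees with the 0.78 seconds listed in the table.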
역명
Distinct | 56 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 576.0 B |
방화 | 1 |
---|---|
개화산 | 1 |
영등포구청 | 1 |
김포공항 | 1 |
송정 | 1 |
Other values (51) | 51 |
Length
Max length | 13 |
---|---|
Median length | 11 |
Mean length | 4.357142857 |
Min length | 2 |
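The length statistics above summarise the number of characters per value. A minimal sketch of that computation, assuming length is counted in Unicode code points via `len()` and that the strings are in composed (NFC) form; `length_summary` is a hypothetical helper name.

```python
from statistics import mean, median


def length_summary(values):
    """Max, median, mean and min character length of a list of strings."""
    lengths = [len(value) for value in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


# The five sample values shown for this column.
sample = ["방화", "개화산", "김포공항", "송정", "마곡"]
summary = length_summary(sample)
```

For these five values the lengths are 2, 3, 4, 2 and 2, so the figures differ from the full-column statistics, which cover all 56 station names.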
Unique
Unique | 56 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 방화 |
---|---|
2nd row | 개화산 |
3rd row | 김포공항 |
4th row | 송정 |
5th row | 마곡 |
Common Values
Value | Count | Frequency (%) |
방화 | 1 | 1.8% |
개화산 | 1 | 1.8% |
영등포구청 | 1 | 1.8% |
김포공항 | 1 | 1.8% |
송정 | 1 | 1.8% |
마곡 | 1 | 1.8% |
발산 | 1 | 1.8% |
우장산 | 1 | 1.8% |
화곡 | 1 | 1.8% |
까치산 | 1 | 1.8% |
Other values (46) | 46 |
Words
Value | Count | Frequency (%) |
방화 | 1 | 1.8% |
개화산 | 1 | 1.8% |
명일 | 1 | 1.8% |
왕십리 | 1 | 1.8% |
마장 | 1 | 1.8% |
답십리 | 1 | 1.8% |
장한평 | 1 | 1.8% |
군자(능동 | 1 | 1.8% |
아차산(어린이대공원후문 | 1 | 1.8% |
광나루(장신대 | 1 | 1.8% |
Other values (46) | 46 |
역명(영문)
Distinct | 56 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 576.0 B |
Banghwa | 1 |
---|---|
Gaehwasan | 1 |
Yeongdeungpo-gu Office | 1 |
Gimpo Int'l Airport | 1 |
Songjeong | 1 |
Other values (51) | 51 |
Length
Max length | 55 |
---|---|
Median length | 40 |
Mean length | 14.875 |
Min length | 4 |
Unique
Unique | 56 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | Banghwa |
---|---|
2nd row | Gaehwasan |
3rd row | Gimpo Int'l Airport |
4th row | Songjeong |
5th row | Magok |
Common Values
Value | Count | Frequency (%) |
Banghwa | 1 | 1.8% |
Gaehwasan | 1 | 1.8% |
Yeongdeungpo-gu Office | 1 | 1.8% |
Gimpo Int'l Airport | 1 | 1.8% |
Songjeong | 1 | 1.8% |
Magok | 1 | 1.8% |
Balsan | 1 | 1.8% |
Ujangsan | 1 | 1.8% |
Hwagok | 1 | 1.8% |
Kkachisan | 1 | 1.8% |
Other values (46) | 46 |
Words
Value | Count | Frequency (%) |
park | 4 | 3.8% |
hanam | 3 | 2.9% |
(empty string) | 3 | 2.9% |
gangdong | 2 | 1.9% |
univ | 2 | 1.9% |
center | 2 | 1.9% |
mokdong | 2 | 1.9% |
cheonho | 1 | 1.0% |
seminary | 1 | 1.0% |
theological | 1 | 1.0% |
Other values (84) | 84 |
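The table above appears to count lower-cased, whitespace-separated tokens rather than whole values, which is why the counts exceed the 56 rows. The sketch below reproduces that style of count in plain Python. The tokenization (split on whitespace, then strip leading and trailing ASCII punctuation) is an assumption inferred from the tokens shown, and `word_frequencies` is a hypothetical name.

```python
import string
from collections import Counter


def word_frequencies(values):
    """Count lower-cased tokens across all values (assumed tokenization)."""
    counts = Counter()
    for value in values:
        for token in value.lower().split():
            # Strip punctuation at the edges only; interior characters are kept.
            counts[token.strip(string.punctuation)] += 1
    return counts


# Station names taken from the sample rows of this report.
names = [
    "Gimpo Int'l Airport",
    "Olympic Park (Korea National Sport Univ.)",
    "Hanam Pungsan",
    "Hanam Geomdansan",
]
freq = word_frequencies(names)
total = sum(freq.values())
share_hanam = 100.0 * freq["hanam"] / total
```

Under this assumed tokenization a token made only of punctuation collapses to an empty string, which would be one possible explanation for a row with no visible value.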
역명(로마자)
Distinct | 56 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 576.0 B |
Banghwa | 1 |
---|---|
Gaehwasan | 1 |
Yeongdeungpo-gu Office | 1 |
Gimpo Int'l Airport | 1 |
Songjeong | 1 |
Other values (51) | 51 |
Length
Max length | 36 |
---|---|
Median length | 28 |
Mean length | 13.30357143 |
Min length | 4 |
Unique
Unique | 56 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | Banghwa |
---|---|
2nd row | Gaehwasan |
3rd row | Gimpo Int'l Airport |
4th row | Songjeong |
5th row | Magok |
Common Values
Value | Count | Frequency (%) |
Banghwa | 1 | 1.8% |
Gaehwasan | 1 | 1.8% |
Yeongdeungpo-gu Office | 1 | 1.8% |
Gimpo Int'l Airport | 1 | 1.8% |
Songjeong | 1 | 1.8% |
Magok | 1 | 1.8% |
Balsan | 1 | 1.8% |
Ujangsan | 1 | 1.8% |
Hwagok | 1 | 1.8% |
Kkachisan | 1 | 1.8% |
Other values (46) | 46 |
Words
Value | Count | Frequency (%) |
hanam | 3 | 4.1% |
park | 2 | 2.7% |
banghwa | 1 | 1.4% |
gil-dong | 1 | 1.4% |
cheonho(pungnaptoseong | 1 | 1.4% |
sem | 1 | 1.4% |
(empty string) | 1 | 1.4% |
college | 1 | 1.4% |
gwangnaru(presby | 1 | 1.4% |
achasan(eorinidaegongwonhumun | 1 | 1.4% |
Other values (60) | 60 |
역명(일본어)
Distinct | 56 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 576.0 B |
パンファ | 1 |
---|---|
ケファサン | 1 |
ヨンドンポグチョン | 1 |
キンポゴンハン | 1 |
ソンジョン | 1 |
Other values (51) | 51 |
Length
Max length | 30 |
---|---|
Median length | 18 |
Mean length | 6 |
Min length | 2 |
Unique
Unique | 56 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | パンファ |
---|---|
2nd row | ケファサン |
3rd row | キンポゴンハン |
4th row | ソンジョン |
5th row | マゴク |
Common Values
Value | Count | Frequency (%) |
パンファ | 1 | 1.8% |
ケファサン | 1 | 1.8% |
ヨンドンポグチョン | 1 | 1.8% |
キンポゴンハン | 1 | 1.8% |
ソンジョン | 1 | 1.8% |
マゴク | 1 | 1.8% |
パルサン | 1 | 1.8% |
ウジャンサン | 1 | 1.8% |
ファゴク | 1 | 1.8% |
カチサン | 1 | 1.8% |
Other values (46) | 46 |
Words
Value | Count | Frequency (%) |
パンファ | 1 | 1.8% |
ケファサン | 1 | 1.8% |
ミョンイル | 1 | 1.8% |
ワンシムニ | 1 | 1.8% |
マジャン | 1 | 1.8% |
タプシムニ | 1 | 1.8% |
チャンハンピョン | 1 | 1.8% |
クンジャ | 1 | 1.8% |
アチャサン | 1 | 1.8% |
クァンナル | 1 | 1.8% |
Other values (46) | 46 |
역명(중국어 간체)
Distinct | 56 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 576.0 B |
傍花 | 1 |
---|---|
开花山 | 1 |
永登浦区厅 | 1 |
金浦机场 | 1 |
松亭 | 1 |
Other values (51) | 51 |
Length
Max length | 14 |
---|---|
Median length | 11 |
Mean length | 3.732142857 |
Min length | 2 |
Unique
Unique | 56 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 傍花 |
---|---|
2nd row | 开花山 |
3rd row | 金浦机场 |
4th row | 松亭 |
5th row | 麻谷 |
Common Values
Value | Count | Frequency (%) |
傍花 | 1 | 1.8% |
开花山 | 1 | 1.8% |
永登浦区厅 | 1 | 1.8% |
金浦机场 | 1 | 1.8% |
松亭 | 1 | 1.8% |
麻谷 | 1 | 1.8% |
钵山 | 1 | 1.8% |
雨裝山 | 1 | 1.8% |
禾谷 | 1 | 1.8% |
喜鹊山 | 1 | 1.8% |
Other values (46) | 46 |
Words
Value | Count | Frequency (%) |
傍花 | 1 | 1.8% |
开花山 | 1 | 1.8% |
明逸 | 1 | 1.8% |
往十里 | 1 | 1.8% |
马场 | 1 | 1.8% |
踏十里 | 1 | 1.8% |
长汉坪 | 1 | 1.8% |
君子(陵洞 | 1 | 1.8% |
峨嵯山 | 1 | 1.8% |
广渡口(长神大学 | 1 | 1.8% |
Other values (46) | 46 |
역명(중국어 번체)
Distinct | 56 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 576.0 B |
傍花 | 1 |
---|---|
開花山 | 1 |
永登浦區廳 | 1 |
金浦空港 | 1 |
松亭 | 1 |
Other values (51) | 51 |
Length
Max length | 13 |
---|---|
Median length | 11 |
Mean length | 4.357142857 |
Min length | 2 |
Unique
Unique | 56 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 傍花 |
---|---|
2nd row | 開花山 |
3rd row | 金浦空港 |
4th row | 松亭 |
5th row | 麻谷 |
Common Values
Value | Count | Frequency (%) |
傍花 | 1 | 1.8% |
開花山 | 1 | 1.8% |
永登浦區廳 | 1 | 1.8% |
金浦空港 | 1 | 1.8% |
松亭 | 1 | 1.8% |
麻谷 | 1 | 1.8% |
鉢山 | 1 | 1.8% |
雨裝山 | 1 | 1.8% |
禾谷 | 1 | 1.8% |
까치山 | 1 | 1.8% |
Other values (46) | 46 |
Words
Value | Count | Frequency (%) |
傍花 | 1 | 1.8% |
開花山 | 1 | 1.8% |
明逸 | 1 | 1.8% |
往十里 | 1 | 1.8% |
馬場 | 1 | 1.8% |
踏十里 | 1 | 1.8% |
長漢坪 | 1 | 1.8% |
君子(陵洞 | 1 | 1.8% |
峨嵯山(어린이大公園後門 | 1 | 1.8% |
광나루(長神大 | 1 | 1.8% |
Other values (46) | 46 |
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. The report uses the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | 역명 | 역명(영문) | 역명(로마자) | 역명(일본어) | 역명(중국어 간체) | 역명(중국어 번체) |
---|---|---|---|---|---|---|
0 | 방화 | Banghwa | Banghwa | パンファ | 傍花 | 傍花 |
1 | 개화산 | Gaehwasan | Gaehwasan | ケファサン | 开花山 | 開花山 |
2 | 김포공항 | Gimpo Int'l Airport | Gimpo Int'l Airport | キンポゴンハン | 金浦机场 | 金浦空港 |
3 | 송정 | Songjeong | Songjeong | ソンジョン | 松亭 | 松亭 |
4 | 마곡 | Magok | Magok | マゴク | 麻谷 | 麻谷 |
5 | 발산 | Balsan | Balsan | パルサン | 钵山 | 鉢山 |
6 | 우장산 | Ujangsan | Ujangsan | ウジャンサン | 雨裝山 | 雨裝山 |
7 | 화곡 | Hwagok | Hwagok | ファゴク | 禾谷 | 禾谷 |
8 | 까치산 | Kkachisan | Kkachisan | カチサン | 喜鹊山 | 까치山 |
9 | 신정(은행정) | Sinjeong (Eunhaengjeong) | Sinjeong(Eunhaengjeong) | シンジョン | 新亭 | 新亭(銀杏亭) |
Last rows
| | 역명 | 역명(영문) | 역명(로마자) | 역명(일본어) | 역명(중국어 간체) | 역명(중국어 번체) |
---|---|---|---|---|---|---|
46 | 하남풍산 | Hanam Pungsan | Hanam Pungsan | ハナムプンサン(河南豊山) | 河南丰山 | 河南豊山 |
47 | 하남시청(덕풍·신장) | Hanam City Hall(Deokpung·Sinjang) | Hanam Sicheong(Deokpung·Sinjang) | ハナムシチョン-ドクプン·シンジャん(河南市庁-德豊·新長) | 河南市庁(德丰·新长) | 河南市廳(德豊·新長) |
48 | 하남검단산 | Hanam Geomdansan | Hanam Geomdansan | ハナムゴムダンサン(河南黔丹山) | 河南黔丹山 | 河南黔丹山 |
49 | 둔촌동 | Dunchondong | Dunchon-dong | トゥンチョンドン | 遁村洞 | 遁村洞 |
50 | 올림픽공원(한국체대) | Olympic Park (Korea National Sport Univ.) | Olympic park(Hangukchedae) | オリンピック·コンウォン | 奥林匹克公园(韩国体育大学) | 올림픽公園(韓國體大) |
51 | 방이 | Bangi | Bang | パンイ | 芳荑 | 芳荑 |
52 | 오금 | Ogeum | Ogeum | オグム | 梧琴 | 梧琴 |
53 | 개롱 | Gaerong | Gaerong | ケロン | 开笼 | 開籠 |
54 | 거여 | Geoyeo | Geoyeo | コヨ | 巨余 | 巨余 |
55 | 마천 | Macheon | Macheon | マチョン | 马川 | 馬川 |
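The bias-corrected Cramér's V mentioned in the correlation notes of this report can be sketched in plain Python as below. This follows the commonly cited form of Bergsma's 2013 correction and is an illustration only, not the profiler's own code. The function name `cramers_v_corrected` and the choice to return NaN in degenerate cases are assumptions.

```python
from collections import Counter
from math import sqrt


def cramers_v_corrected(xs, ys):
    """Bias-corrected Cramér's V for two equally long lists of categories."""
    n = len(xs)
    if n < 2:
        return float("nan")
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    r, k = len(row_totals), len(col_totals)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for a, row_count in row_totals.items():
        for b, col_count in col_totals.items():
            expected = row_count * col_count / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected

    phi2 = chi2 / n
    # Bias correction of phi squared and of the table dimensions.
    phi2_corr = max(0.0, phi2 - (r - 1) * (k - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(r_corr - 1, k_corr - 1)
    # Assumption: report NaN when the corrected table size collapses,
    # for example when every value in a column is distinct.
    if denom <= 0:
        return float("nan")
    return sqrt(phi2_corr / denom)


perfect = cramers_v_corrected(["a"] * 10 + ["b"] * 10, ["x"] * 10 + ["y"] * 10)
independent = cramers_v_corrected(["a", "a", "b", "b"], ["x", "y", "x", "y"])
```

In this sketch the first call evaluates to 1 up to floating-point error and the second to 0, matching the two ends of the range described in the text.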