Overview

Dataset statistics

Number of variables8
Number of observations1696
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory107.8 KiB
Average record size in memory65.1 B

Variable types

Categorical4
Numeric1
Text3

Dataset

Description경상남도_도립거창대학교신주소 데이터입니다.(번호, 우편번호1, 우편번호2, 우편번호 주소, 시도 등의 데이터를 포함하고있습니다)
Author경상남도
URLhttps://www.data.go.kr/data/15049413/fileData.do

Alerts

시도 has constant value ""Constant
시군구 is highly overall correlated with 우편번호 and 1 other fieldsHigh correlation
우편번호 is highly overall correlated with 시군구 and 1 other fieldsHigh correlation
읍면동 is highly overall correlated with 우편번호 and 1 other fieldsHigh correlation
우편번호순서 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:49:15.195998
Analysis finished2023-12-12 08:49:16.063703
Duration0.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

우편번호
Categorical

HIGH CORRELATION 

Distinct23
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
445-010
189 
445-842
168 
413-842
165 
445-873
159 
445-380
135 
Other values (18)
880 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row413-861
2nd row445-040
3rd row445-040
4th row445-040
5th row445-040

Common Values

ValueCountFrequency (%)
445-010 189
11.1%
445-842 168
9.9%
413-842 165
9.7%
445-873 159
 
9.4%
445-380 135
 
8.0%
445-040 127
 
7.5%
445-872 108
 
6.4%
445-891 94
 
5.5%
445-370 77
 
4.5%
445-360 73
 
4.3%
Other values (13) 401
23.6%

Length

2023-12-12T17:49:16.144580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
445-010 189
11.1%
445-842 168
9.9%
413-842 165
9.7%
445-873 159
 
9.4%
445-380 135
 
8.0%
445-040 127
 
7.5%
445-872 108
 
6.4%
445-891 94
 
5.5%
445-370 77
 
4.5%
445-360 73
 
4.3%
Other values (13) 401
23.6%

우편번호순서
Real number (ℝ)

UNIQUE 

Distinct1696
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1325173.2
Minimum570421
Maximum5490412
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size15.0 KiB
2023-12-12T17:49:16.329991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum570421
5-th percentile896565.75
Q11023537.8
median1036214.5
Q31042770.2
95-th percentile4598513.2
Maximum5490412
Range4919991
Interquartile range (IQR)19232.5

Descriptive statistics

Standard deviation1066990.9
Coefficient of variation (CV)0.80517093
Kurtosis6.7781792
Mean1325173.2
Median Absolute Deviation (MAD)9246
Skewness2.9209714
Sum2.2474937 × 109
Variance1.1384697 × 1012
MonotonicityNot monotonic
2023-12-12T17:49:16.505572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
926394 1
 
0.1%
923184 1
 
0.1%
923182 1
 
0.1%
923181 1
 
0.1%
923180 1
 
0.1%
923179 1
 
0.1%
923178 1
 
0.1%
1033178 1
 
0.1%
1033177 1
 
0.1%
1033176 1
 
0.1%
Other values (1686) 1686
99.4%
ValueCountFrequency (%)
570421 1
0.1%
570422 1
0.1%
571105 1
0.1%
571109 1
0.1%
571113 1
0.1%
571114 1
0.1%
571115 1
0.1%
571513 1
0.1%
571514 1
0.1%
571515 1
0.1%
ValueCountFrequency (%)
5490412 1
0.1%
5490411 1
0.1%
5490410 1
0.1%
5490409 1
0.1%
5490408 1
0.1%
5490284 1
0.1%
5490283 1
0.1%
5490282 1
0.1%
5490281 1
0.1%
5490280 1
0.1%

주소
Text

UNIQUE 

Distinct1696
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
2023-12-12T17:49:16.769490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length25
Mean length20.617925
Min length13

Characters and Unicode

Total characters34968
Distinct characters138
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1696 ?
Unique (%)100.0%

Sample

1st row경기도 파주시 파주읍 현암말길 17-5
2nd row경기도 화성시 주석로80번길 35
3rd row경기도 화성시 주석로80번길 67-7
4th row경기도 화성시 주석로80번길 49-13
5th row경기도 화성시 주석로80번길 96-15
ValueCountFrequency (%)
경기도 1696
21.8%
화성시 1342
 
17.3%
파주시 298
 
3.8%
송산면 267
 
3.4%
탄현면 238
 
3.1%
비봉면 211
 
2.7%
봉담읍 149
 
1.9%
새오리로 65
 
0.8%
작현길 61
 
0.8%
광탄면 58
 
0.7%
Other values (1246) 3380
43.5%
2023-12-12T17:49:17.163888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6069
 
17.4%
1786
 
5.1%
1771
 
5.1%
1706
 
4.9%
1705
 
4.9%
1453
 
4.2%
1 1438
 
4.1%
1382
 
4.0%
1374
 
3.9%
1069
 
3.1%
Other values (128) 15215
43.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20879
59.7%
Decimal Number 7136
 
20.4%
Space Separator 6069
 
17.4%
Dash Punctuation 884
 
2.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1786
 
8.6%
1771
 
8.5%
1706
 
8.2%
1705
 
8.2%
1453
 
7.0%
1382
 
6.6%
1374
 
6.6%
1069
 
5.1%
774
 
3.7%
755
 
3.6%
Other values (116) 7104
34.0%
Decimal Number
ValueCountFrequency (%)
1 1438
20.2%
2 1050
14.7%
3 860
12.1%
4 721
10.1%
5 645
9.0%
0 571
 
8.0%
6 526
 
7.4%
7 477
 
6.7%
8 424
 
5.9%
9 424
 
5.9%
Space Separator
ValueCountFrequency (%)
6069
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 884
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20879
59.7%
Common 14089
40.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1786
 
8.6%
1771
 
8.5%
1706
 
8.2%
1705
 
8.2%
1453
 
7.0%
1382
 
6.6%
1374
 
6.6%
1069
 
5.1%
774
 
3.7%
755
 
3.6%
Other values (116) 7104
34.0%
Common
ValueCountFrequency (%)
6069
43.1%
1 1438
 
10.2%
2 1050
 
7.5%
- 884
 
6.3%
3 860
 
6.1%
4 721
 
5.1%
5 645
 
4.6%
0 571
 
4.1%
6 526
 
3.7%
7 477
 
3.4%
Other values (2) 848
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20879
59.7%
ASCII 14089
40.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6069
43.1%
1 1438
 
10.2%
2 1050
 
7.5%
- 884
 
6.3%
3 860
 
6.1%
4 721
 
5.1%
5 645
 
4.6%
0 571
 
4.1%
6 526
 
3.7%
7 477
 
3.4%
Other values (2) 848
 
6.0%
Hangul
ValueCountFrequency (%)
1786
 
8.6%
1771
 
8.5%
1706
 
8.2%
1705
 
8.2%
1453
 
7.0%
1382
 
6.6%
1374
 
6.6%
1069
 
5.1%
774
 
3.7%
755
 
3.6%
Other values (116) 7104
34.0%
Distinct171
Distinct (%)10.1%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
2023-12-12T17:49:17.392297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length21
Mean length16.119104
Min length11

Characters and Unicode

Total characters27338
Distinct characters137
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)1.4%

Sample

1st row경기도 파주시 파주읍 현암말길
2nd row경기도 화성시 주석로80번길
3rd row경기도 화성시 주석로80번길
4th row경기도 화성시 주석로80번길
5th row경기도 화성시 주석로80번길
ValueCountFrequency (%)
경기도 1696
27.9%
화성시 1342
22.1%
파주시 298
 
4.9%
송산면 267
 
4.4%
탄현면 238
 
3.9%
비봉면 211
 
3.5%
봉담읍 149
 
2.5%
새오리로 65
 
1.1%
작현길 61
 
1.0%
주석로 58
 
1.0%
Other values (170) 1684
27.7%
2023-12-12T17:49:17.759521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4373
16.0%
1786
 
6.5%
1771
 
6.5%
1706
 
6.2%
1705
 
6.2%
1453
 
5.3%
1382
 
5.1%
1374
 
5.0%
1069
 
3.9%
774
 
2.8%
Other values (127) 9945
36.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20879
76.4%
Space Separator 4373
 
16.0%
Decimal Number 2086
 
7.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1786
 
8.6%
1771
 
8.5%
1706
 
8.2%
1705
 
8.2%
1453
 
7.0%
1382
 
6.6%
1374
 
6.6%
1069
 
5.1%
774
 
3.7%
755
 
3.6%
Other values (116) 7104
34.0%
Decimal Number
ValueCountFrequency (%)
2 406
19.5%
1 376
18.0%
0 222
10.6%
4 221
10.6%
5 216
10.4%
3 193
9.3%
9 125
 
6.0%
8 120
 
5.8%
7 104
 
5.0%
6 103
 
4.9%
Space Separator
ValueCountFrequency (%)
4373
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20879
76.4%
Common 6459
 
23.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1786
 
8.6%
1771
 
8.5%
1706
 
8.2%
1705
 
8.2%
1453
 
7.0%
1382
 
6.6%
1374
 
6.6%
1069
 
5.1%
774
 
3.7%
755
 
3.6%
Other values (116) 7104
34.0%
Common
ValueCountFrequency (%)
4373
67.7%
2 406
 
6.3%
1 376
 
5.8%
0 222
 
3.4%
4 221
 
3.4%
5 216
 
3.3%
3 193
 
3.0%
9 125
 
1.9%
8 120
 
1.9%
7 104
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20879
76.4%
ASCII 6459
 
23.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4373
67.7%
2 406
 
6.3%
1 376
 
5.8%
0 222
 
3.4%
4 221
 
3.4%
5 216
 
3.3%
3 193
 
3.0%
9 125
 
1.9%
8 120
 
1.9%
7 104
 
1.6%
Hangul
ValueCountFrequency (%)
1786
 
8.6%
1771
 
8.5%
1706
 
8.2%
1705
 
8.2%
1453
 
7.0%
1382
 
6.6%
1374
 
6.6%
1069
 
5.1%
774
 
3.7%
755
 
3.6%
Other values (116) 7104
34.0%

시도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
경기도
1696 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도
2nd row경기도
3rd row경기도
4th row경기도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 1696
100.0%

Length

2023-12-12T17:49:18.172698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:49:18.274862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 1696
100.0%

시군구
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
화성시
1342 
파주시
298 
수원시 권선구
 
56

Length

Max length7
Median length3
Mean length3.1320755
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row파주시
2nd row화성시
3rd row화성시
4th row화성시
5th row화성시

Common Values

ValueCountFrequency (%)
화성시 1342
79.1%
파주시 298
 
17.6%
수원시 권선구 56
 
3.3%

Length

2023-12-12T17:49:18.413030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:49:18.533444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
화성시 1342
76.6%
파주시 298
 
17.0%
수원시 56
 
3.2%
권선구 56
 
3.2%

읍면동
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
<NA>
771 
송산면
267 
탄현면
238 
비봉면
211 
봉담읍
149 
Other values (2)
 
60

Length

Max length4
Median length3
Mean length3.4545991
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row파주읍
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 771
45.5%
송산면 267
 
15.7%
탄현면 238
 
14.0%
비봉면 211
 
12.4%
봉담읍 149
 
8.8%
광탄면 58
 
3.4%
파주읍 2
 
0.1%

Length

2023-12-12T17:49:18.696967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:49:18.839777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 771
45.5%
송산면 267
 
15.7%
탄현면 238
 
14.0%
비봉면 211
 
12.4%
봉담읍 149
 
8.8%
광탄면 58
 
3.4%
파주읍 2
 
0.1%
Distinct169
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
2023-12-12T17:49:19.240172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length5.8054245
Min length3

Characters and Unicode

Total characters9846
Distinct characters127
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)1.4%

Sample

1st row현암말길
2nd row주석로80번길
3rd row주석로80번길
4th row주석로80번길
5th row주석로80번길
ValueCountFrequency (%)
새오리로 65
 
3.8%
작현길 61
 
3.6%
주석로 58
 
3.4%
안녕남로 47
 
2.8%
송산로 47
 
2.8%
주석로80번길 47
 
2.8%
안녕남로142번길 40
 
2.4%
칠곡길 38
 
2.2%
삼화길105번길 37
 
2.2%
방촌로879번길 36
 
2.1%
Other values (159) 1220
71.9%
2023-12-12T17:49:19.798142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1382
 
14.0%
1069
 
10.9%
755
 
7.7%
2 406
 
4.1%
1 376
 
3.8%
0 222
 
2.3%
4 221
 
2.2%
220
 
2.2%
5 216
 
2.2%
207
 
2.1%
Other values (117) 4772
48.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7760
78.8%
Decimal Number 2086
 
21.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1382
17.8%
1069
 
13.8%
755
 
9.7%
220
 
2.8%
207
 
2.7%
163
 
2.1%
150
 
1.9%
139
 
1.8%
136
 
1.8%
129
 
1.7%
Other values (107) 3410
43.9%
Decimal Number
ValueCountFrequency (%)
2 406
19.5%
1 376
18.0%
0 222
10.6%
4 221
10.6%
5 216
10.4%
3 193
9.3%
9 125
 
6.0%
8 120
 
5.8%
7 104
 
5.0%
6 103
 
4.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7760
78.8%
Common 2086
 
21.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1382
17.8%
1069
 
13.8%
755
 
9.7%
220
 
2.8%
207
 
2.7%
163
 
2.1%
150
 
1.9%
139
 
1.8%
136
 
1.8%
129
 
1.7%
Other values (107) 3410
43.9%
Common
ValueCountFrequency (%)
2 406
19.5%
1 376
18.0%
0 222
10.6%
4 221
10.6%
5 216
10.4%
3 193
9.3%
9 125
 
6.0%
8 120
 
5.8%
7 104
 
5.0%
6 103
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7760
78.8%
ASCII 2086
 
21.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1382
17.8%
1069
 
13.8%
755
 
9.7%
220
 
2.8%
207
 
2.7%
163
 
2.1%
150
 
1.9%
139
 
1.8%
136
 
1.8%
129
 
1.7%
Other values (107) 3410
43.9%
ASCII
ValueCountFrequency (%)
2 406
19.5%
1 376
18.0%
0 222
10.6%
4 221
10.6%
5 216
10.4%
3 193
9.3%
9 125
 
6.0%
8 120
 
5.8%
7 104
 
5.0%
6 103
 
4.9%

Interactions

2023-12-12T17:49:15.684499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:49:19.912152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
우편번호우편번호순서시군구읍면동
우편번호1.0000.6111.0001.000
우편번호순서0.6111.0000.3150.638
시군구1.0000.3151.0001.000
읍면동1.0000.6381.0001.000
2023-12-12T17:49:20.026470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구우편번호읍면동
시군구1.0000.9940.998
우편번호0.9941.0000.996
읍면동0.9980.9961.000
2023-12-12T17:49:20.116209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
우편번호순서우편번호시군구읍면동
우편번호순서1.0000.3950.1070.331
우편번호0.3951.0000.9940.996
시군구0.1070.9941.0000.998
읍면동0.3310.9960.9981.000

Missing values

2023-12-12T17:49:15.822679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:49:15.990355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

우편번호우편번호순서주소신주소시도시군구읍면동도로주소명
0413-861926394경기도 파주시 파주읍 현암말길 17-5경기도 파주시 파주읍 현암말길경기도파주시파주읍현암말길
1445-0401036356경기도 화성시 주석로80번길 35경기도 화성시 주석로80번길경기도화성시<NA>주석로80번길
2445-0401036357경기도 화성시 주석로80번길 67-7경기도 화성시 주석로80번길경기도화성시<NA>주석로80번길
3445-0401036358경기도 화성시 주석로80번길 49-13경기도 화성시 주석로80번길경기도화성시<NA>주석로80번길
4445-0401036359경기도 화성시 주석로80번길 96-15경기도 화성시 주석로80번길경기도화성시<NA>주석로80번길
5445-0401036360경기도 화성시 주석로80번길 100경기도 화성시 주석로80번길경기도화성시<NA>주석로80번길
6445-0401036361경기도 화성시 주석로80번길 57-5경기도 화성시 주석로80번길경기도화성시<NA>주석로80번길
7445-0401036362경기도 화성시 주석로80번길 27경기도 화성시 주석로80번길경기도화성시<NA>주석로80번길
8445-0401036363경기도 화성시 주석로80번길 38경기도 화성시 주석로80번길경기도화성시<NA>주석로80번길
9445-0401036364경기도 화성시 주석로80번길 8경기도 화성시 주석로80번길경기도화성시<NA>주석로80번길
우편번호우편번호순서주소신주소시도시군구읍면동도로주소명
1686413-843922864경기도 파주시 탄현면 한산로 6-42경기도 파주시 탄현면 한산로경기도파주시탄현면한산로
1687413-843922865경기도 파주시 탄현면 한산로 62-58경기도 파주시 탄현면 한산로경기도파주시탄현면한산로
1688413-843922866경기도 파주시 탄현면 한산로 47경기도 파주시 탄현면 한산로경기도파주시탄현면한산로
1689413-843922867경기도 파주시 탄현면 한산로 36경기도 파주시 탄현면 한산로경기도파주시탄현면한산로
1690413-843922868경기도 파주시 탄현면 정승로 51경기도 파주시 탄현면 정승로경기도파주시탄현면정승로
1691413-843922869경기도 파주시 탄현면 정승로 91경기도 파주시 탄현면 정승로경기도파주시탄현면정승로
1692413-843922870경기도 파주시 탄현면 정승로 109경기도 파주시 탄현면 정승로경기도파주시탄현면정승로
1693413-843922871경기도 파주시 탄현면 정승로 21경기도 파주시 탄현면 정승로경기도파주시탄현면정승로
1694413-843922872경기도 파주시 탄현면 정승로 8-2경기도 파주시 탄현면 정승로경기도파주시탄현면정승로
1695413-843922873경기도 파주시 탄현면 정승로 17경기도 파주시 탄현면 정승로경기도파주시탄현면정승로