Overview

Dataset statistics

Number of variables6
Number of observations305
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.0 KiB
Average record size in memory50.4 B

Variable types

Numeric2
Categorical2
Text2

Dataset

Description경상남도 관공서(사무소 및 행정복지센터) 현황으로, 관공서 위치, 명칭, 우편번호, 주소 등의 정보를 포함하고 있습니다.
URLhttps://www.data.go.kr/data/15039158/fileData.do

Alerts

시도 has constant value ""Constant
연번 is highly overall correlated with 시군구High correlation
우편번호 is highly overall correlated with 시군구High correlation
시군구 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:40:33.516360
Analysis finished2023-12-12 08:40:34.911079
Duration1.39 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct305
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean153
Minimum1
Maximum305
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 KiB
2023-12-12T17:40:35.002204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile16.2
Q177
median153
Q3229
95-th percentile289.8
Maximum305
Range304
Interquartile range (IQR)152

Descriptive statistics

Standard deviation88.190136
Coefficient of variation (CV)0.57640611
Kurtosis-1.2
Mean153
Median Absolute Deviation (MAD)76
Skewness0
Sum46665
Variance7777.5
MonotonicityStrictly increasing
2023-12-12T17:40:35.205934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
202 1
 
0.3%
209 1
 
0.3%
208 1
 
0.3%
207 1
 
0.3%
206 1
 
0.3%
205 1
 
0.3%
204 1
 
0.3%
203 1
 
0.3%
201 1
 
0.3%
Other values (295) 295
96.7%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
305 1
0.3%
304 1
0.3%
303 1
0.3%
302 1
0.3%
301 1
0.3%
300 1
0.3%
299 1
0.3%
298 1
0.3%
297 1
0.3%
296 1
0.3%

시도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
경남
305 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경남
2nd row경남
3rd row경남
4th row경남
5th row경남

Common Values

ValueCountFrequency (%)
경남 305
100.0%

Length

2023-12-12T17:40:35.342776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:40:35.461586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경남 305
100.0%

시군구
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)7.2%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
진주시
30 
김해시
 
19
거제시
 
18
합천군
 
17
밀양시
 
16
Other values (17)
205 

Length

Max length9
Median length3
Mean length3.8983607
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row창원시 의창구
2nd row창원시 의창구
3rd row창원시 의창구
4th row창원시 의창구
5th row창원시 의창구

Common Values

ValueCountFrequency (%)
진주시 30
 
9.8%
김해시 19
 
6.2%
거제시 18
 
5.9%
합천군 17
 
5.6%
밀양시 16
 
5.2%
창원시 마산합포구 15
 
4.9%
통영시 15
 
4.9%
고성군 14
 
4.6%
사천시 14
 
4.6%
창녕군 14
 
4.6%
Other values (12) 133
43.6%

Length

2023-12-12T17:40:35.618469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
창원시 55
 
15.3%
진주시 30
 
8.3%
김해시 19
 
5.3%
거제시 18
 
5.0%
합천군 17
 
4.7%
밀양시 16
 
4.4%
마산합포구 15
 
4.2%
통영시 15
 
4.2%
고성군 14
 
3.9%
사천시 14
 
3.9%
Other values (13) 147
40.8%
Distinct302
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T17:40:35.901064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length7.7868852
Min length6

Characters and Unicode

Total characters2375
Distinct characters163
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique300 ?
Unique (%)98.4%

Sample

1st row동읍행정복지센터
2nd row북면행정복지센터
3rd row대산면행정복지센터
4th row의창동행정복지센터
5th row팔룡동행정복지센터
ValueCountFrequency (%)
행정복지센터 12
 
3.8%
중앙동행정복지센터 3
 
0.9%
상동면행정복지센터 2
 
0.6%
대병면사무소 1
 
0.3%
동읍행정복지센터 1
 
0.3%
이방면사무소 1
 
0.3%
대합면사무소 1
 
0.3%
성산면사무소 1
 
0.3%
고암면사무소 1
 
0.3%
남지읍행정복지센터 1
 
0.3%
Other values (293) 293
92.4%
2023-12-12T17:40:36.367385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
181
 
7.6%
181
 
7.6%
175
 
7.4%
175
 
7.4%
174
 
7.3%
167
 
7.0%
167
 
7.0%
132
 
5.6%
130
 
5.5%
128
 
5.4%
Other values (153) 765
32.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2348
98.9%
Decimal Number 15
 
0.6%
Space Separator 12
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
181
 
7.7%
181
 
7.7%
175
 
7.5%
175
 
7.5%
174
 
7.4%
167
 
7.1%
167
 
7.1%
132
 
5.6%
130
 
5.5%
128
 
5.5%
Other values (149) 738
31.4%
Decimal Number
ValueCountFrequency (%)
1 7
46.7%
2 7
46.7%
3 1
 
6.7%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2348
98.9%
Common 27
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
181
 
7.7%
181
 
7.7%
175
 
7.5%
175
 
7.5%
174
 
7.4%
167
 
7.1%
167
 
7.1%
132
 
5.6%
130
 
5.5%
128
 
5.5%
Other values (149) 738
31.4%
Common
ValueCountFrequency (%)
12
44.4%
1 7
25.9%
2 7
25.9%
3 1
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2348
98.9%
ASCII 27
 
1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
181
 
7.7%
181
 
7.7%
175
 
7.5%
175
 
7.5%
174
 
7.4%
167
 
7.1%
167
 
7.1%
132
 
5.6%
130
 
5.5%
128
 
5.5%
Other values (149) 738
31.4%
ASCII
ValueCountFrequency (%)
12
44.4%
1 7
25.9%
2 7
25.9%
3 1
 
3.7%

우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct304
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean51680.984
Minimum50001
Maximum53333
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 KiB
2023-12-12T17:40:36.538843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum50001
5-th percentile50113.4
Q150574
median51777
Q352602
95-th percentile53210.4
Maximum53333
Range3332
Interquartile range (IQR)2028

Descriptive statistics

Standard deviation1048.3537
Coefficient of variation (CV)0.020285096
Kurtosis-1.3733449
Mean51680.984
Median Absolute Deviation (MAD)903
Skewness-0.14421162
Sum15762700
Variance1099045.6
MonotonicityNot monotonic
2023-12-12T17:40:36.717052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
51670 2
 
0.7%
51128 1
 
0.3%
52073 1
 
0.3%
50305 1
 
0.3%
50301 1
 
0.3%
50311 1
 
0.3%
50313 1
 
0.3%
50315 1
 
0.3%
50329 1
 
0.3%
50328 1
 
0.3%
Other values (294) 294
96.4%
ValueCountFrequency (%)
50001 1
0.3%
50003 1
0.3%
50009 1
0.3%
50018 1
0.3%
50022 1
0.3%
50025 1
0.3%
50027 1
0.3%
50041 1
0.3%
50050 1
0.3%
50051 1
0.3%
ValueCountFrequency (%)
53333 1
0.3%
53331 1
0.3%
53328 1
0.3%
53320 1
0.3%
53313 1
0.3%
53310 1
0.3%
53294 1
0.3%
53286 1
0.3%
53281 1
0.3%
53276 1
0.3%

주소
Text

UNIQUE 

Distinct305
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T17:40:37.014201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length26
Mean length20.357377
Min length14

Characters and Unicode

Total characters6209
Distinct characters225
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique305 ?
Unique (%)100.0%

Sample

1st row경상남도 창원시 의창구 동읍 동읍로 88
2nd row경상남도 창원시 의창구 북면 천주로 1085
3rd row경상남도 창원시 의창구 대산면 가술산단동로 10
4th row경상남도 창원시 의창구 서상로12번길 75
5th row경상남도 창원시 의창구 팔용로 435
ValueCountFrequency (%)
경상남도 305
 
20.7%
창원시 55
 
3.7%
진주시 30
 
2.0%
김해시 19
 
1.3%
거제시 18
 
1.2%
합천군 17
 
1.2%
밀양시 16
 
1.1%
마산합포구 15
 
1.0%
통영시 15
 
1.0%
고성군 14
 
1.0%
Other values (698) 968
65.8%
2023-12-12T17:40:37.485733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1168
18.8%
346
 
5.6%
325
 
5.2%
316
 
5.1%
312
 
5.0%
231
 
3.7%
1 190
 
3.1%
182
 
2.9%
175
 
2.8%
128
 
2.1%
Other values (215) 2836
45.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4097
66.0%
Space Separator 1174
 
18.9%
Decimal Number 894
 
14.4%
Dash Punctuation 28
 
0.5%
Open Punctuation 8
 
0.1%
Close Punctuation 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
346
 
8.4%
325
 
7.9%
316
 
7.7%
312
 
7.6%
231
 
5.6%
182
 
4.4%
175
 
4.3%
128
 
3.1%
119
 
2.9%
114
 
2.8%
Other values (200) 1849
45.1%
Decimal Number
ValueCountFrequency (%)
1 190
21.3%
5 108
12.1%
3 103
11.5%
2 98
11.0%
4 80
8.9%
7 77
8.6%
6 71
 
7.9%
8 63
 
7.0%
0 57
 
6.4%
9 47
 
5.3%
Space Separator
ValueCountFrequency (%)
1168
99.5%
  6
 
0.5%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4097
66.0%
Common 2112
34.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
346
 
8.4%
325
 
7.9%
316
 
7.7%
312
 
7.6%
231
 
5.6%
182
 
4.4%
175
 
4.3%
128
 
3.1%
119
 
2.9%
114
 
2.8%
Other values (200) 1849
45.1%
Common
ValueCountFrequency (%)
1168
55.3%
1 190
 
9.0%
5 108
 
5.1%
3 103
 
4.9%
2 98
 
4.6%
4 80
 
3.8%
7 77
 
3.6%
6 71
 
3.4%
8 63
 
3.0%
0 57
 
2.7%
Other values (5) 97
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4097
66.0%
ASCII 2106
33.9%
None 6
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1168
55.5%
1 190
 
9.0%
5 108
 
5.1%
3 103
 
4.9%
2 98
 
4.7%
4 80
 
3.8%
7 77
 
3.7%
6 71
 
3.4%
8 63
 
3.0%
0 57
 
2.7%
Other values (4) 91
 
4.3%
Hangul
ValueCountFrequency (%)
346
 
8.4%
325
 
7.9%
316
 
7.7%
312
 
7.6%
231
 
5.6%
182
 
4.4%
175
 
4.3%
128
 
3.1%
119
 
2.9%
114
 
2.8%
Other values (200) 1849
45.1%
None
ValueCountFrequency (%)
  6
100.0%

Interactions

2023-12-12T17:40:34.423675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:40:33.861624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:40:34.526415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:40:34.291873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:40:37.615411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군구우편번호
연번1.0000.9810.930
시군구0.9811.0000.977
우편번호0.9300.9771.000
2023-12-12T17:40:37.739702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번우편번호시군구
연번1.000-0.2920.873
우편번호-0.2921.0000.853
시군구0.8730.8531.000

Missing values

2023-12-12T17:40:34.707013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:40:34.854281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시도시군구읍면동우편번호주소
01경남창원시 의창구동읍행정복지센터51128경상남도 창원시 의창구 동읍 동읍로 88
12경남창원시 의창구북면행정복지센터51103경상남도 창원시 의창구 북면 천주로 1085
23경남창원시 의창구대산면행정복지센터51124경상남도 창원시 의창구 대산면 가술산단동로 10
34경남창원시 의창구의창동행정복지센터51194경상남도 창원시 의창구 서상로12번길 75
45경남창원시 의창구팔룡동행정복지센터51374경상남도 창원시 의창구 팔용로 435
56경남창원시 의창구명곡동행정복지센터51168경상남도 창원시 의창구 태복산로15번길 8
67경남창원시 의창구봉림동행정복지센터51157경상남도 창원시 의창구 대봉로26번길 5
78경남창원시 성산구반송동행정복지센터51427경상남도 창원시 성산구 원이대로473번길 19-14
89경남창원시 성산구중앙동행정복지센터51524경상남도 창원시 성산구 외동반림로 5
910경남창원시 의창구용지동행정복지센터51431경상남도 창원시 의창구 용지로239번길 19-4
연번시도시군구읍면동우편번호주소
295296경남합천군쌍책면사무소50252경상남도 합천군 쌍책면 성산큰길 48
296297경남합천군덕곡면사무소50248경상남도 합천군 덕곡면 율지2길 11
297298경남합천군청덕면사무소50253경상남도 합천군 청덕면 동부로 1754
298299경남합천군적중면사무소50247경상남도 합천군 적중면 적중로 98
299300경남합천군대양면사무소50240경상남도 합천군 대양면 대한로 5
300301경남합천군쌍백면사무소50218경상남도 합천군 쌍백면 쌍백중앙로 63
301302경남합천군삼가면사무소50222경상남도 합천군 삼가면 삼가중앙2길 12-8
302303경남합천군가회면사무소50226경상남도 합천군 가회면 황매산로 52
303304경남합천군대병면사무소50216경상남도 합천군 대병면 신성동길 23
304305경남합천군용주면사무소50213경상남도 합천군 용주면 황계폭포로 1154