Overview

Dataset statistics

Number of variables6
Number of observations35
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory52.8 B

Variable types

Categorical2
Text3
Numeric1

Dataset

Description경기도_용인시_관공서현황(시군명,구분명,주소,우편번호,전화번호,데이터기준일자)
Author경기도 용인시
URLhttps://www.data.go.kr/data/15046176/fileData.do

Alerts

시군명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
구분명 has unique valuesUnique
주소 has unique valuesUnique
우편번호 has unique valuesUnique
전화번호안내 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:36:10.959609
Analysis finished2023-12-12 23:36:11.387059
Duration0.43 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
용인시
35 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row용인시
2nd row용인시
3rd row용인시
4th row용인시
5th row용인시

Common Values

ValueCountFrequency (%)
용인시 35
100.0%

Length

2023-12-13T08:36:11.438365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:36:11.510776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
용인시 35
100.0%

구분명
Text

UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-13T08:36:11.652037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.3428571
Min length3

Characters and Unicode

Total characters117
Distinct characters52
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row용인시청
2nd row처인구청
3rd row포곡읍
4th row모현읍
5th row남사면
ValueCountFrequency (%)
용인시청 1
 
2.9%
기흥동 1
 
2.9%
구성동 1
 
2.9%
마북동 1
 
2.9%
동백동 1
 
2.9%
상하동 1
 
2.9%
보정동 1
 
2.9%
수지구청 1
 
2.9%
서농동 1
 
2.9%
풍덕천1동 1
 
2.9%
Other values (25) 25
71.4%
2023-12-13T08:36:11.931623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
23.9%
5
 
4.3%
4
 
3.4%
4
 
3.4%
4
 
3.4%
2 3
 
2.6%
3
 
2.6%
1 3
 
2.6%
3
 
2.6%
3
 
2.6%
Other values (42) 57
48.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 111
94.9%
Decimal Number 6
 
5.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
28
25.2%
5
 
4.5%
4
 
3.6%
4
 
3.6%
4
 
3.6%
3
 
2.7%
3
 
2.7%
3
 
2.7%
3
 
2.7%
3
 
2.7%
Other values (40) 51
45.9%
Decimal Number
ValueCountFrequency (%)
2 3
50.0%
1 3
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 111
94.9%
Common 6
 
5.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
28
25.2%
5
 
4.5%
4
 
3.6%
4
 
3.6%
4
 
3.6%
3
 
2.7%
3
 
2.7%
3
 
2.7%
3
 
2.7%
3
 
2.7%
Other values (40) 51
45.9%
Common
ValueCountFrequency (%)
2 3
50.0%
1 3
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 111
94.9%
ASCII 6
 
5.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
28
25.2%
5
 
4.5%
4
 
3.6%
4
 
3.6%
4
 
3.6%
3
 
2.7%
3
 
2.7%
3
 
2.7%
3
 
2.7%
3
 
2.7%
Other values (40) 51
45.9%
ASCII
ValueCountFrequency (%)
2 3
50.0%
1 3
50.0%

주소
Text

UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-13T08:36:12.145123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length28
Mean length26.085714
Min length22

Characters and Unicode

Total characters913
Distinct characters98
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row경기도 용인시 처인구 중부대로 1199(삼가동)
2nd row경기도 용인시 처인구 금령로 50(김량장동)
3rd row경기도 용인시 처인구 포곡읍 포곡로 258(삼계리)
4th row경기도 용인시 처인구 모현면 독점로 31-6(갈담리)
5th row경기도 용인시 처인구 남사면 내기로 22(봉무리)
ValueCountFrequency (%)
경기도 35
19.1%
용인시 35
19.1%
처인구 13
 
7.1%
기흥구 12
 
6.6%
수지구 10
 
5.5%
포은대로 2
 
1.1%
중부대로 2
 
1.1%
만현로 1
 
0.5%
구성로77번길 1
 
0.5%
어정로 1
 
0.5%
Other values (71) 71
38.8%
2023-12-13T08:36:12.492044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
148
 
16.2%
49
 
5.4%
48
 
5.3%
39
 
4.3%
36
 
3.9%
( 35
 
3.8%
35
 
3.8%
) 35
 
3.8%
35
 
3.8%
35
 
3.8%
Other values (88) 418
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 581
63.6%
Space Separator 148
 
16.2%
Decimal Number 111
 
12.2%
Open Punctuation 35
 
3.8%
Close Punctuation 35
 
3.8%
Dash Punctuation 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
8.4%
48
 
8.3%
39
 
6.7%
36
 
6.2%
35
 
6.0%
35
 
6.0%
35
 
6.0%
35
 
6.0%
34
 
5.9%
17
 
2.9%
Other values (74) 218
37.5%
Decimal Number
ValueCountFrequency (%)
1 26
23.4%
5 14
12.6%
2 14
12.6%
3 11
9.9%
4 11
9.9%
7 9
 
8.1%
8 8
 
7.2%
0 7
 
6.3%
6 6
 
5.4%
9 5
 
4.5%
Space Separator
ValueCountFrequency (%)
148
100.0%
Open Punctuation
ValueCountFrequency (%)
( 35
100.0%
Close Punctuation
ValueCountFrequency (%)
) 35
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 581
63.6%
Common 332
36.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
8.4%
48
 
8.3%
39
 
6.7%
36
 
6.2%
35
 
6.0%
35
 
6.0%
35
 
6.0%
35
 
6.0%
34
 
5.9%
17
 
2.9%
Other values (74) 218
37.5%
Common
ValueCountFrequency (%)
148
44.6%
( 35
 
10.5%
) 35
 
10.5%
1 26
 
7.8%
5 14
 
4.2%
2 14
 
4.2%
3 11
 
3.3%
4 11
 
3.3%
7 9
 
2.7%
8 8
 
2.4%
Other values (4) 21
 
6.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 581
63.6%
ASCII 332
36.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
148
44.6%
( 35
 
10.5%
) 35
 
10.5%
1 26
 
7.8%
5 14
 
4.2%
2 14
 
4.2%
3 11
 
3.3%
4 11
 
3.3%
7 9
 
2.7%
8 8
 
2.4%
Other values (4) 21
 
6.3%
Hangul
ValueCountFrequency (%)
49
 
8.4%
48
 
8.3%
39
 
6.7%
36
 
6.2%
35
 
6.0%
35
 
6.0%
35
 
6.0%
35
 
6.0%
34
 
5.9%
17
 
2.9%
Other values (74) 218
37.5%

우편번호
Real number (ℝ)

UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16990.771
Minimum16825
Maximum17178
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size447.0 B
2023-12-13T08:36:12.619869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16825
5-th percentile16834.1
Q116904
median16994
Q317063.5
95-th percentile17161
Maximum17178
Range353
Interquartile range (IQR)159.5

Descriptive statistics

Standard deviation108.54874
Coefficient of variation (CV)0.006388688
Kurtosis-1.142509
Mean16990.771
Median Absolute Deviation (MAD)84
Skewness0.063220424
Sum594677
Variance11782.829
MonotonicityNot monotonic
2023-12-13T08:36:12.723029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
17019 1
 
2.9%
17049 1
 
2.9%
16917 1
 
2.9%
16910 1
 
2.9%
17007 1
 
2.9%
16994 1
 
2.9%
16898 1
 
2.9%
16835 1
 
2.9%
16832 1
 
2.9%
16844 1
 
2.9%
Other values (25) 25
71.4%
ValueCountFrequency (%)
16825 1
2.9%
16832 1
2.9%
16835 1
2.9%
16844 1
2.9%
16845 1
2.9%
16852 1
2.9%
16870 1
2.9%
16872 1
2.9%
16898 1
2.9%
16910 1
2.9%
ValueCountFrequency (%)
17178 1
2.9%
17168 1
2.9%
17158 1
2.9%
17144 1
2.9%
17136 1
2.9%
17118 1
2.9%
17108 1
2.9%
17085 1
2.9%
17072 1
2.9%
17055 1
2.9%

전화번호안내
Text

UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-13T08:36:12.929211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.914286
Min length9

Characters and Unicode

Total characters417
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row1577-1122
2nd row031-324-5023
3rd row031-324-5531
4th row031-324-5592
5th row031-324-5641
ValueCountFrequency (%)
1577-1122 1
 
2.9%
031-324-6678 1
 
2.9%
031-324-6711 1
 
2.9%
031-324-6732 1
 
2.9%
031-324-6893 1
 
2.9%
031-324-6795 1
 
2.9%
031-324-6772 1
 
2.9%
031-324-8022 1
 
2.9%
031-324-6681 1
 
2.9%
031-324-8602 1
 
2.9%
Other values (25) 25
71.4%
2023-12-13T08:36:13.330790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 80
19.2%
- 69
16.5%
2 55
13.2%
1 47
11.3%
4 41
9.8%
0 40
9.6%
6 25
 
6.0%
5 18
 
4.3%
8 17
 
4.1%
7 15
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 348
83.5%
Dash Punctuation 69
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 80
23.0%
2 55
15.8%
1 47
13.5%
4 41
11.8%
0 40
11.5%
6 25
 
7.2%
5 18
 
5.2%
8 17
 
4.9%
7 15
 
4.3%
9 10
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 69
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 417
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 80
19.2%
- 69
16.5%
2 55
13.2%
1 47
11.3%
4 41
9.8%
0 40
9.6%
6 25
 
6.0%
5 18
 
4.3%
8 17
 
4.1%
7 15
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 417
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 80
19.2%
- 69
16.5%
2 55
13.2%
1 47
11.3%
4 41
9.8%
0 40
9.6%
6 25
 
6.0%
5 18
 
4.3%
8 17
 
4.1%
7 15
 
3.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
2019-04-04
35 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019-04-04
2nd row2019-04-04
3rd row2019-04-04
4th row2019-04-04
5th row2019-04-04

Common Values

ValueCountFrequency (%)
2019-04-04 35
100.0%

Length

2023-12-13T08:36:13.492408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:36:13.591731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019-04-04 35
100.0%

Interactions

2023-12-13T08:36:11.125446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:36:13.653867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분명주소우편번호전화번호안내
구분명1.0001.0001.0001.000
주소1.0001.0001.0001.000
우편번호1.0001.0001.0001.000
전화번호안내1.0001.0001.0001.000

Missing values

2023-12-13T08:36:11.238314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:36:11.345324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명구분명주소우편번호전화번호안내데이터기준일자
0용인시용인시청경기도 용인시 처인구 중부대로 1199(삼가동)170191577-11222019-04-04
1용인시처인구청경기도 용인시 처인구 금령로 50(김량장동)17049031-324-50232019-04-04
2용인시포곡읍경기도 용인시 처인구 포곡읍 포곡로 258(삼계리)17028031-324-55312019-04-04
3용인시모현읍경기도 용인시 처인구 모현면 독점로 31-6(갈담리)17036031-324-55922019-04-04
4용인시남사면경기도 용인시 처인구 남사면 내기로 22(봉무리)17118031-324-56412019-04-04
5용인시이동읍경기도 용인시 처인구 이동면 경기동로 673(송전리)17136031-324-56912019-04-04
6용인시원삼면경기도 용인시 처인구 원삼면 원양로 64(고당리)17168031-324-57422019-04-04
7용인시백암면경기도 용인시 처인구 백암면 백암로 189(백암리)17178031-324-57922019-04-04
8용인시양지면경기도 용인시 처인구 양지면 양지로105번길 5(양지리)17158031-324-58422019-04-04
9용인시중앙동경기도 용인시 처인구 금령로78번길 7(김량장동)17051031-324-58942019-04-04
시군명구분명주소우편번호전화번호안내데이터기준일자
25용인시수지구청경기도 용인시 수지구 포은대로 435(풍덕천동)16835031-324-80222019-04-04
26용인시풍덕천1동경기도 용인시 수지구 수지로342번길 3(풍덕천동)16832031-324-86022019-04-04
27용인시풍덕천2동경기도 용인시 수지구 풍덕천로 51(풍덕천동)16844031-324-86332019-04-04
28용인시신봉동경기도 용인시 수지구 수지로 215(신봉동)16845031-324-86422019-04-04
29용인시죽전1동경기도 용인시 수지구 대지로15번길 50(죽전동)16872031-324-86622019-04-04
30용인시죽전2동경기도 용인시 수지구 포은대로 523(죽전동)16870031-324-86932019-04-04
31용인시동천동경기도 용인시 수지구 신수로783번길 40(동천동)16825031-324-87022019-04-04
32용인시상현1동경기도 용인시 수지구 상현로 71(상현동)16937031-324-87332019-04-04
33용인시상현2동경기도 용인시 수지구 만현로 48(상현동)16929031-324-87522019-04-04
34용인시성복동경기도 용인시 수지구 성복1로 100(성복동)16852031-324-87722019-04-04