Overview

Dataset statistics

Number of variables: 4
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.1 KiB
Average record size in memory: 38.4 B

Variable types

Numeric: 2
Categorical: 1
Text: 1

Dataset

Description: Sample data
Author: 국토연구원 (Korea Research Institute for Human Settlements)
URL: https://bigdata-region.kr/#/dataset/07972e3e-38c3-4346-9352-7239f41b873d

Alerts

시군구코드 is highly overall correlated with 시도명 (High correlation)
시도명 is highly overall correlated with 시군구코드 (High correlation)
관리번호 has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 14:22:08.204040
Analysis finished: 2023-12-10 14:22:09.558265
Duration: 1.35 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

관리번호
Real number (ℝ)

UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.0100088 × 10^18
Minimum: 1.0100008 × 10^18
Maximum: 1.01001 × 10^18
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 1.0100008 × 10^18
5-th percentile: 1.0100009 × 10^18
Q1: 1.01001 × 10^18
Median: 1.01001 × 10^18
Q3: 1.01001 × 10^18
95-th percentile: 1.01001 × 10^18
Maximum: 1.01001 × 10^18
Range: 9.1930029 × 10^12
Interquartile range (IQR): 1.5752501 × 10^10

Descriptive statistics

Standard deviation: 3.1639698 × 10^12
Coefficient of variation (CV): 3.1326161 × 10^-6
Kurtosis: 3.3861521
Mean: 1.0100088 × 10^18
Median Absolute Deviation (MAD): 8.0014818 × 10^9
Skewness: -2.2725241
Sum: -6.5932243 × 10^18 (negative although every value is positive; 30 × mean ≈ 3.03 × 10^19 exceeds the signed 64-bit maximum of about 9.22 × 10^18, so this figure is most likely an integer overflow)
Variance: 1.0010705 × 10^25
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=30)
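The negative Sum reported for 관리번호 can be checked with plain integer arithmetic. The sketch below assumes (the report does not state this) that the sum was accumulated in a signed 64-bit integer that wraps on overflow. `wrap_int64` is a hypothetical helper written for this illustration, and the true sum is only approximated as 30 × the reported mean.

```python
# Figures copied from the report for the variable 관리번호.
N_OBSERVATIONS = 30
REPORTED_MEAN = 1.0100088e18
REPORTED_SUM = -6.5932243e18

INT64_MAX = 2**63 - 1


def wrap_int64(value: int) -> int:
    """Reduce an arbitrary Python int to the two's-complement int64 range.

    Hypothetical helper: models a silently wrapping 64-bit accumulator.
    """
    return (value + 2**63) % 2**64 - 2**63


# Approximate the true sum from the rounded mean (an assumption; the
# individual values are not all listed in the report).
true_sum_estimate = round(REPORTED_MEAN * N_OBSERVATIONS)
wrapped = wrap_int64(true_sum_estimate)

print("estimated true sum :", f"{true_sum_estimate:.7e}")
print("exceeds int64 max  :", true_sum_estimate > INT64_MAX)
print("wrapped to int64   :", f"{wrapped:.7e}")
print("reported sum       :", f"{REPORTED_SUM:.7e}")
```

Under these assumptions the wrapped value agrees with the reported Sum to within the rounding of the mean, which suggests the identifier column is better treated as text than as a number.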
Common values
Value | Count | Frequency (%)
1010000840000001425 | 1 | 3.3%
1010010012003800711 | 1 | 3.3%
1010010033002901926 | 1 | 3.3%
1010010028001100264 | 1 | 3.3%
1010010025001400016 | 1 | 3.3%
1010010023002100260 | 1 | 3.3%
1010010023000802361 | 1 | 3.3%
1010010022004202321 | 1 | 3.3%
1010010020006000245 | 1 | 3.3%
1010010020006000243 | 1 | 3.3%
Other values (20) | 20 | 66.7%

Minimum 10 values
Value | Count | Frequency (%)
1010000840000001425 | 1 | 3.3%
1010000840000001631 | 1 | 3.3%
1010000884000001493 | 1 | 3.3%
1010000888000001742 | 1 | 3.3%
1010010001009601252 | 1 | 3.3%
1010010003000200566 | 1 | 3.3%
1010010004000106859 | 1 | 3.3%
1010010004002831320 | 1 | 3.3%
1010010005000201468 | 1 | 3.3%
1010010005000201470 | 1 | 3.3%

Maximum 10 values
Value | Count | Frequency (%)
1010010033002901926 | 1 | 3.3%
1010010028001100264 | 1 | 3.3%
1010010025001400016 | 1 | 3.3%
1010010023002100260 | 1 | 3.3%
1010010023000802361 | 1 | 3.3%
1010010022004202321 | 1 | 3.3%
1010010020006000245 | 1 | 3.3%
1010010020006000243 | 1 | 3.3%
1010010020000700330 | 1 | 3.3%
1010010018000318880 | 1 | 3.3%

시도명
Categorical

HIGH CORRELATION 

Distinct: 10
Distinct (%): 33.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
강원도
경기도
대구광역시
부산광역시
경상북도
Other values (5)

Length

Max length: 5
Median length: 4.5
Mean length: 3.8333333
Min length: 3

Unique

Unique: 5
Unique (%): 16.7%

Sample

1st row: 강원도
2nd row: 강원도
3rd row: 강원도
4th row: 강원도
5th row: 부산광역시

Common Values

Value | Count | Frequency (%)
강원도 | 8 | 26.7%
경기도 | 7 | 23.3%
대구광역시 | 5 | 16.7%
부산광역시 | 3 | 10.0%
경상북도 | 2 | 6.7%
전라북도 | 1 | 3.3%
충청남도 | 1 | 3.3%
대전광역시 | 1 | 3.3%
인천광역시 | 1 | 3.3%
경상남도 | 1 | 3.3%
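The Distinct, Unique and Mean length figures for 시도명 can be recomputed from the Common Values table. The sketch below assumes that "Distinct" counts different values and "Unique" counts values that occur exactly once; the dictionary name `PROVINCE_COUNTS` is introduced here only for illustration.

```python
# Value -> count, copied from the Common Values table for 시도명.
PROVINCE_COUNTS = {
    "강원도": 8,
    "경기도": 7,
    "대구광역시": 5,
    "부산광역시": 3,
    "경상북도": 2,
    "전라북도": 1,
    "충청남도": 1,
    "대전광역시": 1,
    "인천광역시": 1,
    "경상남도": 1,
}

n_rows = sum(PROVINCE_COUNTS.values())

# Distinct: how many different values appear at all.
n_distinct = len(PROVINCE_COUNTS)

# Unique (assumed definition): values that appear exactly once.
n_unique = sum(1 for count in PROVINCE_COUNTS.values() if count == 1)

# Mean string length, weighted by how often each value occurs.
mean_length = sum(len(name) * count for name, count in PROVINCE_COUNTS.items()) / n_rows

print(f"rows={n_rows}")
print(f"distinct={n_distinct} ({100 * n_distinct / n_rows:.1f}%)")
print(f"unique={n_unique} ({100 * n_unique / n_rows:.1f}%)")
print(f"mean length={mean_length:.7f}")
```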

Length

Histogram of lengths of the category

시군구명
Text

Distinct: 17
Distinct (%): 56.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 4
Median length: 3
Mean length: 2.9333333
Min length: 2

Characters and Unicode

Total characters: 88
Distinct characters: 28
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
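As a small illustration of those character properties, the sketch below looks up the general category and character name for the characters of two values taken from this variable's sample rows, using Python's standard `unicodedata` module. The module does not expose script or block directly, so the script is inferred here from the character name and the block from the code point range; both are simplifications made for this example.

```python
import unicodedata

# Two values taken from the sample rows of this variable.
SAMPLE_VALUES = ["동해시", "연제구"]

# Hangul Syllables block: assigned code points U+AC00 through U+D7A3.
HANGUL_SYLLABLES = range(0xAC00, 0xD7A4)


def describe(char: str) -> dict:
    """Collect a few Unicode properties for a single character."""
    name = unicodedata.name(char)
    return {
        "category": unicodedata.category(char),  # "Lo" means Other Letter
        "name": name,
        # Simplification: treat the first word of the name as the script.
        "script_guess": name.split()[0].capitalize(),
        "in_hangul_syllables": ord(char) in HANGUL_SYLLABLES,
    }


properties = {ch: describe(ch) for value in SAMPLE_VALUES for ch in value}

for ch, props in properties.items():
    print(ch, props["category"], props["script_guess"], props["name"])
```

Every character in these two values falls in the category `Lo`, which corresponds to the "Other Letter" category listed in the report.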

Unique

Unique: 11
Unique (%): 36.7%

Sample

1st row: 동해시
2nd row: 동해시
3rd row: 동해시
4th row: 동해시
5th row: 연제구
Value | Count | Frequency (%)
서구 | 5 | 16.7%
동해시 | 4 | 13.3%
광주시 | 3 | 10.0%
강릉시 | 3 | 10.0%
남양주시 | 2 | 6.7%
연제구 | 2 | 6.7%
태백시 | 1 | 3.3%
의정부시 | 1 | 3.3%
해운대구 | 1 | 3.3%
동구 | 1 | 3.3%
Other values (7) | 7 | 23.3%

Most occurring characters

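Value | Count | Frequency (%)
(not preserved) | 20 | 22.7%
(not preserved) | 10 | 11.4%
(not preserved) | 8 | 9.1%
(not preserved) | 5 | 5.7%
(not preserved) | 5 | 5.7%
(not preserved) | 5 | 5.7%
(not preserved) | 3 | 3.4%
(not preserved) | 3 | 3.4%
(not preserved) | 3 | 3.4%
(not preserved) | 3 | 3.4%
Other values (18) | 23 | 26.1%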

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 88 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 88 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 88 | 100.0%

시군구코드
Real number (ℝ)

HIGH CORRELATION 

Distinct: 17
Distinct (%): 56.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37767
Minimum: 26350
Maximum: 48220
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 26350
5-th percentile: 26470
Q1: 27412.5
Median: 41610
Q3: 42170
95-th percentile: 47174
Maximum: 48220
Range: 21870
Interquartile range (IQR): 14757.5

Descriptive statistics

Standard deviation: 7723.4266
Coefficient of variation (CV): 0.20450199
Kurtosis: -1.4166133
Mean: 37767
Median Absolute Deviation (MAD): 1610
Skewness: -0.57113941
Sum: 1133010
Variance: 59651318
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=17)
Common values
Value | Count | Frequency (%)
27170 | 5 | 16.7%
42170 | 4 | 13.3%
42150 | 3 | 10.0%
41610 | 3 | 10.0%
26470 | 2 | 6.7%
41360 | 2 | 6.7%
47210 | 1 | 3.3%
41480 | 1 | 3.3%
45190 | 1 | 3.3%
47130 | 1 | 3.3%
Other values (7) | 7 | 23.3%

Minimum 10 values
Value | Count | Frequency (%)
26350 | 1 | 3.3%
26470 | 2 | 6.7%
27170 | 5 | 16.7%
28140 | 1 | 3.3%
30230 | 1 | 3.3%
41150 | 1 | 3.3%
41360 | 2 | 6.7%
41480 | 1 | 3.3%
41610 | 3 | 10.0%
42150 | 3 | 10.0%

Maximum 10 values
Value | Count | Frequency (%)
48220 | 1 | 3.3%
47210 | 1 | 3.3%
47130 | 1 | 3.3%
45190 | 1 | 3.3%
44250 | 1 | 3.3%
42190 | 1 | 3.3%
42170 | 4 | 13.3%
42150 | 3 | 10.0%
41610 | 3 | 10.0%
41480 | 1 | 3.3%
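Because the ten smallest and ten largest values together cover all 17 distinct codes, the quantile and descriptive statistics for 시군구코드 can be recomputed from the frequency tables alone. The sketch below is one way to do that with the Python standard library. It assumes linear interpolation between order statistics for percentiles and the sample (n − 1) variance; `CODE_COUNTS` and `percentile` are names introduced here for illustration, not part of the profiling tool.

```python
from statistics import mean, median, variance

# Value -> count, merged from the two extreme-value tables for 시군구코드.
CODE_COUNTS = {
    26350: 1, 26470: 2, 27170: 5, 28140: 1, 30230: 1, 41150: 1,
    41360: 2, 41480: 1, 41610: 3, 42150: 3, 42170: 4, 42190: 1,
    44250: 1, 45190: 1, 47130: 1, 47210: 1, 48220: 1,
}

# Expand the frequency table back into the 30 sorted observations.
values = sorted(v for v, count in CODE_COUNTS.items() for _ in range(count))


def percentile(sorted_values, p):
    """Percentile by linear interpolation between neighbouring order statistics."""
    position = (len(sorted_values) - 1) * p
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)
med = median(values)

# Median absolute deviation: median distance from the median.
mad = median(abs(v - med) for v in values)

print("n        :", len(values))
print("sum      :", sum(values))
print("mean     :", mean(values))
print("Q1 / Q3  :", q1, q3)
print("IQR      :", q3 - q1)
print("95th pct :", round(p95, 1))
print("MAD      :", mad)
print("variance :", round(variance(values)))
```

Comparing the printed figures with the Quantile statistics and Descriptive statistics listed for this variable is a quick consistency check on the flattened tables.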

Correlations

Variable | 관리번호 | 시도명 | 시군구명 | 시군구코드
관리번호 | 1.000 | 0.359 | 1.000 | 0.644
시도명 | 0.359 | 1.000 | 1.000 | 1.000
시군구명 | 1.000 | 1.000 | 1.000 | 1.000
시군구코드 | 0.644 | 1.000 | 1.000 | 1.000

Variable | 관리번호 | 시군구코드 | 시도명
관리번호 | 1.000 | -0.334 | 0.342
시군구코드 | -0.334 | 1.000 | 0.913
시도명 | 0.342 | 0.913 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Row | 관리번호 | 시도명 | 시군구명 | 시군구코드
0 | 1010000840000001425 | 강원도 | 동해시 | 42170
1 | 1010000840000001631 | 강원도 | 동해시 | 42170
2 | 1010000884000001493 | 강원도 | 동해시 | 42170
3 | 1010000888000001742 | 강원도 | 동해시 | 42170
4 | 1010010001009601252 | 부산광역시 | 연제구 | 26470
5 | 1010010003000200566 | 경상북도 | 영주시 | 47210
6 | 1010010004000106859 | 경기도 | 파주시 | 41480
7 | 1010010004002831320 | 전라북도 | 남원시 | 45190
8 | 1010010005000201468 | 강원도 | 강릉시 | 42150
9 | 1010010005000201470 | 강원도 | 강릉시 | 42150

Last rows
Row | 관리번호 | 시도명 | 시군구명 | 시군구코드
20 | 1010010018000318880 | 인천광역시 | 동구 | 28140
21 | 1010010020000700330 | 강원도 | 태백시 | 42190
22 | 1010010020006000243 | 경기도 | 광주시 | 41610
23 | 1010010020006000245 | 경기도 | 광주시 | 41610
24 | 1010010022004202321 | 부산광역시 | 해운대구 | 26350
25 | 1010010023000802361 | 부산광역시 | 연제구 | 26470
26 | 1010010023002100260 | 경기도 | 남양주시 | 41360
27 | 1010010025001400016 | 경기도 | 의정부시 | 41150
28 | 1010010028001100264 | 경기도 | 광주시 | 41610
29 | 1010010033002901926 | 경상남도 | 통영시 | 48220