Overview

Dataset statistics

Number of variables4
Number of observations30
Missing cells21
Missing cells (%)17.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 KiB
Average record size in memory38.4 B

Variable types

Categorical2
Text1
Numeric1

Dataset

Description샘플 데이터
Author경기콘텐츠진흥원
URLhttps://www.bigdata-region.kr/#/dataset/0399e099-6702-4da6-b59f-f7b3a7c37f11

Alerts

시도명 has constant value ""Constant
기준년도 has constant value ""Constant
사업체현황지수(현황) has 21 (70.0%) missing valuesMissing
시군구명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 14:01:11.659270
Analysis finished2023-12-10 14:01:12.234508
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
경기도
30 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도
2nd row경기도
3rd row경기도
4th row경기도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 30
100.0%

Length

2023-12-10T23:01:12.369285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:01:12.553684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 30
100.0%

시군구명
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:01:12.810332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length5
Min length3

Characters and Unicode

Total characters150
Distinct characters50
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row광주시
2nd row고양시 덕양구
3rd row안성시
4th row부천시 소사구
5th row양주시
ValueCountFrequency (%)
수원시 4
 
9.1%
고양시 3
 
6.8%
안양시 2
 
4.5%
용인시 2
 
4.5%
광주시 1
 
2.3%
상록구 1
 
2.3%
화성시 1
 
2.3%
일산서구 1
 
2.3%
영통구 1
 
2.3%
시흥시 1
 
2.3%
Other values (27) 27
61.4%
2023-12-10T23:01:13.434907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29
19.3%
14
 
9.3%
14
 
9.3%
9
 
6.0%
7
 
4.7%
5
 
3.3%
5
 
3.3%
5
 
3.3%
4
 
2.7%
3
 
2.0%
Other values (40) 55
36.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 136
90.7%
Space Separator 14
 
9.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
21.3%
14
 
10.3%
9
 
6.6%
7
 
5.1%
5
 
3.7%
5
 
3.7%
5
 
3.7%
4
 
2.9%
3
 
2.2%
3
 
2.2%
Other values (39) 52
38.2%
Space Separator
ValueCountFrequency (%)
14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 136
90.7%
Common 14
 
9.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
21.3%
14
 
10.3%
9
 
6.6%
7
 
5.1%
5
 
3.7%
5
 
3.7%
5
 
3.7%
4
 
2.9%
3
 
2.2%
3
 
2.2%
Other values (39) 52
38.2%
Common
ValueCountFrequency (%)
14
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 136
90.7%
ASCII 14
 
9.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
29
21.3%
14
 
10.3%
9
 
6.6%
7
 
5.1%
5
 
3.7%
5
 
3.7%
5
 
3.7%
4
 
2.9%
3
 
2.2%
3
 
2.2%
Other values (39) 52
38.2%
ASCII
ValueCountFrequency (%)
14
100.0%

기준년도
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2015
30 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2015
2nd row2015
3rd row2015
4th row2015
5th row2015

Common Values

ValueCountFrequency (%)
2015 30
100.0%

Length

2023-12-10T23:01:13.662746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:01:13.811048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2015 30
100.0%

사업체현황지수(현황)
Real number (ℝ)

MISSING 

Distinct9
Distinct (%)100.0%
Missing21
Missing (%)70.0%
Infinite0
Infinite (%)0.0%
Mean474.88889
Minimum230
Maximum965
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:01:13.959999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum230
5-th percentile251.2
Q1310
median362
Q3454
95-th percentile952.6
Maximum965
Range735
Interquartile range (IQR)144

Descriptive statistics

Standard deviation277.48939
Coefficient of variation (CV)0.58432487
Kurtosis0.33480663
Mean474.88889
Median Absolute Deviation (MAD)79
Skewness1.3747656
Sum4274
Variance77000.361
MonotonicityNot monotonic
2023-12-10T23:01:14.131705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
310 1
 
3.3%
965 1
 
3.3%
230 1
 
3.3%
418 1
 
3.3%
318 1
 
3.3%
934 1
 
3.3%
362 1
 
3.3%
454 1
 
3.3%
283 1
 
3.3%
(Missing) 21
70.0%
ValueCountFrequency (%)
230 1
3.3%
283 1
3.3%
310 1
3.3%
318 1
3.3%
362 1
3.3%
418 1
3.3%
454 1
3.3%
934 1
3.3%
965 1
3.3%
ValueCountFrequency (%)
965 1
3.3%
934 1
3.3%
454 1
3.3%
418 1
3.3%
362 1
3.3%
318 1
3.3%
310 1
3.3%
283 1
3.3%
230 1
3.3%

Interactions

2023-12-10T23:01:11.791625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:01:14.266596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구명사업체현황지수(현황)
시군구명1.0001.000
사업체현황지수(현황)1.0001.000

Missing values

2023-12-10T23:01:12.023798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:01:12.171134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명기준년도사업체현황지수(현황)
0경기도광주시2015310
1경기도고양시 덕양구2015<NA>
2경기도안성시2015965
3경기도부천시 소사구2015<NA>
4경기도양주시2015<NA>
5경기도양평군2015230
6경기도용인시 수지구2015<NA>
7경기도김포시2015<NA>
8경기도남양주시2015<NA>
9경기도수원시 팔달구2015<NA>
시도명시군구명기준년도사업체현황지수(현황)
20경기도시흥시2015<NA>
21경기도안양시 만안구2015<NA>
22경기도연천군2015362
23경기도수원시 권선구2015<NA>
24경기도안양시 동안구2015454
25경기도용인시 기흥구2015<NA>
26경기도의왕시2015<NA>
27경기도파주시2015<NA>
28경기도하남시2015283
29경기도광명시2015<NA>