Overview

Dataset statistics

Number of variables8
Number of observations400
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory27.1 KiB
Average record size in memory69.3 B

Variable types

DateTime1
Categorical5
Numeric2

Alerts

ETL일시 has constant value ""Constant
기준년월일 has constant value ""Constant
행정동코드 has constant value ""Constant
ELT날짜 has constant value ""Constant
24시간대구분코드 is highly overall correlated with 인구수High correlation
인구수 is highly overall correlated with 24시간대구분코드High correlation
24시간대구분코드 has 28 (7.0%) zerosZeros

Reproduction

Analysis started2023-12-10 06:39:28.271344
Analysis finished2023-12-10 06:39:29.525410
Duration1.25 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

ETL일시
Date

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
Minimum2020-02-10 00:12:32
Maximum2020-02-10 00:12:32
2023-12-10T15:39:29.610636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:29.806634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

기준년월일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
20200201
400 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20200201
2nd row20200201
3rd row20200201
4th row20200201
5th row20200201

Common Values

ValueCountFrequency (%)
20200201 400
100.0%

Length

2023-12-10T15:39:29.997672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:39:30.220635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20200201 400
100.0%

24시간대구분코드
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct16
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.0725
Minimum0
Maximum15
Zeros28
Zeros (%)7.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:39:30.379678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q13
median7
Q311
95-th percentile14
Maximum15
Range15
Interquartile range (IQR)8

Descriptive statistics

Standard deviation4.3892377
Coefficient of variation (CV)0.62060624
Kurtosis-1.2240289
Mean7.0725
Median Absolute Deviation (MAD)4
Skewness-0.015163499
Sum2829
Variance19.265407
MonotonicityIncreasing
2023-12-10T15:39:30.588388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
0 28
 
7.0%
13 28
 
7.0%
9 28
 
7.0%
2 27
 
6.8%
14 27
 
6.8%
10 27
 
6.8%
12 27
 
6.8%
3 26
 
6.5%
4 26
 
6.5%
6 26
 
6.5%
Other values (6) 130
32.5%
ValueCountFrequency (%)
0 28
7.0%
1 26
6.5%
2 27
6.8%
3 26
6.5%
4 26
6.5%
5 25
6.2%
6 26
6.5%
7 26
6.5%
8 25
6.2%
9 28
7.0%
ValueCountFrequency (%)
15 2
 
0.5%
14 27
6.8%
13 28
7.0%
12 27
6.8%
11 26
6.5%
10 27
6.8%
9 28
7.0%
8 25
6.2%
7 26
6.5%
6 26
6.5%
Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
F
200 
M
200 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowF
2nd rowF
3rd rowF
4th rowF
5th rowF

Common Values

ValueCountFrequency (%)
F 200
50.0%
M 200
50.0%

Length

2023-12-10T15:39:30.821119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:39:30.999227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
f 200
50.0%
m 200
50.0%
Distinct14
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
age_15
30 
age_20
30 
age_25
30 
age_30
30 
age_35
30 
Other values (9)
250 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowage_00
2nd rowage_10
3rd rowage_15
4th rowage_20
5th rowage_25

Common Values

ValueCountFrequency (%)
age_15 30
 
7.5%
age_20 30
 
7.5%
age_25 30
 
7.5%
age_30 30
 
7.5%
age_35 30
 
7.5%
age_40 30
 
7.5%
age_45 30
 
7.5%
age_50 30
 
7.5%
age_55 30
 
7.5%
age_60 30
 
7.5%
Other values (4) 100
25.0%

Length

2023-12-10T15:39:31.183513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
age_15 30
 
7.5%
age_20 30
 
7.5%
age_25 30
 
7.5%
age_30 30
 
7.5%
age_35 30
 
7.5%
age_40 30
 
7.5%
age_45 30
 
7.5%
age_50 30
 
7.5%
age_55 30
 
7.5%
age_60 30
 
7.5%
Other values (4) 100
25.0%

행정동코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
11110560
400 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row11110560
2nd row11110560
3rd row11110560
4th row11110560
5th row11110560

Common Values

ValueCountFrequency (%)
11110560 400
100.0%

Length

2023-12-10T15:39:31.373472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:39:31.514995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
11110560 400
100.0%

인구수
Real number (ℝ)

HIGH CORRELATION 

Distinct72
Distinct (%)18.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.5625
Minimum1
Maximum110
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:39:31.709560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q16
median11.5
Q321
95-th percentile59.15
Maximum110
Range109
Interquartile range (IQR)15

Descriptive statistics

Standard deviation18.745388
Coefficient of variation (CV)1.067353
Kurtosis5.925484
Mean17.5625
Median Absolute Deviation (MAD)6.5
Skewness2.3062844
Sum7025
Variance351.38957
MonotonicityNot monotonic
2023-12-10T15:39:31.957048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5 25
 
6.2%
7 25
 
6.2%
6 24
 
6.0%
1 21
 
5.2%
4 19
 
4.8%
10 18
 
4.5%
3 17
 
4.2%
9 17
 
4.2%
8 13
 
3.2%
14 12
 
3.0%
Other values (62) 209
52.2%
ValueCountFrequency (%)
1 21
5.2%
2 12
3.0%
3 17
4.2%
4 19
4.8%
5 25
6.2%
6 24
6.0%
7 25
6.2%
8 13
3.2%
9 17
4.2%
10 18
4.5%
ValueCountFrequency (%)
110 1
0.2%
99 1
0.2%
96 1
0.2%
95 1
0.2%
93 2
0.5%
89 1
0.2%
83 1
0.2%
81 1
0.2%
79 1
0.2%
78 1
0.2%

ELT날짜
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
20200201
400 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20200201
2nd row20200201
3rd row20200201
4th row20200201
5th row20200201

Common Values

ValueCountFrequency (%)
20200201 400
100.0%

Length

2023-12-10T15:39:32.169653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:39:32.349836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20200201 400
100.0%

Interactions

2023-12-10T15:39:28.871660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:28.576566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:29.023553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:28.726539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:39:32.467854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
24시간대구분코드성별구분코드연령대구분코드인구수
24시간대구분코드1.0000.0000.0000.631
성별구분코드0.0001.0000.0000.109
연령대구분코드0.0000.0001.0000.405
인구수0.6310.1090.4051.000
2023-12-10T15:39:32.626358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연령대구분코드성별구분코드
연령대구분코드1.0000.000
성별구분코드0.0001.000
2023-12-10T15:39:33.109168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
24시간대구분코드인구수성별구분코드연령대구분코드
24시간대구분코드1.0000.5120.0000.000
인구수0.5121.0000.0830.175
성별구분코드0.0000.0831.0000.000
연령대구분코드0.0000.1750.0001.000

Missing values

2023-12-10T15:39:29.224882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:39:29.442450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

ETL일시기준년월일24시간대구분코드성별구분코드연령대구분코드행정동코드인구수ELT날짜
02020-02-10 00:12:32.0202002010Fage_0011110560620200201
12020-02-10 00:12:32.0202002010Fage_10111105601220200201
22020-02-10 00:12:32.0202002010Fage_15111105601720200201
32020-02-10 00:12:32.0202002010Fage_20111105603420200201
42020-02-10 00:12:32.0202002010Fage_25111105601820200201
52020-02-10 00:12:32.0202002010Fage_30111105601520200201
62020-02-10 00:12:32.0202002010Fage_35111105602420200201
72020-02-10 00:12:32.0202002010Fage_40111105602020200201
82020-02-10 00:12:32.0202002010Fage_45111105602920200201
92020-02-10 00:12:32.0202002010Fage_50111105603020200201
ETL일시기준년월일24시간대구분코드성별구분코드연령대구분코드행정동코드인구수ELT날짜
3902020-02-10 00:12:32.02020020114Mage_35111105602520200201
3912020-02-10 00:12:32.02020020114Mage_40111105604320200201
3922020-02-10 00:12:32.02020020114Mage_45111105605820200201
3932020-02-10 00:12:32.02020020114Mage_50111105605920200201
3942020-02-10 00:12:32.02020020114Mage_55111105607520200201
3952020-02-10 00:12:32.02020020114Mage_60111105606420200201
3962020-02-10 00:12:32.02020020114Mage_65111105602620200201
3972020-02-10 00:12:32.02020020114Mage_70111105602520200201
3982020-02-10 00:12:32.02020020115Fage_0011110560220200201
3992020-02-10 00:12:32.02020020115Fage_10111105601020200201