Overview

Dataset statistics

Number of variables: 8
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 7.0 KiB
Average record size in memory: 71.3 B

Variable types

Numeric: 4
Categorical: 4

Alerts

Constant: base_year has constant value "2019"
Constant: yngbgs_dynmc_popltn_mvmn_time has constant value "1900"
Constant: wkday_nm has constant value "토요일" (Saturday)
High correlation: seq_no is highly overall correlated with lc_la and 2 other fields
High correlation: lc_la is highly overall correlated with seq_no and 2 other fields
High correlation: dynmc_popltn_co is highly overall correlated with seq_no and 1 other field
High correlation: era_nm is highly overall correlated with seq_no and 1 other field
Imbalance: era_nm is highly imbalanced (80.6%)
Unique: seq_no has unique values
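
The 80.6% imbalance figure for era_nm can be reproduced from the category counts reported in the era_nm section (97 and 3). The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum, log2 of the number of classes. That formula is an assumption, chosen because it matches the reported value; the function name imbalance_score is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H(p) / log2(k) for a list of class counts.

    0 means perfectly balanced; values near 1 mean one class dominates.
    Assumed formula, not taken from the report itself.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is not scored as imbalanced
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(k)

# era_nm counts from the report: 97 and 3 observations
score = imbalance_score([97, 3])
print(f"{score:.1%}")  # 80.6%
```

A 50/50 split gives a score of 0 under the same formula, so the 97/3 split sits close to the "one class only" end of the scale.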

Reproduction

Analysis started: 2023-12-10 10:07:01.880244
Analysis finished: 2023-12-10 10:07:06.041425
Duration: 4.16 seconds
Software version: ydata-profiling v4.5.1
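
The reported duration follows directly from the two timestamps. A minimal check using only the standard library; the format string is an assumption about how the timestamps are laid out:

```python
from datetime import datetime

# Assumed layout of the timestamps as printed in the report
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-10 10:07:01.880244", FMT)
finished = datetime.strptime("2023-12-10 10:07:06.041425", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 4.16 seconds
```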

Variables

seq_no
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 62.29
Minimum: 1
Maximum: 408
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.95
Q1: 27.75
Median: 53.5
Q3: 78.25
95-th percentile: 98.05
Maximum: 408
Range: 407
Interquartile range (IQR): 50.5

Descriptive statistics

Standard deviation: 67.134915
Coefficient of variation (CV): 1.07778
Kurtosis: 19.377914
Mean: 62.29
Median Absolute Deviation (MAD): 25.5
Skewness: 4.0937361
Sum: 6229
Variance: 4507.0969
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Value              Count  Frequency (%)
1                  1      1.0%
65                 1      1.0%
75                 1      1.0%
74                 1      1.0%
73                 1      1.0%
72                 1      1.0%
71                 1      1.0%
70                 1      1.0%
69                 1      1.0%
68                 1      1.0%
Other values (90)  90     90.0%

Value  Count  Frequency (%)
1      1      1.0%
3      1      1.0%
4      1      1.0%
5      1      1.0%
6      1      1.0%
7      1      1.0%
9      1      1.0%
10     1      1.0%
11     1      1.0%
12     1      1.0%

Value  Count  Frequency (%)
408    1      1.0%
407    1      1.0%
406    1      1.0%
100    1      1.0%
99     1      1.0%
98     1      1.0%
97     1      1.0%
96     1      1.0%
95     1      1.0%
94     1      1.0%
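
Several of the seq_no figures are derived from others reported for the same variable, which gives a quick consistency check. A minimal sketch using only the reported values; the variable names are illustrative:

```python
# Values copied from the seq_no statistics in the report
n = 100
mean = 62.29
std = 67.134915
minimum, maximum = 1, 408
q1, q3 = 27.75, 78.25

value_range = maximum - minimum   # 407
iqr = q3 - q1                     # 50.5
cv = std / mean                   # about 1.07778
variance = std ** 2               # about 4507.097
total = mean * n                  # about 6229

print(value_range, iqr, round(cv, 5), round(variance, 3), round(total, 2))
```

Each result agrees with the reported Range, IQR, CV, Variance and Sum to the precision shown in the report.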

base_year
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2019
2nd row: 2019
3rd row: 2019
4th row: 2019
5th row: 2019

Common Values

Value  Count  Frequency (%)
2019   100    100.0%

Length

Histogram of lengths of the category

era_nm
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 5
Median length: 5
Mean length: 4.97
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 청소년의달
2nd row: 여름방학
3rd row: 청소년의달
4th row: 청소년의달
5th row: 청소년의달

Common Values

Value                       Count  Frequency (%)
청소년의달 (Youth Month)      97     97.0%
여름방학 (summer vacation)    3      3.0%

Length

Histogram of lengths of the category
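
The mean length of 4.97 reported for era_nm is the count-weighted average of the two category string lengths. A short check, assuming lengths are counted in characters (Unicode code points):

```python
# Category counts for era_nm as reported: 97 and 3 observations
counts = {"청소년의달": 97, "여름방학": 3}

total = sum(counts.values())
mean_length = sum(len(value) * n for value, n in counts.items()) / total

print(len("청소년의달"), len("여름방학"))  # 5 4
print(mean_length)                         # 4.97
```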

yngbgs_dynmc_popltn_mvmn_time
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1900
2nd row: 1900
3rd row: 1900
4th row: 1900
5th row: 1900

Common Values

Value  Count  Frequency (%)
1900   100    100.0%

Length

Histogram of lengths of the category

wkday_nm
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 토요일
2nd row: 토요일
3rd row: 토요일
4th row: 토요일
5th row: 토요일

Common Values

Value               Count  Frequency (%)
토요일 (Saturday)    100    100.0%

Length

Histogram of lengths of the category

lc_la
Real number (ℝ)

HIGH CORRELATION 

Distinct: 99
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.518247
Minimum: 37.517025
Maximum: 37.521988
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 37.517025
5-th percentile: 37.517042
Q1: 37.517504
Median: 37.518382
Q3: 37.518852
95-th percentile: 37.51931
Maximum: 37.521988
Range: 0.00496294
Interquartile range (IQR): 0.001347685

Descriptive statistics

Standard deviation: 0.00087092659
Coefficient of variation (CV): 2.3213414 × 10^-5
Kurtosis: 2.2628792
Mean: 37.518247
Median Absolute Deviation (MAD): 0.000487775
Skewness: 0.85774228
Sum: 3751.8247
Variance: 7.5851313 × 10^-7
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Value              Count  Frequency (%)
37.51842306        2      2.0%
37.51702541        1      1.0%
37.5184117         1      1.0%
37.51884524        1      1.0%
37.51884238        1      1.0%
37.51883951        1      1.0%
37.51883664        1      1.0%
37.51883377        1      1.0%
37.5188309         1      1.0%
37.51882802        1      1.0%
Other values (89)  89     89.0%

Value        Count  Frequency (%)
37.51702541  1      1.0%
37.51703116  1      1.0%
37.51703403  1      1.0%
37.5170369   1      1.0%
37.51703976  1      1.0%
37.51704263  1      1.0%
37.51704834  1      1.0%
37.5170512   1      1.0%
37.51705405  1      1.0%
37.5170569   1      1.0%

Value        Count  Frequency (%)
37.52198835  1      1.0%
37.52067066  1      1.0%
37.5193187   1      1.0%
37.51931586  1      1.0%
37.51931301  1      1.0%
37.51931017  1      1.0%
37.51930732  1      1.0%
37.51930447  1      1.0%
37.51930161  1      1.0%
37.51929875  1      1.0%

lc_lo
Real number (ℝ)

Distinct: 99
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 126.90242
Minimum: 126.89791
Maximum: 126.90698
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 126.89791
5-th percentile: 126.89793
Q1: 126.90018
Median: 126.90245
Q3: 126.90471
95-th percentile: 126.90697
Maximum: 126.90698
Range: 0.0090704
Interquartile range (IQR): 0.004535175

Descriptive statistics

Standard deviation: 0.0027772931
Coefficient of variation (CV): 2.1885265 × 10^-5
Kurtosis: -1.210959
Mean: 126.90242
Median Absolute Deviation (MAD): 0.0022685
Skewness: 0.012357734
Sum: 12690.242
Variance: 7.7133567 × 10^-6
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Value              Count  Frequency (%)
126.9069703        2      2.0%
126.8979287        1      1.0%
126.9047072        1      1.0%
126.9013089        1      1.0%
126.9007431        1      1.0%
126.9001773        1      1.0%
126.8996116        1      1.0%
126.8990458        1      1.0%
126.89848          1      1.0%
126.8979142        1      1.0%
Other values (89)  89     89.0%

Value        Count  Frequency (%)
126.8979106  1      1.0%
126.8979142  1      1.0%
126.8979178  1      1.0%
126.8979215  1      1.0%
126.8979251  1      1.0%
126.8979287  1      1.0%
126.8984764  1      1.0%
126.89848    1      1.0%
126.8984836  1      1.0%
126.8984872  1      1.0%

Value        Count  Frequency (%)
126.906981   1      1.0%
126.9069774  1      1.0%
126.9069738  1      1.0%
126.9069703  2      2.0%
126.9069667  1      1.0%
126.9064152  1      1.0%
126.9064116  1      1.0%
126.9064081  1      1.0%
126.9064045  1      1.0%
126.9064009  1      1.0%

dynmc_popltn_co
Real number (ℝ)

HIGH CORRELATION 

Distinct: 94
Distinct (%): 94.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.5449
Minimum: 0.04
Maximum: 135.72
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 0.04
5-th percentile: 0.2985
Q1: 1.2
Median: 3.47
Q3: 15.6275
95-th percentile: 33.117
Maximum: 135.72
Range: 135.68
Interquartile range (IQR): 14.4275

Descriptive statistics

Standard deviation: 17.296238
Coefficient of variation (CV): 1.6402468
Kurtosis: 27.462673
Mean: 10.5449
Median Absolute Deviation (MAD): 3.03
Skewness: 4.3481668
Sum: 1054.49
Variance: 299.15985
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Value              Count  Frequency (%)
0.63               2      2.0%
0.38               2      2.0%
0.04               2      2.0%
1.99               2      2.0%
1.2                2      2.0%
0.4                2      2.0%
18.29              1      1.0%
4.71               1      1.0%
10.01              1      1.0%
8.51               1      1.0%
Other values (84)  84     84.0%

Value  Count  Frequency (%)
0.04   2      2.0%
0.17   1      1.0%
0.26   1      1.0%
0.27   1      1.0%
0.3    1      1.0%
0.31   1      1.0%
0.38   2      2.0%
0.4    2      2.0%
0.42   1      1.0%
0.46   1      1.0%

Value   Count  Frequency (%)
135.72  1      1.0%
53.89   1      1.0%
45.91   1      1.0%
45.73   1      1.0%
44.08   1      1.0%
32.54   1      1.0%
31.1    1      1.0%
30.04   1      1.0%
29.09   1      1.0%
26.48   1      1.0%
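
The 95-th percentile of 33.117 reported for dynmc_popltn_co can be recovered from the ten largest values alone. The sketch assumes linear interpolation between the two closest ranks, which is consistent with the reported figure; the helper name upper_percentile is hypothetical.

```python
# Ten largest dynmc_popltn_co values from the report, in descending order
top10_desc = [135.72, 53.89, 45.91, 45.73, 44.08, 32.54, 31.1, 30.04, 29.09, 26.48]
n = 100  # number of observations

def upper_percentile(top_desc, n, q):
    """Linearly interpolated q-quantile (q < 1), using only the largest values.

    Valid only when the quantile position falls inside the known top values.
    """
    pos = q * (n - 1)        # fractional 0-based rank in ascending order
    lower = int(pos)         # ascending index just below the position
    frac = pos - lower
    # Ascending index i corresponds to descending index n - 1 - i
    lo_val = top_desc[n - 1 - lower]
    hi_val = top_desc[n - 2 - lower]
    return lo_val + frac * (hi_val - lo_val)

p95 = upper_percentile(top10_desc, n, 0.95)
print(round(p95, 3))  # 33.117
```

The position 0.95 × 99 = 94.05 falls between the sixth and fifth largest values, 32.54 and 44.08. Applying the same function to the ten largest seq_no values (408, 407, 406, 100, 99, 98, ...) gives 98.05, matching that variable's reported 95-th percentile.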

Interactions

(16 interaction plots; images not included)

Correlations

                 seq_no  era_nm  lc_la  lc_lo  dynmc_popltn_co
seq_no           1.000   1.000   0.898  0.000  0.356
era_nm           1.000   1.000   0.747  0.000  0.000
lc_la            0.898   0.747   1.000  0.000  0.297
lc_lo            0.000   0.000   0.000  1.000  0.473
dynmc_popltn_co  0.356   0.000   0.297  0.473  1.000

                 seq_no  lc_la  lc_lo  dynmc_popltn_co  era_nm
seq_no           1.000   0.993  0.069  0.559            0.990
lc_la            0.993   1.000  0.049  0.563            0.788
lc_lo            0.069   0.049  1.000  0.370            0.000
dynmc_popltn_co  0.559   0.563  0.370  1.000            0.000
era_nm           0.990   0.788  0.000  0.000            1.000
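
The strong seq_no and lc_la association can be illustrated on the rows shown in the Sample section. The sketch below computes a Spearman rank correlation on the 18 sample rows with seq_no up to 100; using Spearman is an assumption for illustration, since this section does not state which coefficient the tables contain, and the helper names are hypothetical.

```python
def ranks(values):
    """Rank values from 1 to n (no tie handling; the inputs here are distinct)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result

def spearman(x, y):
    """Spearman rank correlation for tie-free data, via the sum of squared rank differences."""
    n = len(x)
    d2 = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1 - 6 * d2 / (n * (n * n - 1))

# seq_no and lc_la for the sample rows with seq_no <= 100
seq_no = [1, 3, 4, 5, 6, 7, 9, 10, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100]
lc_la = [37.517025, 37.517031, 37.517034, 37.517037, 37.51704, 37.517043,
         37.517048, 37.517051, 37.519293, 37.519296, 37.519299, 37.519302,
         37.519304, 37.519307, 37.51931, 37.519313, 37.519316, 37.519319]

print(spearman(seq_no, lc_la))  # 1.0
```

On this subset lc_la rises strictly with seq_no, so the rank correlation is exactly 1. The coefficients in the tables above are computed on all 100 observations, so they need not match this subset value.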

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

    seq_no  base_year  era_nm      yngbgs_dynmc_popltn_mvmn_time  wkday_nm  lc_la      lc_lo       dynmc_popltn_co
0   1       2019       청소년의달  1900                           토요일    37.517025  126.897929  2.87
1   406     2019       여름방학    1900                           토요일    37.520671  126.905821  9.26
2   3       2019       청소년의달  1900                           토요일    37.517031  126.89906   2.67
3   4       2019       청소년의달  1900                           토요일    37.517034  126.899626  1.9
4   5       2019       청소년의달  1900                           토요일    37.517037  126.900192  0.81
5   6       2019       청소년의달  1900                           토요일    37.51704   126.900757  0.04
6   7       2019       청소년의달  1900                           토요일    37.517043  126.901323  0.42
7   407     2019       여름방학    1900                           토요일    37.518423  126.90697   7.27
8   9       2019       청소년의달  1900                           토요일    37.517048  126.902455  0.75
9   10      2019       청소년의달  1900                           토요일    37.517051  126.903021  1.71

    seq_no  base_year  era_nm      yngbgs_dynmc_popltn_mvmn_time  wkday_nm  lc_la      lc_lo       dynmc_popltn_co
90  91      2019       청소년의달  1900                           토요일    37.519293  126.90074   32.54
91  92      2019       청소년의달  1900                           토요일    37.519296  126.901305  45.73
92  93      2019       청소년의달  1900                           토요일    37.519299  126.901871  30.04
93  94      2019       청소년의달  1900                           토요일    37.519302  126.902437  23.13
94  95      2019       청소년의달  1900                           토요일    37.519304  126.903003  11.8
95  96      2019       청소년의달  1900                           토요일    37.519307  126.903569  13.72
96  97      2019       청소년의달  1900                           토요일    37.51931   126.904134  53.89
97  98      2019       청소년의달  1900                           토요일    37.519313  126.9047    45.91
98  99      2019       청소년의달  1900                           토요일    37.519316  126.905266  135.72
99  100     2019       청소년의달  1900                           토요일    37.519319  126.905832  26.48