Overview

Dataset statistics

Number of variables6
Number of observations200
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.5 KiB
Average record size in memory53.7 B

Variable types

Numeric3
Categorical3

Dataset

DescriptionSample
Author(재)인천테크노파크
URLhttps://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=ICTVHCLCMGHIS0000001

Alerts

시설물ID has constant value ""Constant
출입일련번호 is highly overall correlated with 출입일자High correlation
출입일자 is highly overall correlated with 출입일련번호High correlation
출입일련번호 has unique valuesUnique
출입시간 has 4 (2.0%) zerosZeros
출입분 has 4 (2.0%) zerosZeros

Reproduction

Analysis started2023-12-10 06:55:34.360515
Analysis finished2023-12-10 06:55:35.384155
Duration1.02 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

출입일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct200
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean551.185
Minimum166
Maximum1035
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:55:35.462696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum166
5-th percentile182.95
Q1317.5
median540
Q3752.25
95-th percentile1009.15
Maximum1035
Range869
Interquartile range (IQR)434.75

Descriptive statistics

Standard deviation259.1008
Coefficient of variation (CV)0.47007956
Kurtosis-1.0718023
Mean551.185
Median Absolute Deviation (MAD)216.5
Skewness0.24201944
Sum110237
Variance67133.227
MonotonicityNot monotonic
2023-12-10T15:55:35.626464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
166 1
 
0.5%
759 1
 
0.5%
636 1
 
0.5%
666 1
 
0.5%
667 1
 
0.5%
676 1
 
0.5%
713 1
 
0.5%
715 1
 
0.5%
723 1
 
0.5%
749 1
 
0.5%
Other values (190) 190
95.0%
ValueCountFrequency (%)
166 1
0.5%
168 1
0.5%
170 1
0.5%
171 1
0.5%
172 1
0.5%
178 1
0.5%
179 1
0.5%
180 1
0.5%
181 1
0.5%
182 1
0.5%
ValueCountFrequency (%)
1035 1
0.5%
1029 1
0.5%
1023 1
0.5%
1021 1
0.5%
1019 1
0.5%
1018 1
0.5%
1017 1
0.5%
1015 1
0.5%
1014 1
0.5%
1012 1
0.5%

시설물ID
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
SDTRPIS00100002
200 

Length

Max length15
Median length15
Mean length15
Min length15

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSDTRPIS00100002
2nd rowSDTRPIS00100002
3rd rowSDTRPIS00100002
4th rowSDTRPIS00100002
5th rowSDTRPIS00100002

Common Values

ValueCountFrequency (%)
SDTRPIS00100002 200
100.0%

Length

2023-12-10T15:55:35.768609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:55:35.880011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
sdtrpis00100002 200
100.0%
Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
0
104 
1
96 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row1

Common Values

ValueCountFrequency (%)
0 104
52.0%
1 96
48.0%

Length

2023-12-10T15:55:35.980861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:55:36.097377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 104
52.0%
1 96
48.0%

출입일자
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
20161119
85 
20161120
64 
20161118
46 
20161121
 
5

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20161118
2nd row20161118
3rd row20161118
4th row20161118
5th row20161118

Common Values

ValueCountFrequency (%)
20161119 85
42.5%
20161120 64
32.0%
20161118 46
23.0%
20161121 5
 
2.5%

Length

2023-12-10T15:55:36.194752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:55:36.288343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20161119 85
42.5%
20161120 64
32.0%
20161118 46
23.0%
20161121 5
 
2.5%

출입시간
Real number (ℝ)

ZEROS 

Distinct19
Distinct (%)9.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.09
Minimum0
Maximum23
Zeros4
Zeros (%)2.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:55:36.399784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q113
median18
Q320
95-th percentile22
Maximum23
Range23
Interquartile range (IQR)7

Descriptive statistics

Standard deviation5.3802075
Coefficient of variation (CV)0.33438207
Kurtosis1.8869801
Mean16.09
Median Absolute Deviation (MAD)3
Skewness-1.4191799
Sum3218
Variance28.946633
MonotonicityNot monotonic
2023-12-10T15:55:36.506356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
20 30
15.0%
18 29
14.5%
13 15
 
7.5%
21 15
 
7.5%
16 14
 
7.0%
17 14
 
7.0%
22 13
 
6.5%
12 12
 
6.0%
15 10
 
5.0%
19 9
 
4.5%
Other values (9) 39
19.5%
ValueCountFrequency (%)
0 4
 
2.0%
1 7
3.5%
2 2
 
1.0%
7 4
 
2.0%
8 1
 
0.5%
10 1
 
0.5%
11 6
 
3.0%
12 12
6.0%
13 15
7.5%
14 8
4.0%
ValueCountFrequency (%)
23 6
 
3.0%
22 13
6.5%
21 15
7.5%
20 30
15.0%
19 9
 
4.5%
18 29
14.5%
17 14
7.0%
16 14
7.0%
15 10
 
5.0%
14 8
 
4.0%

출입분
Real number (ℝ)

ZEROS 

Distinct58
Distinct (%)29.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.54
Minimum0
Maximum59
Zeros4
Zeros (%)2.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:55:36.634159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q117
median26.5
Q342
95-th percentile57
Maximum59
Range59
Interquartile range (IQR)25

Descriptive statistics

Standard deviation16.478962
Coefficient of variation (CV)0.5773988
Kurtosis-0.96998608
Mean28.54
Median Absolute Deviation (MAD)13
Skewness0.16910854
Sum5708
Variance271.55618
MonotonicityNot monotonic
2023-12-10T15:55:36.761595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
28 8
 
4.0%
22 8
 
4.0%
21 8
 
4.0%
17 7
 
3.5%
42 7
 
3.5%
19 6
 
3.0%
3 6
 
3.0%
33 6
 
3.0%
57 5
 
2.5%
43 5
 
2.5%
Other values (48) 134
67.0%
ValueCountFrequency (%)
0 4
2.0%
1 2
 
1.0%
2 2
 
1.0%
3 6
3.0%
4 1
 
0.5%
5 3
1.5%
6 2
 
1.0%
7 2
 
1.0%
8 4
2.0%
9 4
2.0%
ValueCountFrequency (%)
59 4
2.0%
58 2
 
1.0%
57 5
2.5%
56 4
2.0%
55 3
1.5%
54 4
2.0%
52 1
 
0.5%
51 2
 
1.0%
50 3
1.5%
49 2
 
1.0%

Interactions

2023-12-10T15:55:34.966659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:55:34.532386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:55:34.748247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:55:35.044292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:55:34.598147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:55:34.820664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:55:35.117001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:55:34.673132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:55:34.891237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:55:36.872434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출입일련번호출입구분코드출입일자출입시간출입분
출입일련번호1.0000.0000.9140.8720.053
출입구분코드0.0001.0000.0000.0000.000
출입일자0.9140.0001.0000.7980.191
출입시간0.8720.0000.7981.0000.142
출입분0.0530.0000.1910.1421.000
2023-12-10T15:55:36.975711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출입일자출입구분코드
출입일자1.0000.000
출입구분코드0.0001.000
2023-12-10T15:55:37.063052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출입일련번호출입시간출입분출입구분코드출입일자
출입일련번호1.0000.1620.0010.0000.796
출입시간0.1621.000-0.1420.0000.459
출입분0.001-0.1421.0000.0000.112
출입구분코드0.0000.0000.0001.0000.000
출입일자0.7960.4590.1120.0001.000

Missing values

2023-12-10T15:55:35.232931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:55:35.342922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

출입일련번호시설물ID출입구분코드출입일자출입시간출입분
0166SDTRPIS001000020201611181648
1179SDTRPIS001000021201611181719
2180SDTRPIS001000020201611181720
3189SDTRPIS001000020201611181759
4192SDTRPIS00100002120161118182
5196SDTRPIS001000020201611181815
6197SDTRPIS001000021201611181815
7202SDTRPIS001000020201611181821
8226SDTRPIS001000020201611181825
9231SDTRPIS001000020201611181833
출입일련번호시설물ID출입구분코드출입일자출입시간출입분
190485SDTRPIS001000020201611191518
191504SDTRPIS001000021201611191640
192532SDTRPIS001000021201611191924
193536SDTRPIS001000021201611191928
194537SDTRPIS001000021201611191928
195541SDTRPIS001000020201611191936
196555SDTRPIS001000020201611192013
197559SDTRPIS001000021201611192019
198560SDTRPIS001000020201611192022
199562SDTRPIS001000021201611192025