Overview

Dataset statistics

Number of variables5
Number of observations5000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory219.9 KiB
Average record size in memory45.0 B

Variable types

Numeric5

Dataset

Description한국지능정보사회진흥원(NIA)에서 제공하는 스마트워크 도입을 위한 자가진단 사용자답변에 관한 정보(사용자답변일련번호, 사용자정보일련번호, 설문지일련번호 등)입니다.
Author한국지능정보사회진흥원
URLhttps://www.data.go.kr/data/15064788/fileData.do

Alerts

사용자답변일련번호 is highly overall correlated with 사용자정보일련번호 and 3 other fieldsHigh correlation
사용자정보일련번호 is highly overall correlated with 사용자답변일련번호 and 3 other fieldsHigh correlation
설문지일련번호 is highly overall correlated with 사용자답변일련번호 and 3 other fieldsHigh correlation
질문일련번호 is highly overall correlated with 사용자답변일련번호 and 3 other fieldsHigh correlation
답변일련번호 is highly overall correlated with 사용자답변일련번호 and 3 other fieldsHigh correlation
사용자답변일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 02:28:16.906278
Analysis finished2023-12-12 02:28:20.974306
Duration4.07 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사용자답변일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct5000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7785.2614
Minimum1
Maximum11725
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size44.1 KiB
2023-12-12T11:28:21.049283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile250.95
Q16876.75
median8126.5
Q310376.25
95-th percentile11475.05
Maximum11725
Range11724
Interquartile range (IQR)3499.5

Descriptive statistics

Standard deviation3361.2212
Coefficient of variation (CV)0.4317416
Kurtosis0.56919695
Mean7785.2614
Median Absolute Deviation (MAD)1750
Skewness-1.194842
Sum38926307
Variance11297808
MonotonicityNot monotonic
2023-12-12T11:28:21.232818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8168 1
 
< 0.1%
677 1
 
< 0.1%
9965 1
 
< 0.1%
9964 1
 
< 0.1%
9963 1
 
< 0.1%
9962 1
 
< 0.1%
9961 1
 
< 0.1%
679 1
 
< 0.1%
678 1
 
< 0.1%
676 1
 
< 0.1%
Other values (4990) 4990
99.8%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
11725 1
< 0.1%
11724 1
< 0.1%
11723 1
< 0.1%
11722 1
< 0.1%
11721 1
< 0.1%
11720 1
< 0.1%
11719 1
< 0.1%
11718 1
< 0.1%
11717 1
< 0.1%
11716 1
< 0.1%

사용자정보일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct101
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean183.5668
Minimum1
Maximum283
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size44.1 KiB
2023-12-12T11:28:21.403653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6
Q1164
median192
Q3240
95-th percentile277
Maximum283
Range282
Interquartile range (IQR)76

Descriptive statistics

Standard deviation79.141856
Coefficient of variation (CV)0.43113382
Kurtosis0.64572736
Mean183.5668
Median Absolute Deviation (MAD)38
Skewness-1.2012544
Sum917834
Variance6263.4334
MonotonicityNot monotonic
2023-12-12T11:28:21.557898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
242 108
 
2.2%
280 70
 
1.4%
230 70
 
1.4%
228 69
 
1.4%
229 67
 
1.3%
231 66
 
1.3%
271 63
 
1.3%
9 63
 
1.3%
279 63
 
1.3%
270 62
 
1.2%
Other values (91) 4299
86.0%
ValueCountFrequency (%)
1 48
1.0%
2 47
0.9%
3 46
0.9%
4 47
0.9%
5 49
1.0%
6 46
0.9%
7 46
0.9%
8 49
1.0%
9 63
1.3%
10 50
1.0%
ValueCountFrequency (%)
283 39
0.8%
281 46
0.9%
280 70
1.4%
279 63
1.3%
277 53
1.1%
275 55
1.1%
273 50
1.0%
271 63
1.3%
270 62
1.2%
268 48
1.0%

설문지일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.4534
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size44.1 KiB
2023-12-12T11:28:21.694686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q38
95-th percentile9
Maximum9
Range8
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.1578193
Coefficient of variation (CV)0.70908055
Kurtosis-1.5557837
Mean4.4534
Median Absolute Deviation (MAD)1
Skewness0.51421248
Sum22267
Variance9.9718228
MonotonicityNot monotonic
2023-12-12T11:28:21.829525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
2 2156
43.1%
9 1147
22.9%
8 519
 
10.4%
1 442
 
8.8%
3 391
 
7.8%
5 275
 
5.5%
7 70
 
1.4%
ValueCountFrequency (%)
1 442
 
8.8%
2 2156
43.1%
3 391
 
7.8%
5 275
 
5.5%
7 70
 
1.4%
8 519
 
10.4%
9 1147
22.9%
ValueCountFrequency (%)
9 1147
22.9%
8 519
 
10.4%
7 70
 
1.4%
5 275
 
5.5%
3 391
 
7.8%
2 2156
43.1%
1 442
 
8.8%

질문일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct276
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean187.6268
Minimum1
Maximum368
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size44.1 KiB
2023-12-12T11:28:21.975284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile26
Q1110
median136
Q3314
95-th percentile359
Maximum368
Range367
Interquartile range (IQR)204

Descriptive statistics

Standard deviation109.78416
Coefficient of variation (CV)0.58511982
Kurtosis-1.270298
Mean187.6268
Median Absolute Deviation (MAD)39
Skewness0.38859265
Sum938134
Variance12052.562
MonotonicityNot monotonic
2023-12-12T11:28:22.155248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
133 54
 
1.1%
124 51
 
1.0%
102 51
 
1.0%
99 50
 
1.0%
132 50
 
1.0%
134 49
 
1.0%
138 49
 
1.0%
120 49
 
1.0%
131 48
 
1.0%
128 48
 
1.0%
Other values (266) 4501
90.0%
ValueCountFrequency (%)
1 9
0.2%
2 9
0.2%
3 9
0.2%
4 9
0.2%
5 9
0.2%
6 9
0.2%
7 13
0.3%
8 9
0.2%
9 9
0.2%
10 15
0.3%
ValueCountFrequency (%)
368 28
0.6%
367 20
0.4%
366 20
0.4%
365 20
0.4%
364 35
0.7%
363 29
0.6%
362 40
0.8%
361 24
0.5%
360 21
0.4%
359 21
0.4%

답변일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct1253
Distinct (%)25.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean879.2728
Minimum1
Maximum1982
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size44.1 KiB
2023-12-12T11:28:22.337822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile135
Q1335
median481.5
Q31687
95-th percentile1929
Maximum1982
Range1981
Interquartile range (IQR)1352

Descriptive statistics

Standard deviation674.21151
Coefficient of variation (CV)0.76678309
Kurtosis-1.4654983
Mean879.2728
Median Absolute Deviation (MAD)232.5
Skewness0.53037889
Sum4396364
Variance454561.17
MonotonicityNot monotonic
2023-12-12T11:28:22.537745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
295 42
 
0.8%
431 42
 
0.8%
315 41
 
0.8%
310 41
 
0.8%
464 41
 
0.8%
380 41
 
0.8%
413 41
 
0.8%
305 41
 
0.8%
452 41
 
0.8%
442 40
 
0.8%
Other values (1243) 4589
91.8%
ValueCountFrequency (%)
1 1
 
< 0.1%
3 4
0.1%
4 3
0.1%
5 1
 
< 0.1%
6 1
 
< 0.1%
7 3
0.1%
8 3
0.1%
9 1
 
< 0.1%
10 1
 
< 0.1%
12 1
 
< 0.1%
ValueCountFrequency (%)
1982 7
0.1%
1980 5
0.1%
1979 3
 
0.1%
1978 7
0.1%
1977 6
0.1%
1976 2
 
< 0.1%
1975 2
 
< 0.1%
1973 4
 
0.1%
1972 12
0.2%
1971 7
0.1%

Interactions

2023-12-12T11:28:20.212837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:17.473434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:18.499606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:19.076298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:19.667989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:20.310917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:17.614665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:18.616500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:19.199213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:19.789051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:20.432795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:17.743555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:18.710688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:19.323333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:19.897696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:20.558794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:17.888130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:18.822063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:19.426423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:20.014950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:20.664923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:18.016006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:18.963282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:19.535646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:28:20.110531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:28:22.651878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용자답변일련번호사용자정보일련번호설문지일련번호질문일련번호답변일련번호
사용자답변일련번호1.0000.9880.8760.8750.851
사용자정보일련번호0.9881.0000.8610.8750.839
설문지일련번호0.8760.8611.0000.9200.941
질문일련번호0.8750.8750.9201.0000.963
답변일련번호0.8510.8390.9410.9631.000
2023-12-12T11:28:22.778738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용자답변일련번호사용자정보일련번호설문지일련번호질문일련번호답변일련번호
사용자답변일련번호1.0001.0000.8590.8110.820
사용자정보일련번호1.0001.0000.8590.8080.817
설문지일련번호0.8590.8591.0000.9500.951
질문일련번호0.8110.8080.9501.0000.999
답변일련번호0.8200.8170.9510.9991.000

Missing values

2023-12-12T11:28:20.816072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:28:20.931977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사용자답변일련번호사용자정보일련번호설문지일련번호질문일련번호답변일련번호
081681922134471
181691922134472
281701922135476
381711922136480
481721922137485
581731922138490
68174193293250
78175193294253
88176193295261
98177193296264
사용자답변일련번호사용자정보일련번호설문지일련번호질문일련번호답변일련번호
49901171628393521890
49911171728393531899
49921171828393541900
49931171928393551912
49941172028393561913
49951172128393571918
49961172228393581927
49971172328393591929
49981172428393601938
49991172528393611939