Overview

Dataset statistics

Number of variables4
Number of observations1409
Missing cells157
Missing cells (%)2.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory49.7 KiB
Average record size in memory36.1 B

Variable types

Numeric3
Categorical1

Dataset

Description공통규격 DB 관리내용 중 일정한 번호를 내겨 나타냄 공통규격 DB 관리내용 중 일련번호별 별점 공통규격 DB 관리내용 중 공통규격별 일련번호 공통규격 DB 관리내용 중 사용자별 아이디를 표시함
URLhttps://www.data.go.kr/data/15069423/fileData.do

Alerts

일련번호 is highly overall correlated with 공통규격일련번호High correlation
공통규격일련번호 is highly overall correlated with 일련번호High correlation
공통규격일련번호 has 157 (11.1%) missing valuesMissing
일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:38:54.645588
Analysis finished2023-12-12 13:38:55.904479
Duration1.26 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1409
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1244.2569
Minimum104
Maximum3020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.5 KiB
2023-12-12T22:38:55.977486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum104
5-th percentile316.4
Q1848
median1213
Q31584
95-th percentile2537.6
Maximum3020
Range2916
Interquartile range (IQR)736

Descriptive statistics

Standard deviation593.61463
Coefficient of variation (CV)0.47708365
Kurtosis0.54894284
Mean1244.2569
Median Absolute Deviation (MAD)368
Skewness0.61662029
Sum1753158
Variance352378.33
MonotonicityNot monotonic
2023-12-12T22:38:56.129636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
326 1
 
0.1%
2135 1
 
0.1%
1396 1
 
0.1%
1386 1
 
0.1%
1384 1
 
0.1%
1381 1
 
0.1%
1380 1
 
0.1%
1377 1
 
0.1%
1373 1
 
0.1%
1370 1
 
0.1%
Other values (1399) 1399
99.3%
ValueCountFrequency (%)
104 1
0.1%
105 1
0.1%
106 1
0.1%
123 1
0.1%
124 1
0.1%
143 1
0.1%
163 1
0.1%
164 1
0.1%
165 1
0.1%
166 1
0.1%
ValueCountFrequency (%)
3020 1
0.1%
3019 1
0.1%
3002 1
0.1%
3001 1
0.1%
3000 1
0.1%
2999 1
0.1%
2980 1
0.1%
2979 1
0.1%
2959 1
0.1%
2939 1
0.1%

별점
Categorical

Distinct5
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size11.1 KiB
5
842 
4
339 
3
167 
2
 
35
1
 
26

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4
2nd row5
3rd row3
4th row4
5th row4

Common Values

ValueCountFrequency (%)
5 842
59.8%
4 339
24.1%
3 167
 
11.9%
2 35
 
2.5%
1 26
 
1.8%

Length

2023-12-12T22:38:56.273244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:38:56.465635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 842
59.8%
4 339
24.1%
3 167
 
11.9%
2 35
 
2.5%
1 26
 
1.8%

공통규격일련번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct242
Distinct (%)19.3%
Missing157
Missing (%)11.1%
Infinite0
Infinite (%)0.0%
Mean522.85623
Minimum81
Maximum3711
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.5 KiB
2023-12-12T22:38:56.607470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum81
5-th percentile259
Q1386
median513
Q3682
95-th percentile820
Maximum3711
Range3630
Interquartile range (IQR)296

Descriptive statistics

Standard deviation227.87997
Coefficient of variation (CV)0.43583678
Kurtosis58.794451
Mean522.85623
Median Absolute Deviation (MAD)157
Skewness4.1077168
Sum654616
Variance51929.282
MonotonicityNot monotonic
2023-12-12T22:38:56.780243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
689 40
 
2.8%
270 33
 
2.3%
422 29
 
2.1%
271 28
 
2.0%
272 27
 
1.9%
622 24
 
1.7%
492 24
 
1.7%
576 24
 
1.7%
501 24
 
1.7%
723 22
 
1.6%
Other values (232) 977
69.3%
(Missing) 157
 
11.1%
ValueCountFrequency (%)
81 13
0.9%
82 12
0.9%
83 7
0.5%
86 12
0.9%
87 3
 
0.2%
88 1
 
0.1%
91 1
 
0.1%
176 2
 
0.1%
259 15
1.1%
260 12
0.9%
ValueCountFrequency (%)
3711 1
 
0.1%
3688 1
 
0.1%
879 3
 
0.2%
877 1
 
0.1%
875 4
 
0.3%
870 14
1.0%
869 4
 
0.3%
868 1
 
0.1%
827 4
 
0.3%
826 1
 
0.1%

사용자아이디
Real number (ℝ)

Distinct331
Distinct (%)23.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3881.1462
Minimum2
Maximum22147
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.5 KiB
2023-12-12T22:38:56.973984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile54
Q1578
median1917
Q35085
95-th percentile14148
Maximum22147
Range22145
Interquartile range (IQR)4507

Descriptive statistics

Standard deviation4662.559
Coefficient of variation (CV)1.2013356
Kurtosis2.1134446
Mean3881.1462
Median Absolute Deviation (MAD)1629
Skewness1.634688
Sum5468535
Variance21739456
MonotonicityNot monotonic
2023-12-12T22:38:57.109241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
73 73
 
5.2%
12107 53
 
3.8%
790 48
 
3.4%
2 46
 
3.3%
209 30
 
2.1%
8327 30
 
2.1%
8998 29
 
2.1%
1657 23
 
1.6%
3474 23
 
1.6%
1089 22
 
1.6%
Other values (321) 1032
73.2%
ValueCountFrequency (%)
2 46
3.3%
23 8
 
0.6%
38 10
 
0.7%
52 5
 
0.4%
54 7
 
0.5%
60 2
 
0.1%
61 3
 
0.2%
73 73
5.2%
83 3
 
0.2%
90 3
 
0.2%
ValueCountFrequency (%)
22147 2
 
0.1%
20731 13
0.9%
18962 10
0.7%
18948 1
 
0.1%
18881 1
 
0.1%
17527 3
 
0.2%
17466 5
 
0.4%
16268 11
0.8%
16116 2
 
0.1%
15963 1
 
0.1%

Interactions

2023-12-12T22:38:55.444032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:38:54.811052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:38:55.114964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:38:55.547724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:38:54.935572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:38:55.239909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:38:55.647591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:38:55.030064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:38:55.353485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:38:57.202514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호별점공통규격일련번호사용자아이디
일련번호1.0000.1830.5720.439
별점0.1831.0000.0000.233
공통규격일련번호0.5720.0001.0000.050
사용자아이디0.4390.2330.0501.000
2023-12-12T22:38:57.316533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호공통규격일련번호사용자아이디별점
일련번호1.0000.5620.0830.077
공통규격일련번호0.5621.0000.0220.000
사용자아이디0.0830.0221.0000.099
별점0.0770.0000.0991.000

Missing values

2023-12-12T22:38:55.765639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:38:55.859479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일련번호별점공통규격일련번호사용자아이디
032643104712
124952602
23053<NA>504
33294270145
43304<NA>145
53583813107
635932594159
73602<NA>4159
836753081917
93765<NA>795
일련번호별점공통규격일련번호사용자아이디
139929025493790
1400290336211403
140129395690702
140229795495540
140329805821540
1404299958701637
1405300048701637
1406300126212702
1407300226212702
1408302045761657