Overview

Dataset statistics

Number of variables2
Number of observations6492
Missing cells0
Missing cells (%)0.0%
Duplicate rows7
Duplicate rows (%)0.1%
Total size in memory114.2 KiB
Average record size in memory18.0 B

Variable types

Numeric2

Dataset

Description결과 관리 번호,계획 관리 번호
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-21191/S/1/datasetView.do

Alerts

Dataset has 7 (0.1%) duplicate rowsDuplicates
결과 관리 번호 is highly overall correlated with 계획 관리 번호High correlation
계획 관리 번호 is highly overall correlated with 결과 관리 번호High correlation
결과 관리 번호 has 4968 (76.5%) zerosZeros
계획 관리 번호 has 1524 (23.5%) zerosZeros

Reproduction

Analysis started2024-05-11 03:39:55.179164
Analysis finished2024-05-11 03:39:57.382807
Duration2.2 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

결과 관리 번호
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1523
Distinct (%)23.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean179.50539
Minimum0
Maximum1531
Zeros4968
Zeros (%)76.5%
Negative0
Negative (%)0.0%
Memory size57.2 KiB
2024-05-11T03:39:57.638010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1204.45
Maximum1531
Range1531
Interquartile range (IQR)0

Descriptive statistics

Standard deviation388.70968
Coefficient of variation (CV)2.1654485
Kurtosis3.1008392
Mean179.50539
Median Absolute Deviation (MAD)0
Skewness2.1027536
Sum1165349
Variance151095.21
MonotonicityNot monotonic
2024-05-11T03:39:58.256148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 4968
76.5%
2 2
 
< 0.1%
3 2
 
< 0.1%
860 1
 
< 0.1%
985 1
 
< 0.1%
993 1
 
< 0.1%
992 1
 
< 0.1%
991 1
 
< 0.1%
990 1
 
< 0.1%
989 1
 
< 0.1%
Other values (1513) 1513
 
23.3%
ValueCountFrequency (%)
0 4968
76.5%
1 1
 
< 0.1%
2 2
 
< 0.1%
3 2
 
< 0.1%
4 1
 
< 0.1%
5 1
 
< 0.1%
6 1
 
< 0.1%
7 1
 
< 0.1%
8 1
 
< 0.1%
9 1
 
< 0.1%
ValueCountFrequency (%)
1531 1
< 0.1%
1530 1
< 0.1%
1529 1
< 0.1%
1528 1
< 0.1%
1527 1
< 0.1%
1526 1
< 0.1%
1525 1
< 0.1%
1524 1
< 0.1%
1523 1
< 0.1%
1522 1
< 0.1%

계획 관리 번호
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct4950
Distinct (%)76.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2123.1396
Minimum0
Maximum5383
Zeros1524
Zeros (%)23.5%
Negative0
Negative (%)0.0%
Memory size57.2 KiB
2024-05-11T03:39:58.880418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1103.75
median1977.5
Q33695.25
95-th percentile5048.45
Maximum5383
Range5383
Interquartile range (IQR)3591.5

Descriptive statistics

Standard deviation1779.5217
Coefficient of variation (CV)0.83815577
Kurtosis-1.3313045
Mean2123.1396
Median Absolute Deviation (MAD)1784.5
Skewness0.23793219
Sum13783422
Variance3166697.4
MonotonicityNot monotonic
2024-05-11T03:39:59.502421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1524
 
23.5%
296 11
 
0.2%
250 6
 
0.1%
284 3
 
< 0.1%
5379 2
 
< 0.1%
327 2
 
< 0.1%
1840 1
 
< 0.1%
1820 1
 
< 0.1%
1818 1
 
< 0.1%
1817 1
 
< 0.1%
Other values (4940) 4940
76.1%
ValueCountFrequency (%)
0 1524
23.5%
1 1
 
< 0.1%
2 1
 
< 0.1%
3 1
 
< 0.1%
4 1
 
< 0.1%
5 1
 
< 0.1%
6 1
 
< 0.1%
7 1
 
< 0.1%
8 1
 
< 0.1%
9 1
 
< 0.1%
ValueCountFrequency (%)
5383 1
< 0.1%
5382 1
< 0.1%
5381 1
< 0.1%
5380 1
< 0.1%
5379 2
< 0.1%
5378 1
< 0.1%
5377 1
< 0.1%
5376 1
< 0.1%
5375 1
< 0.1%
5374 1
< 0.1%

Interactions

2024-05-11T03:39:56.382804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T03:39:55.418244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T03:39:56.710860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T03:39:55.751134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T03:39:59.802596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
결과 관리 번호계획 관리 번호
결과 관리 번호1.0000.674
계획 관리 번호0.6741.000
2024-05-11T03:40:00.420470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
결과 관리 번호계획 관리 번호
결과 관리 번호1.000-0.730
계획 관리 번호-0.7301.000

Missing values

2024-05-11T03:39:57.074642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T03:39:57.290282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

결과 관리 번호계획 관리 번호
08600
18610
203962
303963
48620
503964
603965
703966
803967
903968
결과 관리 번호계획 관리 번호
648201359
648301360
648401361
648501362
648601363
648701364
648801365
648901366
649001368
649101369

Duplicate rows

Most frequently occurring

결과 관리 번호계획 관리 번호# duplicates
2029611
002506
102843
303272
4053792
5202
6302