Overview

Dataset statistics

Number of variables: 2
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1380
Duplicate rows (%): 13.8%
Total size in memory: 244.1 KiB
Average record size in memory: 25.0 B
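
As a quick consistency check, the average record size should equal the total in-memory size divided by the number of observations. A minimal Python sketch of that arithmetic, using only figures from this report (the variable names are illustrative):

```python
# Figures copied from the "Dataset statistics" block of this report.
total_size_kib = 244.1      # total size in memory, in KiB
n_observations = 10_000     # number of rows

# 1 KiB = 1024 bytes; dividing by the row count gives bytes per record.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"{avg_record_bytes:.1f} B")  # prints "25.0 B"
```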

Variable types

Numeric: 1
Categorical: 1

Dataset

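Description: Provides the send results of outgoing messages for the smart vocational training platform (STEP) of the Online Lifelong Education Institute at Korea University of Technology and Education (한국기술교육대학교 온라인평생교육원).
Author: Korea University of Technology and Education (한국기술교육대학교)
URL: https://www.data.go.kr/data/15090985/fileData.do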

Alerts

결과 코드 (result code) has constant value "실패" (failure) [Constant]
Dataset has 1380 (13.8%) duplicate rows [Duplicates]

Reproduction

Analysis started: 2023-12-12 19:13:45.256811
Analysis finished: 2023-12-12 19:13:45.526921
Duration: 0.27 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
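
A report of this kind could be regenerated roughly as follows. This is a minimal sketch, not the original script: the function name `build_report`, the file paths, the report title, and the `cp949` default encoding are all assumptions, and the `config.json` referenced above is not used.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "cp949") -> None:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so that defining this function needs no third-party packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; change it if the file is UTF-8.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Default settings are used here; the original run may have used others.
    profile = ProfileReport(df, title="STEP message send results")
    profile.to_file(html_path)
```

Calling `build_report("messages.csv", "report.html")` with a locally downloaded copy of the dataset (hypothetical file names) would write the HTML report to disk.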

Variables

메시지 아이디 (message ID)
Real number (ℝ)

Distinct: 3877
Distinct (%): 38.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 244551.29
Minimum: 200647
Maximum: 467746
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 200647
5-th percentile: 203010.95
Q1: 208657
Median: 217066
Q3: 224977.75
95-th percentile: 462501
Maximum: 467746
Range: 267099
Interquartile range (IQR): 16320.75
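
The range and IQR follow directly from the quantiles listed here. The sketch below recomputes them and, as an addition that is not part of the report, applies Tukey's rule of thumb (Q3 + 1.5 × IQR) to show how far the 95th percentile sits above the bulk of the values:

```python
# Quantiles copied from this report.
minimum, q1, q3, p95, maximum = 200647, 208657, 224977.75, 462501, 467746

value_range = maximum - minimum   # 267099
iqr = q3 - q1                     # 16320.75

# Tukey's upper fence: a common outlier heuristic, not something the report computes.
upper_fence = q3 + 1.5 * iqr      # 249458.875

print(value_range, iqr, upper_fence)
print(p95 > upper_fence)          # True: the upper 5% lies well beyond the fence
```

This is consistent with the positive skewness reported for this variable: most IDs cluster near 200000 to 230000, with a smaller group above 460000.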

Descriptive statistics

Standard deviation: 70240.744
Coefficient of variation (CV): 0.28722296
Kurtosis: 3.7681446
Mean: 244551.29
Median Absolute Deviation (MAD): 8363
Skewness: 2.2339508
Sum: 2.4455129 × 10^9
Variance: 4.9337621 × 10^9
Monotonicity: Not monotonic
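
Several of these figures are related by simple identities: CV = standard deviation / mean, variance = standard deviation squared, and sum = mean × number of observations. A small sketch that recomputes them from the rounded values in this report (the tolerance of 1e-6 is an arbitrary choice to absorb rounding):

```python
import math

# Values copied from this report.
n = 10_000
mean = 244551.29
std = 70240.744

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the square of the standard deviation
total = mean * n       # sum of all values

# Compare against the reported figures, allowing for rounding in the inputs.
print(math.isclose(cv, 0.28722296, rel_tol=1e-6))
print(math.isclose(variance, 4.9337621e9, rel_tol=1e-6))
print(math.isclose(total, 2.4455129e9, rel_tol=1e-6))
```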
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value                  Count   Frequency (%)
222845                 117     1.2%
305648                 110     1.1%
306152                 105     1.1%
269199                 100     1.0%
275960                 99      1.0%
227665                 96      1.0%
217132                 94      0.9%
227673                 92      0.9%
267649                 91      0.9%
466609                 87      0.9%
Other values (3867)    9009    90.1%
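
The "Other values" row can be derived from the ten listed counts together with the totals reported for this variable. A short sketch of that arithmetic (variable names are illustrative):

```python
# Counts of the ten most frequent 메시지 아이디 values, copied from the table above.
top_counts = [117, 110, 105, 100, 99, 96, 94, 92, 91, 87]

n_rows = 10_000     # number of observations
n_distinct = 3877   # distinct 메시지 아이디 values

other_count = n_rows - sum(top_counts)          # rows not covered by the top ten
other_distinct = n_distinct - len(top_counts)   # distinct values not listed
other_pct = 100 * other_count / n_rows

print(other_distinct, other_count, f"{other_pct:.1f}%")  # 3867 9009 90.1%
```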

Minimum 10 values

Value     Count   Frequency (%)
200647    1       < 0.1%
200663    1       < 0.1%
200672    1       < 0.1%
200700    1       < 0.1%
200712    1       < 0.1%
200717    1       < 0.1%
200724    1       < 0.1%
200726    5       0.1%
200728    5       0.1%
200730    1       < 0.1%

Maximum 10 values

Value     Count   Frequency (%)
467746    80      0.8%
467380    74      0.7%
466609    87      0.9%
466305    67      0.7%
464693    76      0.8%
464329    62      0.6%
462501    74      0.7%
462493    73      0.7%
450418    2       < 0.1%
412689    15      0.1%

결과 코드 (result code)
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
실패 (failure): 10000

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 실패 (failure)
2nd row: 실패
3rd row: 실패
4th row: 실패
5th row: 실패

Common Values

Value             Count    Frequency (%)
실패 (failure)    10000    100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; 실패 (failure) accounts for all 10000 rows (100.0%)]

Interactions

[Figure: interactions plot]

Missing values

[Figure: nullity by column] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.

Sample

First rows

Index    메시지 아이디 (message ID)    결과 코드 (result code)
35647    206404                        실패 (failure)
80336    221530                        실패
69855    217221                        실패
3921     275960                        실패
36038    206569                        실패
52075    211559                        실패
6714     286153                        실패
29616    204285                        실패
35755    206504                        실패
16403    464329                        실패

Last rows

Index    메시지 아이디 (message ID)    결과 코드 (result code)
18756    466609                        실패
6573     286153                        실패
74724    217937                        실패
32350    205030                        실패
20581    200796                        실패
75062    217940                        실패
5928     285923                        실패
27088    203723                        실패
9164     306152                        실패
93198    227665                        실패

Duplicate rows

Most frequently occurring

Index    메시지 아이디 (message ID)    결과 코드 (result code)    # duplicates
1127     222845                        실패 (failure)             117
1302     305648                        실패                       110
1303     306152                        실패                       105
1279     269199                        실패                       100
1280     275960                        실패                       99
1248     227665                        실패                       96
951      217132                        실패                       94
1251     227673                        실패                       92
1278     267649                        실패                       91
1377     466609                        실패                       87
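
The "# duplicates" column gives the number of times each identical row occurs. A table like this could be built with `collections.Counter`, as sketched below on a small made-up sample (the rows are hypothetical, not taken from the dataset). The report does not state whether its "Duplicate rows" total counts distinct duplicated rows or surplus copies, so the sketch computes both.

```python
from collections import Counter

# Hypothetical rows of (메시지 아이디, 결과 코드); made up for illustration.
rows = [
    (222845, "실패"), (222845, "실패"), (222845, "실패"),
    (305648, "실패"), (305648, "실패"),
    (200647, "실패"),
]

counts = Counter(rows)

# Keep only rows that occur more than once, most frequent first.
duplicates = [(row, n) for row, n in counts.most_common() if n > 1]

n_groups = len(duplicates)                      # distinct rows that are duplicated
n_surplus = sum(n - 1 for _, n in duplicates)   # extra copies beyond the first

for (message_id, result), n in duplicates:
    print(message_id, result, n)
print(n_groups, n_surplus)
```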