Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory341.8 KiB
Average record size in memory35.0 B

Variable types

Numeric3

Dataset

Description내일채움공제 가입기업의 매출액별 공제가입자수 현황에 대한 데이터로 가입기업의 당해년도 매출액과 누적 공제가입자 수를 제공합니다.
URLhttps://www.data.go.kr/data/15069014/fileData.do

Alerts

일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:04:23.555252
Analysis finished2023-12-12 19:04:25.411267
Duration1.86 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5002.6339
Minimum1
Maximum10006
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T04:04:25.529436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile500.95
Q12500.75
median5003.5
Q37503.25
95-th percentile9506.05
Maximum10006
Range10005
Interquartile range (IQR)5002.5

Descriptive statistics

Standard deviation2888.785
Coefficient of variation (CV)0.57745281
Kurtosis-1.2000571
Mean5002.6339
Median Absolute Deviation (MAD)2501.5
Skewness0.00031750813
Sum50026339
Variance8345078.8
MonotonicityNot monotonic
2023-12-13T04:04:25.745016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3652 1
 
< 0.1%
3895 1
 
< 0.1%
2080 1
 
< 0.1%
5089 1
 
< 0.1%
9782 1
 
< 0.1%
8821 1
 
< 0.1%
1229 1
 
< 0.1%
5027 1
 
< 0.1%
3502 1
 
< 0.1%
4046 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
10006 1
< 0.1%
10005 1
< 0.1%
10004 1
< 0.1%
10003 1
< 0.1%
10002 1
< 0.1%
10001 1
< 0.1%
10000 1
< 0.1%
9999 1
< 0.1%
9998 1
< 0.1%
9997 1
< 0.1%

매출액
Real number (ℝ)

Distinct9994
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.0997041 × 1010
Minimum9484000
Maximum3.5387 × 1011
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T04:04:25.953016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9484000
5-th percentile6.4495156 × 108
Q12.2618667 × 109
median5.3857501 × 109
Q31.2267726 × 1010
95-th percentile4.0501652 × 1010
Maximum3.5387 × 1011
Range3.5386052 × 1011
Interquartile range (IQR)1.0005859 × 1010

Descriptive statistics

Standard deviation1.6997875 × 1010
Coefficient of variation (CV)1.5456771
Kurtosis42.962577
Mean1.0997041 × 1010
Median Absolute Deviation (MAD)3.8230703 × 109
Skewness4.8238057
Sum1.0997041 × 1014
Variance2.8892774 × 1020
MonotonicityNot monotonic
2023-12-13T04:04:26.193441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
25000000 5
 
0.1%
48000000 2
 
< 0.1%
45000000 2
 
< 0.1%
28820211816 1
 
< 0.1%
5536015921 1
 
< 0.1%
500372731 1
 
< 0.1%
2791252326 1
 
< 0.1%
2314436909 1
 
< 0.1%
5337580606 1
 
< 0.1%
4721770929 1
 
< 0.1%
Other values (9984) 9984
99.8%
ValueCountFrequency (%)
9484000 1
 
< 0.1%
18102915 1
 
< 0.1%
23210000 1
 
< 0.1%
23607000 1
 
< 0.1%
25000000 5
0.1%
27003696 1
 
< 0.1%
28813809 1
 
< 0.1%
31188881 1
 
< 0.1%
31425000 1
 
< 0.1%
32834000 1
 
< 0.1%
ValueCountFrequency (%)
353870000000 1
< 0.1%
292847000000 1
< 0.1%
200898000000 1
< 0.1%
185269000000 1
< 0.1%
176401000000 1
< 0.1%
171274000000 1
< 0.1%
169718000000 1
< 0.1%
161476000000 1
< 0.1%
160551000000 1
< 0.1%
154261000000 1
< 0.1%

가입자수
Real number (ℝ)

Distinct91
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.1051
Minimum1
Maximum447
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T04:04:26.405333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q35
95-th percentile18
Maximum447
Range446
Interquartile range (IQR)4

Descriptive statistics

Standard deviation10.45539
Coefficient of variation (CV)2.0480285
Kurtosis427.98265
Mean5.1051
Median Absolute Deviation (MAD)1
Skewness14.524478
Sum51051
Variance109.31519
MonotonicityNot monotonic
2023-12-13T04:04:26.618456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 3319
33.2%
2 1948
19.5%
3 1091
 
10.9%
4 741
 
7.4%
5 592
 
5.9%
6 398
 
4.0%
7 310
 
3.1%
8 238
 
2.4%
9 163
 
1.6%
10 153
 
1.5%
Other values (81) 1047
 
10.5%
ValueCountFrequency (%)
1 3319
33.2%
2 1948
19.5%
3 1091
 
10.9%
4 741
 
7.4%
5 592
 
5.9%
6 398
 
4.0%
7 310
 
3.1%
8 238
 
2.4%
9 163
 
1.6%
10 153
 
1.5%
ValueCountFrequency (%)
447 1
< 0.1%
277 1
< 0.1%
240 1
< 0.1%
179 1
< 0.1%
167 1
< 0.1%
160 1
< 0.1%
134 1
< 0.1%
129 1
< 0.1%
122 1
< 0.1%
117 1
< 0.1%

Interactions

2023-12-13T04:04:24.811557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:04:23.949946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:04:24.345935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:04:24.951767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:04:24.070412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:04:24.493525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:04:25.110285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:04:24.221051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:04:24.647809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:04:26.755745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호매출액가입자수
일련번호1.0000.1230.033
매출액0.1231.0000.442
가입자수0.0330.4421.000
2023-12-13T04:04:26.877443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호매출액가입자수
일련번호1.000-0.293-0.117
매출액-0.2931.0000.365
가입자수-0.1170.3651.000

Missing values

2023-12-13T04:04:25.276425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:04:25.363448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일련번호매출액가입자수
36513652288202118165
9111911214851215892
657165729161625741
5729573030697888225
61346135208135843302
683684590833460291
23962397212956347574
6645664621750643734
342343359920046112
366367933924900011
일련번호매출액가입자수
80788079111073281143
3820382165575303841
29930069442462881
939940648060771405
5741574269737959252
69856986120837720136
3300330121105283531
49754976216067404699
5011501249377825281
9338933914771329161