Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 3870 |
Missing cells | 1248 |
Missing cells (%) | 4.6% |
Duplicate rows | 516 |
Duplicate rows (%) | 13.3% |
Total size in memory | 223.1 KiB |
Average record size in memory | 59.0 B |
Variable types
Numeric | 3 |
---|---|
Boolean | 1 |
DateTime | 1 |
Categorical | 1 |
Text | 1 |
Dataset
Description | 가축분뇨 전자인계관리시스템에서 관리하고 있는 정보 중에 가축분뇨 및 액비의 배출,운반, 처리 와 관리하고 있는 회원정보(사용자유형 등)으로 등록되어 관리되고 있는 정보 입니다. |
---|---|
Author | 한국환경공단 |
URL | https://www.data.go.kr/data/15041844/fileData.do |
사용여부 has constant value "" | Constant |
Dataset has 516 (13.3%) duplicate rows | Duplicates |
사용자유형 is highly overall correlated with 사용자구분 | High correlation |
사용자구분 is highly overall correlated with 사용자유형 | High correlation |
업체사용구분 has 416 (10.7%) missing values | Missing |
관할지사 has 416 (10.7%) missing values | Missing |
관할관청 has 416 (10.7%) missing values | Missing |
사용자유형 is highly skewed (γ1 = 31.30024297) | Skewed |
Reproduction
Analysis started | 2023-12-12 01:31:47.901427 |
---|---|
Analysis finished | 2023-12-12 01:31:50.173356 |
Duration | 2.27 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
사용자유형
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 9 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.3268734 |
Minimum | 1 |
---|---|
Maximum | 99 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 34.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 1 |
95-th percentile | 3 |
Maximum | 99 |
Range | 98 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2.8514304 |
---|---|
Coefficient of variation (CV) | 2.1489845 |
Kurtosis | 1066.4046 |
Mean | 1.3268734 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 31.300243 |
Sum | 5135 |
Variance | 8.1306553 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3453 | |
3 | 318 | 8.2% |
2 | 46 | 1.2% |
6 | 30 | 0.8% |
9 | 9 | 0.2% |
7 | 7 | 0.2% |
8 | 3 | 0.1% |
99 | 3 | 0.1% |
5 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1 | 3453 | |
2 | 46 | 1.2% |
3 | 318 | 8.2% |
5 | 1 | < 0.1% |
6 | 30 | 0.8% |
7 | 7 | 0.2% |
8 | 3 | 0.1% |
9 | 9 | 0.2% |
99 | 3 | 0.1% |
Value | Count | Frequency (%) |
99 | 3 | 0.1% |
9 | 9 | 0.2% |
8 | 3 | 0.1% |
7 | 7 | 0.2% |
6 | 30 | 0.8% |
5 | 1 | < 0.1% |
3 | 318 | 8.2% |
2 | 46 | 1.2% |
1 | 3453 |
사용여부
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.9 KiB |
True |
---|
Value | Count | Frequency (%) |
True | 3870 |
가입일자
Date
Distinct | 104 |
---|---|
Distinct (%) | 2.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 30.4 KiB |
Minimum | 2013-11-01 00:00:00 |
---|---|
Maximum | 2022-12-01 00:00:00 |
사용자구분
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 30.4 KiB |
A2 | |
---|---|
A1 | |
<NA> |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.2149871 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | A2 |
---|---|
2nd row | A2 |
3rd row | A2 |
4th row | A2 |
5th row | A2 |
Common Values
Value | Count | Frequency (%) |
A2 | 2646 | |
A1 | 808 | 20.9% |
<NA> | 416 | 10.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
a2 | 2646 | |
a1 | 808 | 20.9% |
na | 416 | 10.7% |
업체사용구분
Text
MISSING
 
Distinct | 68 |
---|---|
Distinct (%) | 2.0% |
Missing | 416 |
Missing (%) | 10.7% |
Memory size | 30.4 KiB |
Value | Count | Frequency (%) |
01 | 2011 | |
02 | 415 | 12.0% |
01,04 | 258 | 7.5% |
03 | 222 | 6.4% |
04 | 187 | 5.4% |
05 | 133 | 3.9% |
02,05 | 23 | 0.7% |
06 | 22 | 0.6% |
02,03 | 18 | 0.5% |
01,02 | 10 | 0.3% |
Other values (58) | 155 | 4.5% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 3999 | |
1 | 2363 | |
, | 555 | 6.5% |
2 | 533 | 6.2% |
4 | 500 | 5.8% |
3 | 303 | 3.5% |
5 | 231 | 2.7% |
6 | 76 | 0.9% |
7 | 7 | 0.1% |
9 | 5 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 8018 | |
Other Punctuation | 555 | 6.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 3999 | |
1 | 2363 | |
2 | 533 | 6.6% |
4 | 500 | 6.2% |
3 | 303 | 3.8% |
5 | 231 | 2.9% |
6 | 76 | 0.9% |
7 | 7 | 0.1% |
9 | 5 | 0.1% |
8 | 1 | < 0.1% |
Other Punctuation
Value | Count | Frequency (%) |
, | 555 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 8573 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 3999 | |
1 | 2363 | |
, | 555 | 6.5% |
2 | 533 | 6.2% |
4 | 500 | 5.8% |
3 | 303 | 3.5% |
5 | 231 | 2.7% |
6 | 76 | 0.9% |
7 | 7 | 0.1% |
9 | 5 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8573 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 3999 | |
1 | 2363 | |
, | 555 | 6.5% |
2 | 533 | 6.2% |
4 | 500 | 5.8% |
3 | 303 | 3.5% |
5 | 231 | 2.7% |
6 | 76 | 0.9% |
7 | 7 | 0.1% |
9 | 5 | 0.1% |
관할지사
Real number (ℝ)
MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 0.3% |
Missing | 416 |
Missing (%) | 10.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 906.0388 |
Minimum | 901 |
---|---|
Maximum | 910 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 34.1 KiB |
Quantile statistics
Minimum | 901 |
---|---|
5-th percentile | 901 |
Q1 | 904 |
median | 906 |
Q3 | 908 |
95-th percentile | 909 |
Maximum | 910 |
Range | 9 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.5106089 |
---|---|
Coefficient of variation (CV) | 0.0027709729 |
Kurtosis | -0.81491397 |
Mean | 906.0388 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.40221258 |
Sum | 3129458 |
Variance | 6.3031571 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
906 | 581 | |
908 | 525 | |
909 | 512 | |
907 | 406 | |
904 | 346 | |
905 | 303 | |
902 | 237 | |
901 | 198 | 5.1% |
903 | 187 | 4.8% |
910 | 159 | 4.1% |
(Missing) | 416 |
Value | Count | Frequency (%) |
901 | 198 | 5.1% |
902 | 237 | |
903 | 187 | 4.8% |
904 | 346 | |
905 | 303 | |
906 | 581 | |
907 | 406 | |
908 | 525 | |
909 | 512 | |
910 | 159 | 4.1% |
Value | Count | Frequency (%) |
910 | 159 | 4.1% |
909 | 512 | |
908 | 525 | |
907 | 406 | |
906 | 581 | |
905 | 303 | |
904 | 346 | |
903 | 187 | 4.8% |
902 | 237 | |
901 | 198 | 5.1% |
관할관청
Real number (ℝ)
MISSING
 
Distinct | 149 |
---|---|
Distinct (%) | 4.3% |
Missing | 416 |
Missing (%) | 10.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1167.5052 |
Minimum | 123 |
---|---|
Maximum | 1701 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 34.1 KiB |
Quantile statistics
Minimum | 123 |
---|---|
5-th percentile | 821 |
Q1 | 1009 |
median | 1203 |
Q3 | 1320 |
95-th percentile | 1515 |
Maximum | 1701 |
Range | 1578 |
Interquartile range (IQR) | 311 |
Descriptive statistics
Standard deviation | 240.99416 |
---|---|
Coefficient of variation (CV) | 0.20641806 |
Kurtosis | 0.20048352 |
Mean | 1167.5052 |
Median Absolute Deviation (MAD) | 193 |
Skewness | -0.28720983 |
Sum | 4032563 |
Variance | 58078.187 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
829 | 125 | 3.2% |
1213 | 118 | 3.0% |
1602 | 110 | 2.8% |
1203 | 102 | 2.6% |
1104 | 91 | 2.4% |
1015 | 82 | 2.1% |
912 | 76 | 2.0% |
1401 | 67 | 1.7% |
1409 | 65 | 1.7% |
1209 | 63 | 1.6% |
Other values (139) | 2555 | |
(Missing) | 416 | 10.7% |
Value | Count | Frequency (%) |
123 | 1 | < 0.1% |
203 | 3 | 0.1% |
209 | 1 | < 0.1% |
210 | 1 | < 0.1% |
313 | 1 | < 0.1% |
411 | 31 | |
413 | 1 | < 0.1% |
417 | 4 | 0.1% |
511 | 4 | 0.1% |
603 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1701 | 5 | 0.1% |
1602 | 110 | |
1601 | 49 | |
1515 | 56 | |
1514 | 4 | 0.1% |
1513 | 12 | 0.3% |
1512 | 56 | |
1511 | 48 | |
1510 | 21 | 0.5% |
1509 | 5 | 0.1% |
사용자유형 | 사용자구분 | 업체사용구분 | 관할지사 | 관할관청 | |
---|---|---|---|---|---|
사용자유형 | 1.000 | NaN | NaN | NaN | NaN |
사용자구분 | NaN | 1.000 | 0.263 | 0.217 | 0.173 |
업체사용구분 | NaN | 0.263 | 1.000 | 0.463 | 0.360 |
관할지사 | NaN | 0.217 | 0.463 | 1.000 | 0.932 |
관할관청 | NaN | 0.173 | 0.360 | 0.932 | 1.000 |
사용자유형 | 관할지사 | 관할관청 | 사용자구분 | |
---|---|---|---|---|
사용자유형 | 1.000 | -0.010 | 0.026 | 1.000 |
관할지사 | -0.010 | 1.000 | 0.162 | 0.210 |
관할관청 | 0.026 | 0.162 | 1.000 | 0.132 |
사용자구분 | 1.000 | 0.210 | 0.132 | 1.000 |
사용자유형 | 사용여부 | 가입일자 | 사용자구분 | 업체사용구분 | 관할지사 | 관할관청 | |
---|---|---|---|---|---|---|---|
0 | 1 | Y | 2013-12 | A2 | 03 | 910 | 1601 |
1 | 1 | Y | 2013-12 | A2 | 03 | 910 | 1602 |
2 | 1 | Y | 2013-12 | A2 | 02,03,05,06 | 910 | 1601 |
3 | 1 | Y | 2013-12 | A2 | 01,04 | 910 | 1602 |
4 | 1 | Y | 2013-12 | A2 | 05,04,06 | 910 | 1602 |
5 | 1 | Y | 2013-12 | A2 | 05,06 | 910 | 1602 |
6 | 1 | Y | 2013-12 | A2 | 02,03,06 | 910 | 1602 |
7 | 1 | Y | 2013-12 | A2 | 05,06 | 910 | 1602 |
8 | 1 | Y | 2013-12 | A2 | 06 | 910 | 1601 |
9 | 1 | Y | 2013-12 | A2 | 05 | 910 | 1602 |
사용자유형 | 사용여부 | 가입일자 | 사용자구분 | 업체사용구분 | 관할지사 | 관할관청 | |
---|---|---|---|---|---|---|---|
3860 | 1 | Y | 2022-10 | A1 | 03 | 906 | 1213 |
3861 | 1 | Y | 2022-08 | A2 | 01 | 908 | 1019 |
3862 | 1 | Y | 2022-09 | A2 | 01 | 908 | 1011 |
3863 | 1 | Y | 2022-09 | A2 | 01 | 905 | 1509 |
3864 | 1 | Y | 2022-10 | A2 | 03 | 906 | 1213 |
3865 | 3 | Y | 2022-11 | <NA> | <NA> | <NA> | <NA> |
3866 | 1 | Y | 2022-12 | A2 | 03 | 907 | 1321 |
3867 | 1 | Y | 2022-12 | A1 | 01 | 908 | 1019 |
3868 | 6 | Y | 2022-10 | <NA> | <NA> | <NA> | <NA> |
3869 | 3 | Y | 2022-11 | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
사용자유형 | 사용여부 | 가입일자 | 사용자구분 | 업체사용구분 | 관할지사 | 관할관청 | # duplicates | |
---|---|---|---|---|---|---|---|---|
247 | 1 | Y | 2017-01 | A2 | 01 | 909 | 1104 | 37 |
273 | 1 | Y | 2017-02 | A2 | 01 | 904 | 1401 | 35 |
464 | 3 | Y | 2016-12 | <NA> | <NA> | <NA> | <NA> | 33 |
138 | 1 | Y | 2016-12 | A2 | 01 | 902 | 829 | 31 |
391 | 1 | Y | 2018-12 | A1 | 01 | 906 | 1209 | 28 |
147 | 1 | Y | 2016-12 | A2 | 01 | 905 | 1512 | 27 |
455 | 3 | Y | 2015-12 | <NA> | <NA> | <NA> | <NA> | 27 |
135 | 1 | Y | 2016-12 | A2 | 01 | 901 | 829 | 25 |
142 | 1 | Y | 2016-12 | A2 | 01 | 904 | 1409 | 25 |
123 | 1 | Y | 2016-12 | A1 | 01 | 908 | 1015 | 24 |