Dataset statistics
Number of variables | 14 |
---|---|
Number of observations | 33 |
Missing cells | 77 |
Missing cells (%) | 16.7% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 3.8 KiB |
Average record size in memory | 116.9 B |
Variable types
Unsupported | 11 |
---|---|
Categorical | 3 |
Dataset
Description | 컨테이너,입국장,우편특송물류센에서국제식물검역인증원의검역을통해발견된외래병해충검출정보로월별지역정보,관리,잠정,비검역,건수,마리수등을제공한다 |
---|---|
Author | 국제식물검역인증원 |
URL | https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220714000000002160 |
Unnamed: 2 is highly correlated with Unnamed: 3 and 1 other fields | High correlation |
Unnamed: 3 is highly correlated with Unnamed: 2 and 1 other fields | High correlation |
ㅇ 컨테이너 및 적재장소 점검 관련 병해충 발견 실적 is highly correlated with Unnamed: 2 and 1 other fields | High correlation |
Unnamed: 0 has 33 (100.0%) missing values | Missing |
ㅇ 컨테이너 및 적재장소 점검 관련 병해충 발견 실적 has 26 (78.8%) missing values | Missing |
Unnamed: 2 has 15 (45.5%) missing values | Missing |
Unnamed: 3 has 3 (9.1%) missing values | Missing |
Unnamed: 0 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2022-08-12 14:52:41.729094 |
---|---|
Analysis finished | 2022-08-12 14:52:42.958246 |
Duration | 1.23 second |
Software version | pandas-profiling v3.2.0 |
Download configuration | config.json |
Distinct | 7 |
---|---|
Distinct (%) | 100.0% |
Missing | 26 |
Missing (%) | 78.8% |
Memory size | 392.0 B |
지역 | |
---|---|
인천공항 관리팀 | |
영남 지역관리팀 | |
중부 지역관리팀 | |
호남 지역관리팀 | |
Other values (2) |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 5.571428571 |
Min length | 2 |
Unique
Unique | 7 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 지역 |
---|---|
2nd row | 인천공항 관리팀 |
3rd row | 영남 지역관리팀 |
4th row | 중부 지역관리팀 |
5th row | 호남 지역관리팀 |
Common Values
Value | Count | Frequency (%) |
지역 | 1 | 3.0% |
인천공항 관리팀 | 1 | 3.0% |
영남 지역관리팀 | 1 | 3.0% |
중부 지역관리팀 | 1 | 3.0% |
호남 지역관리팀 | 1 | 3.0% |
합계 | 1 | 3.0% |
총계 | 1 | 3.0% |
(Missing) | 26 |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
지역관리팀 | 3 | |
지역 | 1 | 9.1% |
인천공항 | 1 | 9.1% |
관리팀 | 1 | 9.1% |
영남 | 1 | 9.1% |
중부 | 1 | 9.1% |
호남 | 1 | 9.1% |
합계 | 1 | 9.1% |
총계 | 1 | 9.1% |
Distinct | 6 |
---|---|
Distinct (%) | 33.3% |
Missing | 15 |
Missing (%) | 45.5% |
Memory size | 392.0 B |
관리 | |
---|---|
잠정 | |
비검역 | |
구분 | |
건수 |
Length
Max length | 3 |
---|---|
Median length | 2 |
Mean length | 2.333333333 |
Min length | 2 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 16.7% |
Sample
1st row | 구분 |
---|---|
2nd row | 관리 |
3rd row | 잠정 |
4th row | 비검역 |
5th row | 관리 |
Common Values
Value | Count | Frequency (%) |
관리 | 5 | 15.2% |
잠정 | 5 | 15.2% |
비검역 | 5 | 15.2% |
구분 | 1 | 3.0% |
건수 | 1 | 3.0% |
마리수 | 1 | 3.0% |
(Missing) | 15 |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
관리 | 5 | |
잠정 | 5 | |
비검역 | 5 | |
구분 | 1 | 5.6% |
건수 | 1 | 5.6% |
마리수 | 1 | 5.6% |
Distinct | 2 |
---|---|
Distinct (%) | 6.7% |
Missing | 3 |
Missing (%) | 9.1% |
Memory size | 392.0 B |
건수 | |
---|---|
마리수 |
Length
Max length | 3 |
---|---|
Median length | 2.5 |
Mean length | 2.5 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 건수 |
---|---|
2nd row | 마리수 |
3rd row | 건수 |
4th row | 마리수 |
5th row | 건수 |
Common Values
Value | Count | Frequency (%) |
건수 | 15 | |
마리수 | 15 | |
(Missing) | 3 | 9.1% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
건수 | 15 | |
마리수 | 15 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
Unnamed: 0 | ㅇ 컨테이너 및 적재장소 점검 관련 병해충 발견 실적 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | <NA> | 지역 | 구분 | <NA> | 4월 | 5월 | 6월 | 7월 | 8월 | 9월 | 10월 | 11월 | 12월 | 계 |
1 | <NA> | 인천공항 관리팀 | 관리 | 건수 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
2 | <NA> | <NA> | <NA> | 마리수 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
3 | <NA> | <NA> | 잠정 | 건수 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 |
4 | <NA> | <NA> | <NA> | 마리수 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 3 |
5 | <NA> | <NA> | 비검역 | 건수 | 0 | 1 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 5 |
6 | <NA> | <NA> | <NA> | 마리수 | 0 | 1 | 5 | 6 | 0 | 0 | 0 | 0 | 0 | 12 |
7 | <NA> | 영남 지역관리팀 | 관리 | 건수 | 4 | 10 | 15 | 18 | 14 | 20 | 16 | 3 | 1 | 101 |
8 | <NA> | <NA> | <NA> | 마리수 | 44 | 110 | 149 | 251 | 161 | 181 | 198 | 30 | 15 | 1139 |
9 | <NA> | <NA> | 잠정 | 건수 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 3 |
Last rows
Unnamed: 0 | ㅇ 컨테이너 및 적재장소 점검 관련 병해충 발견 실적 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
23 | <NA> | <NA> | 비검역 | 건수 | 15 | 12 | 28 | 68 | 91 | 81 | 65 | 45 | 5 | 410 |
24 | <NA> | <NA> | <NA> | 마리수 | 181 | 112 | 112 | 316 | 485 | 475 | 304 | 218 | 24 | 2227 |
25 | <NA> | 합계 | 관리 | 건수 | 4 | 24 | 42 | 43 | 44 | 46 | 25 | 13 | 2 | 243 |
26 | <NA> | <NA> | <NA> | 마리수 | 44 | 196 | 446 | 400 | 379 | 327 | 234 | 95 | 21 | 2142 |
27 | <NA> | <NA> | 잠정 | 건수 | 3 | 7 | 2 | 1 | 0 | 1 | 1 | 0 | 0 | 15 |
28 | <NA> | <NA> | <NA> | 마리수 | 6 | 39 | 13 | 1 | 0 | 10 | 3 | 0 | 0 | 72 |
29 | <NA> | <NA> | 비검역 | 건수 | 38 | 71 | 146 | 135 | 139 | 136 | 93 | 50 | 6 | 814 |
30 | <NA> | <NA> | <NA> | 마리수 | 314 | 476 | 612 | 1007 | 730 | 766 | 442 | 245 | 26 | 4618 |
31 | <NA> | 총계 | 건수 | <NA> | 45 | 102 | 190 | 179 | 183 | 183 | 119 | 63 | 8 | 1072 |
32 | <NA> | <NA> | 마리수 | <NA> | 364 | 711 | 1071 | 1408 | 1109 | 1103 | 679 | 340 | 47 | 6832 |