Overview

Dataset statistics

Number of variables4
Number of observations1011
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory33.7 KiB
Average record size in memory34.1 B

Variable types

Categorical1
Text1
Numeric2

Dataset

Description인천광역시 남촌농산물도매시장에 입점해있는 법인별 거래실적에 대한 데이터로 법인명, 품목, 물량, 금액등을 볼 수 있습니다.
Author인천광역시
URLhttps://www.data.go.kr/data/15051662/fileData.do

Alerts

물량 is highly overall correlated with 금액High correlation
금액 is highly overall correlated with 물량High correlation

Reproduction

Analysis started2024-04-21 01:18:35.677082
Analysis finished2024-04-21 01:18:37.538991
Duration1.86 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

법인명
Categorical

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
(주)대인농산
292 
인천원예농협
253 
인천농산(주)
234 
덕풍청과(주)
232 

Length

Max length7
Median length7
Mean length6.7497527
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row(주)대인농산
2nd row(주)대인농산
3rd row(주)대인농산
4th row(주)대인농산
5th row(주)대인농산

Common Values

ValueCountFrequency (%)
(주)대인농산 292
28.9%
인천원예농협 253
25.0%
인천농산(주) 234
23.1%
덕풍청과(주) 232
22.9%

Length

2024-04-21T10:18:37.617666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:18:37.711629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주)대인농산 292
28.9%
인천원예농협 253
25.0%
인천농산(주 234
23.1%
덕풍청과(주 232
22.9%

품목
Text

Distinct452
Distinct (%)44.7%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
2024-04-21T10:18:37.897851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length18
Mean length9.6874382
Min length5

Characters and Unicode

Total characters9794
Distinct characters303
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique167 ?
Unique (%)16.5%

Sample

1st row가지(가지(일반))
2nd row가지(건가지)
3rd row감귤(비가림감귤)
4th row감귤(성전)
5th row감귤(조생귤)
ValueCountFrequency (%)
전분 14
 
1.3%
14
 
1.3%
마늘(깐마늘 9
 
0.9%
가지(가지(일반 4
 
0.4%
숙주나물(숙주나물(일반 4
 
0.4%
사과(기타 4
 
0.4%
사과(미시마 4
 
0.4%
사과(후지 4
 
0.4%
상추(쫑상추 4
 
0.4%
상추(청상추 4
 
0.4%
Other values (446) 985
93.8%
2024-04-21T10:18:38.206754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 1486
 
15.2%
) 1486
 
15.2%
322
 
3.3%
321
 
3.3%
249
 
2.5%
203
 
2.1%
188
 
1.9%
187
 
1.9%
170
 
1.7%
136
 
1.4%
Other values (293) 5046
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6768
69.1%
Open Punctuation 1486
 
15.2%
Close Punctuation 1486
 
15.2%
Space Separator 39
 
0.4%
Other Punctuation 10
 
0.1%
Decimal Number 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
322
 
4.8%
321
 
4.7%
249
 
3.7%
203
 
3.0%
188
 
2.8%
187
 
2.8%
170
 
2.5%
136
 
2.0%
130
 
1.9%
130
 
1.9%
Other values (288) 4732
69.9%
Open Punctuation
ValueCountFrequency (%)
( 1486
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1486
100.0%
Space Separator
ValueCountFrequency (%)
39
100.0%
Other Punctuation
ValueCountFrequency (%)
, 10
100.0%
Decimal Number
ValueCountFrequency (%)
1 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6768
69.1%
Common 3026
30.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
322
 
4.8%
321
 
4.7%
249
 
3.7%
203
 
3.0%
188
 
2.8%
187
 
2.8%
170
 
2.5%
136
 
2.0%
130
 
1.9%
130
 
1.9%
Other values (288) 4732
69.9%
Common
ValueCountFrequency (%)
( 1486
49.1%
) 1486
49.1%
39
 
1.3%
, 10
 
0.3%
1 5
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6768
69.1%
ASCII 3026
30.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 1486
49.1%
) 1486
49.1%
39
 
1.3%
, 10
 
0.3%
1 5
 
0.2%
Hangul
ValueCountFrequency (%)
322
 
4.8%
321
 
4.7%
249
 
3.7%
203
 
3.0%
188
 
2.8%
187
 
2.8%
170
 
2.5%
136
 
2.0%
130
 
1.9%
130
 
1.9%
Other values (288) 4732
69.9%

물량
Real number (ℝ)

HIGH CORRELATION 

Distinct807
Distinct (%)79.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11512.801
Minimum0.2
Maximum401120
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.0 KiB
2024-04-21T10:18:38.328397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.2
5-th percentile9
Q1148
median1192
Q35962.5
95-th percentile51406.5
Maximum401120
Range401119.8
Interquartile range (IQR)5814.5

Descriptive statistics

Standard deviation34968.54
Coefficient of variation (CV)3.0373615
Kurtosis54.065902
Mean11512.801
Median Absolute Deviation (MAD)1171
Skewness6.5452785
Sum11639442
Variance1.2227988 × 109
MonotonicityNot monotonic
2024-04-21T10:18:38.453254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10.0 17
 
1.7%
20.0 14
 
1.4%
4.0 12
 
1.2%
40.0 9
 
0.9%
60.0 8
 
0.8%
30.0 8
 
0.8%
50.0 6
 
0.6%
16.0 6
 
0.6%
200.0 5
 
0.5%
2.0 5
 
0.5%
Other values (797) 921
91.1%
ValueCountFrequency (%)
0.2 1
 
0.1%
0.35 1
 
0.1%
0.44 1
 
0.1%
0.6 1
 
0.1%
1.0 5
0.5%
1.2 1
 
0.1%
1.5 2
 
0.2%
2.0 5
0.5%
2.1 1
 
0.1%
2.12 1
 
0.1%
ValueCountFrequency (%)
401120.0 1
0.1%
369934.0 1
0.1%
345946.4 1
0.1%
337884.0 1
0.1%
326445.0 1
0.1%
269011.0 1
0.1%
224030.0 1
0.1%
186932.0 1
0.1%
176270.0 1
0.1%
160248.0 1
0.1%

금액
Real number (ℝ)

HIGH CORRELATION 

Distinct933
Distinct (%)92.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34325645
Minimum2000
Maximum9.512025 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.0 KiB
2024-04-21T10:18:38.595296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2000
5-th percentile40000
Q1590000
median4675000
Q325697350
95-th percentile1.784404 × 108
Maximum9.512025 × 108
Range9.512005 × 108
Interquartile range (IQR)25107350

Descriptive statistics

Standard deviation81210181
Coefficient of variation (CV)2.3658748
Kurtosis29.829536
Mean34325645
Median Absolute Deviation (MAD)4576000
Skewness4.6207709
Sum3.4703227 × 1010
Variance6.5950934 × 1015
MonotonicityNot monotonic
2024-04-21T10:18:38.711931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100000 5
 
0.5%
20000 5
 
0.5%
30000 4
 
0.4%
50000 4
 
0.4%
120000 4
 
0.4%
92000 4
 
0.4%
132000 4
 
0.4%
6000 3
 
0.3%
34000 3
 
0.3%
90000 3
 
0.3%
Other values (923) 972
96.1%
ValueCountFrequency (%)
2000 1
 
0.1%
3000 1
 
0.1%
4000 1
 
0.1%
5000 1
 
0.1%
6000 3
0.3%
9000 3
0.3%
10000 2
0.2%
12800 1
 
0.1%
13000 2
0.2%
14000 1
 
0.1%
ValueCountFrequency (%)
951202500 1
0.1%
699273500 1
0.1%
611608200 1
0.1%
503768000 1
0.1%
481959100 1
0.1%
459715200 1
0.1%
438502500 1
0.1%
429670700 1
0.1%
416978500 1
0.1%
398847000 1
0.1%

Interactions

2024-04-21T10:18:37.228395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:18:36.985069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:18:37.330289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:18:37.126745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T10:18:38.785103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인명물량금액
법인명1.0000.0430.044
물량0.0431.0000.830
금액0.0440.8301.000
2024-04-21T10:18:38.860375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
물량금액법인명
물량1.0000.9550.027
금액0.9551.0000.028
법인명0.0270.0281.000

Missing values

2024-04-21T10:18:37.430245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T10:18:37.499989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

법인명품목물량금액
0(주)대인농산가지(가지(일반))11350.050457500
1(주)대인농산가지(건가지)1.021000
2(주)대인농산감귤(비가림감귤)1285.04957500
3(주)대인농산감귤(성전)1545.09091000
4(주)대인농산감귤(조생귤)860.06427000
5(주)대인농산감귤(천헤향)17854.0110487500
6(주)대인농산감귤(카라향)1355.04925000
7(주)대인농산감자(감자(수입))1160.02088000
8(주)대인농산감자(기타)28090.047846400
9(주)대인농산감자(돼지감자)631.0598000
법인명품목물량금액
1001인천원예농협피망(단고추)(반홍피망)290.01484000
1002인천원예농협피망(단고추)(청피망)4097.025151100
1003인천원예농협피망(단고추)(홍피망)132.0547000
1004인천원예농협호박(기타)55.0295000
1005인천원예농협호박(단호박(수입))38790.060859500
1006인천원예농협호박(애호박)30978.0143564000
1007인천원예농협호박(쥬키니호박)2520.036955000
1008인천원예농협호박잎(생호박잎)30.0132000
1009인천원예농협홍고추(홍고추(일반))1182.012179000
1010인천원예농협홍고추(홍청양)110.01303000