Overview

Dataset statistics

Number of variables4
Number of observations1221
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory40.7 KiB
Average record size in memory34.1 B

Variable types

Categorical1
Text1
Numeric2

Dataset

Description인천광역시 남촌농산물도매시장에 입점해있는 법인별 거래실적에 대한 데이터로 법인명, 품목, 물량, 금액등을 볼 수 있습니다.
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15051662&srcSe=7661IVAWM27C61E190

Alerts

물량 is highly overall correlated with 금액High correlation
금액 is highly overall correlated with 물량High correlation

Reproduction

Analysis started2024-04-20 18:42:13.911454
Analysis finished2024-04-20 18:42:15.652229
Duration1.74 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

법인명
Categorical

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
(주)대인농산
354 
인천원예농협
300 
덕풍청과(주)
285 
인천농산(주)
282 

Length

Max length7
Median length7
Mean length6.7542998
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row(주)대인농산
2nd row(주)대인농산
3rd row(주)대인농산
4th row(주)대인농산
5th row(주)대인농산

Common Values

ValueCountFrequency (%)
(주)대인농산 354
29.0%
인천원예농협 300
24.6%
덕풍청과(주) 285
23.3%
인천농산(주) 282
23.1%

Length

2024-04-21T03:42:15.722788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:42:15.830755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주)대인농산 354
29.0%
인천원예농협 300
24.6%
덕풍청과(주 285
23.3%
인천농산(주 282
23.1%

품목
Text

Distinct544
Distinct (%)44.6%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
2024-04-21T03:42:16.000417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length18
Mean length9.2932023
Min length5

Characters and Unicode

Total characters11347
Distinct characters325
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique214 ?
Unique (%)17.5%

Sample

1st row가지(가지(일반))
2nd row가지(건가지)
3rd row감귤(극조생감귤)
4th row감귤(기타)
5th row감귤(하우스감귤)
ValueCountFrequency (%)
16
 
1.3%
전분 16
 
1.3%
마늘(깐마늘 8
 
0.6%
가지(가지(일반 4
 
0.3%
상추(쫑상추 4
 
0.3%
셀러리(양미나리)(셀러리(일반 4
 
0.3%
생강(생강(일반 4
 
0.3%
새싹(새싹(일반 4
 
0.3%
새송이(새송이(일반 4
 
0.3%
상추(청상추 4
 
0.3%
Other values (539) 1197
94.6%
2024-04-21T03:42:16.313982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 1718
 
15.1%
) 1718
 
15.1%
359
 
3.2%
354
 
3.1%
272
 
2.4%
229
 
2.0%
225
 
2.0%
210
 
1.9%
187
 
1.6%
150
 
1.3%
Other values (315) 5925
52.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7852
69.2%
Open Punctuation 1718
 
15.1%
Close Punctuation 1718
 
15.1%
Space Separator 44
 
0.4%
Other Punctuation 10
 
0.1%
Decimal Number 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
359
 
4.6%
354
 
4.5%
272
 
3.5%
229
 
2.9%
225
 
2.9%
210
 
2.7%
187
 
2.4%
150
 
1.9%
147
 
1.9%
135
 
1.7%
Other values (309) 5584
71.1%
Decimal Number
ValueCountFrequency (%)
1 4
80.0%
8 1
 
20.0%
Open Punctuation
ValueCountFrequency (%)
( 1718
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1718
100.0%
Space Separator
ValueCountFrequency (%)
44
100.0%
Other Punctuation
ValueCountFrequency (%)
, 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7852
69.2%
Common 3495
30.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
359
 
4.6%
354
 
4.5%
272
 
3.5%
229
 
2.9%
225
 
2.9%
210
 
2.7%
187
 
2.4%
150
 
1.9%
147
 
1.9%
135
 
1.7%
Other values (309) 5584
71.1%
Common
ValueCountFrequency (%)
( 1718
49.2%
) 1718
49.2%
44
 
1.3%
, 10
 
0.3%
1 4
 
0.1%
8 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7852
69.2%
ASCII 3495
30.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 1718
49.2%
) 1718
49.2%
44
 
1.3%
, 10
 
0.3%
1 4
 
0.1%
8 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
359
 
4.6%
354
 
4.5%
272
 
3.5%
229
 
2.9%
225
 
2.9%
210
 
2.7%
187
 
2.4%
150
 
1.9%
147
 
1.9%
135
 
1.7%
Other values (309) 5584
71.1%

물량
Real number (ℝ)

HIGH CORRELATION 

Distinct934
Distinct (%)76.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12622.832
Minimum0.2
Maximum371281
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.9 KiB
2024-04-21T03:42:16.424500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.2
5-th percentile10
Q1132
median960
Q36350
95-th percentile58350
Maximum371281
Range371280.8
Interquartile range (IQR)6218

Descriptive statistics

Standard deviation39689.201
Coefficient of variation (CV)3.144239
Kurtosis40.936016
Mean12622.832
Median Absolute Deviation (MAD)943
Skewness5.8992794
Sum15412478
Variance1.5752326 × 109
MonotonicityNot monotonic
2024-04-21T03:42:16.533869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10.0 20
 
1.6%
20.0 13
 
1.1%
8.0 12
 
1.0%
16.0 12
 
1.0%
40.0 11
 
0.9%
1.0 9
 
0.7%
4.0 9
 
0.7%
12.0 8
 
0.7%
60.0 7
 
0.6%
120.0 7
 
0.6%
Other values (924) 1113
91.2%
ValueCountFrequency (%)
0.2 1
 
0.1%
0.5 1
 
0.1%
1.0 9
0.7%
1.27 1
 
0.1%
1.4 2
 
0.2%
2.0 5
0.4%
2.5 1
 
0.1%
3.0 1
 
0.1%
3.6 1
 
0.1%
4.0 9
0.7%
ValueCountFrequency (%)
371281.0 1
0.1%
368520.0 1
0.1%
354721.8 1
0.1%
353336.0 1
0.1%
348045.0 1
0.1%
343908.0 1
0.1%
335186.0 1
0.1%
309812.0 1
0.1%
305934.0 1
0.1%
303275.0 1
0.1%

금액
Real number (ℝ)

HIGH CORRELATION 

Distinct1108
Distinct (%)90.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26586775
Minimum1000
Maximum1.4785455 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.9 KiB
2024-04-21T03:42:16.649329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile32000
Q1383000
median3147500
Q320183000
95-th percentile1.352474 × 108
Maximum1.4785455 × 109
Range1.4785445 × 109
Interquartile range (IQR)19800000

Descriptive statistics

Standard deviation72861410
Coefficient of variation (CV)2.7405133
Kurtosis139.16851
Mean26586775
Median Absolute Deviation (MAD)3087500
Skewness8.8956997
Sum3.2462452 × 1010
Variance5.3087851 × 1015
MonotonicityNot monotonic
2024-04-21T03:42:16.763124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
40000 10
 
0.8%
48000 7
 
0.6%
32000 6
 
0.5%
50000 6
 
0.5%
10000 5
 
0.4%
200000 5
 
0.4%
22000 4
 
0.3%
20000 4
 
0.3%
24000 4
 
0.3%
148000 4
 
0.3%
Other values (1098) 1166
95.5%
ValueCountFrequency (%)
1000 3
0.2%
5000 2
 
0.2%
6000 3
0.2%
7000 2
 
0.2%
8000 2
 
0.2%
9100 1
 
0.1%
10000 5
0.4%
11000 2
 
0.2%
11500 1
 
0.1%
12000 1
 
0.1%
ValueCountFrequency (%)
1478545500 1
0.1%
590257200 1
0.1%
506007000 1
0.1%
460194500 1
0.1%
457374800 1
0.1%
408151500 1
0.1%
397474100 1
0.1%
377569500 1
0.1%
377079400 1
0.1%
370624500 1
0.1%

Interactions

2024-04-21T03:42:15.375331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:42:15.158510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:42:15.458483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:42:15.290392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T03:42:16.845052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인명물량금액
법인명1.0000.0310.025
물량0.0311.0000.671
금액0.0250.6711.000
2024-04-21T03:42:16.922241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
물량금액법인명
물량1.0000.9550.020
금액0.9551.0000.020
법인명0.0200.0201.000

Missing values

2024-04-21T03:42:15.556942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T03:42:15.621030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

법인명품목물량금액
0(주)대인농산가지(가지(일반))24415.036775500
1(주)대인농산가지(건가지)6.054000
2(주)대인농산감귤(극조생감귤)226321.0377569500
3(주)대인농산감귤(기타)1670.02171000
4(주)대인농산감귤(하우스감귤)23438.055794900
5(주)대인농산감귤(황금향)1200.03696000
6(주)대인농산감자(기타)35985.052778500
7(주)대인농산감자(돼지감자)2342.03456700
8(주)대인농산감자(두백)1260.01585000
9(주)대인농산감자(수미)420.0383000
법인명품목물량금액
1211인천원예농협호박(기타)96.0300000
1212인천원예농협호박(늙은호박)3607.02989300
1213인천원예농협호박(단호박)10180.022168500
1214인천원예농협호박(애호박)53508.0100219900
1215인천원예농협호박(쥬키니호박)3730.06340000
1216인천원예농협호박(풋호박)3900.05814400
1217인천원예농협호박잎(생호박잎)167.5412500
1218인천원예농협호박잎(호박순)50.0200000
1219인천원예농협홍고추(홍고추(일반))5348.017302500
1220인천원예농협홍고추(홍청양)797.03143700