Overview

Dataset statistics

Number of variables: 12
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 2
Duplicate rows (%): < 0.1%
Total size in memory: 1.0 MiB
Average record size in memory: 110.0 B

Variable types

Numeric: 5
Categorical: 4
Text: 3

Dataset

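Description: Status of school meal contracts collected through the School Meal Electronic Procurement System (https://school.eat.co.kr:442/index.jsp), including contract name, contract type, contract method, contract amount, purchaser, and place of shipment. The School Meal Electronic Procurement System is an electronic procurement system specialized in meal ingredients, through which schools select suppliers and conclude contracts to obtain ingredients.
URL: https://www.data.go.kr/data/3055737/fileData.do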

Alerts

계약형태명 has constant value "총액계약" (Constant)
Dataset has 2 (< 0.1%) duplicate rows (Duplicates)
계약서일련번호 is highly overall correlated with 계약일자 and 2 other fields (High correlation)
계약일자 is highly overall correlated with 계약서일련번호 and 2 other fields (High correlation)
납품시작일자 is highly overall correlated with 계약서일련번호 and 2 other fields (High correlation)
납품종료일자 is highly overall correlated with 계약서일련번호 and 2 other fields (High correlation)
구매사시도명 is highly overall correlated with 출하자시도명 (High correlation)
출하자시도명 is highly overall correlated with 구매사시도명 (High correlation)
변경계약차수 is highly imbalanced (98.6%) (Imbalance)
계약일자 is highly skewed (γ1 = -31.2114989) (Skewed)
납품시작일자 is highly skewed (γ1 = -38.0159091) (Skewed)
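
The date variables in this report are typed as real numbers, and their values appear to be dates written as YYYYMMDD integers. If that reading is right, statistics such as the range or skewness describe the integer encoding rather than elapsed time. The sketch below is only an illustration under that assumption: `parse_yyyymmdd` is a hypothetical helper, and the two values are the minimum and maximum reported for 계약일자.

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Convert an integer such as 20220224 into a calendar date."""
    return datetime.strptime(str(value), "%Y%m%d").date()


# Minimum and maximum of 계약일자 as listed in this report.
earliest = parse_yyyymmdd(20210927)
latest = parse_yyyymmdd(20220428)

# Difference of the raw integers (what a numeric profile reports as "Range").
integer_range = 20220428 - 20210927
# Elapsed calendar days between the same two dates.
calendar_days = (latest - earliest).days

print(integer_range, calendar_days)
```

The two numbers differ widely, which suggests parsing these columns as dates before interpreting the skewness and correlation alerts.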

Reproduction

Analysis started: 2023-12-12 22:04:02.918324
Analysis finished: 2023-12-12 22:04:08.192964
Duration: 5.27 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

계약서일련번호 (contract serial number)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 9996
Distinct (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3188687.8
Minimum: 3150466
Maximum: 3224377
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 3150466
5-th percentile: 3155762.9
Q1: 3171129
Median: 3188099
Q3: 3207306.5
95-th percentile: 3221083.8
Maximum: 3224377
Range: 73911
Interquartile range (IQR): 36177.5

Descriptive statistics

Standard deviation: 20852.523
Coefficient of variation (CV): 0.0065395311
Kurtosis: -1.1614878
Mean: 3188687.8
Median Absolute Deviation (MAD): 18046.5
Skewness: 0.0015784644
Sum: 3.1886878 × 10^10
Variance: 4.3482773 × 10^8
Monotonicity: Not monotonic
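
Several of the figures above are derived from others, so they can be cross-checked by hand. The sketch below recomputes the range, IQR, coefficient of variation, and variance from the values printed for 계약서일련번호. It assumes the usual definitions (CV = standard deviation / mean, variance = standard deviation squared); small differences from the report are expected because the inputs are rounded.

```python
import math

# Figures copied from the 계약서일련번호 statistics in this report.
minimum, maximum = 3150466, 3224377
q1, q3 = 3171129, 3207306.5
mean, std = 3188687.8, 20852.523

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance

print(value_range, iqr, round(cv, 10), f"{variance:.7e}")
```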
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
3155890 | 4 | < 0.1%
3171172 | 2 | < 0.1%
3191608 | 1 | < 0.1%
3168816 | 1 | < 0.1%
3206188 | 1 | < 0.1%
3159575 | 1 | < 0.1%
3170277 | 1 | < 0.1%
3188663 | 1 | < 0.1%
3195910 | 1 | < 0.1%
3179692 | 1 | < 0.1%
Other values (9986) | 9986 | 99.9%

Minimum 10 values
Value | Count | Frequency (%)
3150466 | 1 | < 0.1%
3150485 | 1 | < 0.1%
3150500 | 1 | < 0.1%
3150508 | 1 | < 0.1%
3150514 | 1 | < 0.1%
3150522 | 1 | < 0.1%
3150540 | 1 | < 0.1%
3150551 | 1 | < 0.1%
3150574 | 1 | < 0.1%
3150590 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
3224377 | 1 | < 0.1%
3224366 | 1 | < 0.1%
3224365 | 1 | < 0.1%
3224362 | 1 | < 0.1%
3224356 | 1 | < 0.1%
3224346 | 1 | < 0.1%
3224327 | 1 | < 0.1%
3224326 | 1 | < 0.1%
3224324 | 1 | < 0.1%
3224321 | 1 | < 0.1%

변경계약차수 (contract amendment round)
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
0: 9987
1: 13

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 9987 | 99.9%
1 | 13 | 0.1%
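
The 98.6% imbalance figure reported for this variable can be reproduced from the two counts above with an entropy-based score, 1 − H / log2(k), where H is the Shannon entropy of the class shares and k is the number of classes. The sketch below shows a formula that matches the reported number; whether the profiling tool computes it in exactly this way is an assumption, and `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (in bits) of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is scored as 0 in this sketch
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts of the values 0 and 1 for 변경계약차수, taken from the table above.
score = imbalance_score([9987, 13])
print(f"{score:.1%}")  # prints 98.6%
```

A perfectly balanced variable scores 0 under this definition, and the score approaches 1 as one class dominates.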

Length

Histogram of lengths of the category


계약명 (contract name)
Text

Distinct: 9784
Distinct (%): 97.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 68
Median length: 53
Mean length: 28.8208
Min length: 5

Characters and Unicode

Total characters: 288208
Distinct characters: 523
Distinct categories: 13
Distinct scripts: 3
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
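
As a rough illustration of how such character properties can be inspected, the sketch below counts characters by Unicode general category using Python's standard `unicodedata` module. The two-letter codes it returns correspond to the category names used in this report (for example `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator, `Ps` and `Pe` are Open and Close Punctuation). The sample string is one contract name from this dataset; `category_counts` is a hypothetical helper, not part of the profiling tool.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category code (e.g. 'Lo', 'Nd', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "2022년 3월 급식 식재료(수산물) 구매 계약"
counts = category_counts(sample)
print(counts)
```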

Unique

Unique: 9597
Unique (%): 96.0%

Sample

1st row: 2022년 3월 급식 식재료(수산물) 구매 계약
2nd row: 2022.3월 성암국제무역고등학교 식재료(공산품) 소액수의
3rd row: 문현중학교 2022년4월분 학교급식 식재료(공산품) 구매계약 체결
4th row: 명주초 영동초 구정초 가금류 종합계약 소액수의
5th row: 화홍고 2022년3월분 급식재료(농산물 잡곡) 견적 요청

Value | Count | Frequency (%)
2022년 | 4302 | 7.5%
3월 | 3965 | 6.9%
식재료 | 3431 | 6.0%
계약 | 3097 | 5.4%
구매 | 2901 | 5.1%
4월 | 2486 | 4.3%
소액수의 | 1970 | 3.4%
견적요청 | 1393 | 2.4%
학교급식 | 1182 | 2.1%
구입 | 1092 | 1.9%
Other values (7948) | 31408 | 54.9%

Most occurring characters

ValueCountFrequency (%)
47551
 
16.5%
2 19662
 
6.8%
11297
 
3.9%
10955
 
3.8%
9915
 
3.4%
8592
 
3.0%
6608
 
2.3%
0 6442
 
2.2%
6291
 
2.2%
5993
 
2.1%
Other values (513) 154902
53.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 193983
67.3%
Space Separator 47551
 
16.5%
Decimal Number 34712
 
12.0%
Close Punctuation 5133
 
1.8%
Open Punctuation 5127
 
1.8%
Other Punctuation 852
 
0.3%
Uppercase Letter 334
 
0.1%
Lowercase Letter 170
 
0.1%
Math Symbol 163
 
0.1%
Dash Punctuation 156
 
0.1%
Other values (3) 27
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11297
 
5.8%
10955
 
5.6%
9915
 
5.1%
8592
 
4.4%
6608
 
3.4%
6291
 
3.2%
5993
 
3.1%
5757
 
3.0%
5716
 
2.9%
5648
 
2.9%
Other values (463) 117211
60.4%
Lowercase Letter
ValueCountFrequency (%)
n 66
38.8%
o 59
34.7%
g 15
 
8.8%
m 15
 
8.8%
e 4
 
2.4%
a 3
 
1.8%
y 2
 
1.2%
b 2
 
1.2%
u 1
 
0.6%
p 1
 
0.6%
Other values (2) 2
 
1.2%
Decimal Number
ValueCountFrequency (%)
2 19662
56.6%
0 6442
 
18.6%
3 5099
 
14.7%
4 3044
 
8.8%
1 331
 
1.0%
5 83
 
0.2%
8 24
 
0.1%
7 18
 
0.1%
6 6
 
< 0.1%
9 3
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
. 817
95.9%
/ 10
 
1.2%
· 9
 
1.1%
: 7
 
0.8%
& 4
 
0.5%
; 3
 
0.4%
# 1
 
0.1%
% 1
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
O 83
24.9%
N 76
22.8%
G 75
22.5%
M 59
17.7%
A 20
 
6.0%
B 19
 
5.7%
T 2
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 4874
95.0%
] 258
 
5.0%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 4868
94.9%
[ 258
 
5.0%
1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 162
99.4%
1
 
0.6%
Space Separator
ValueCountFrequency (%)
47551
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 156
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 23
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 193983
67.3%
Common 93721
32.5%
Latin 504
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11297
 
5.8%
10955
 
5.6%
9915
 
5.1%
8592
 
4.4%
6608
 
3.4%
6291
 
3.2%
5993
 
3.1%
5757
 
3.0%
5716
 
2.9%
5648
 
2.9%
Other values (463) 117211
60.4%
Common
ValueCountFrequency (%)
47551
50.7%
2 19662
21.0%
0 6442
 
6.9%
3 5099
 
5.4%
) 4874
 
5.2%
( 4868
 
5.2%
4 3044
 
3.2%
. 817
 
0.9%
1 331
 
0.4%
[ 258
 
0.3%
Other values (21) 775
 
0.8%
Latin
ValueCountFrequency (%)
O 83
16.5%
N 76
15.1%
G 75
14.9%
n 66
13.1%
M 59
11.7%
o 59
11.7%
A 20
 
4.0%
B 19
 
3.8%
g 15
 
3.0%
m 15
 
3.0%
Other values (9) 17
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 193976
67.3%
ASCII 94211
32.7%
None 11
 
< 0.1%
Compat Jamo 7
 
< 0.1%
Punctuation 2
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
47551
50.5%
2 19662
20.9%
0 6442
 
6.8%
3 5099
 
5.4%
) 4874
 
5.2%
( 4868
 
5.2%
4 3044
 
3.2%
. 817
 
0.9%
1 331
 
0.4%
[ 258
 
0.3%
Other values (35) 1265
 
1.3%
Hangul
ValueCountFrequency (%)
11297
 
5.8%
10955
 
5.6%
9915
 
5.1%
8592
 
4.4%
6608
 
3.4%
6291
 
3.2%
5993
 
3.1%
5757
 
3.0%
5716
 
2.9%
5648
 
2.9%
Other values (460) 117204
60.4%
None
ValueCountFrequency (%)
· 9
81.8%
1
 
9.1%
1
 
9.1%
Compat Jamo
ValueCountFrequency (%)
3
42.9%
3
42.9%
1
 
14.3%
Punctuation
ValueCountFrequency (%)
2
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

계약형태명 (contract type name)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
총액계약: 10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 총액계약
2nd row: 총액계약
3rd row: 총액계약
4th row: 총액계약
5th row: 총액계약

Common Values

Value | Count | Frequency (%)
총액계약 | 10000 | 100.0%

Length

Histogram of lengths of the category


계약일자 (contract date)
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct: 80
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20220246
Minimum: 20210927
Maximum: 20220428
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 20210927
5-th percentile: 20220128
Q1: 20220223
Median: 20220225
Q3: 20220323
95-th percentile: 20220328
Maximum: 20220428
Range: 9501
Interquartile range (IQR): 100

Descriptive statistics

Standard deviation: 277.89736
Coefficient of variation (CV): 1.374352 × 10^-5
Kurtosis: 1015.4989
Mean: 20220246
Median Absolute Deviation (MAD): 3
Skewness: -31.211499
Sum: 2.0220246 × 10^11
Variance: 77226.943
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
20220224 | 1418 | 14.2%
20220225 | 1197 | 12.0%
20220223 | 1152 | 11.5%
20220324 | 907 | 9.1%
20220325 | 799 | 8.0%
20220222 | 725 | 7.2%
20220228 | 676 | 6.8%
20220323 | 536 | 5.4%
20220221 | 391 | 3.9%
20220328 | 302 | 3.0%
Other values (70) | 1897 | 19.0%

Minimum 10 values
Value | Count | Frequency (%)
20210927 | 1 | < 0.1%
20211126 | 1 | < 0.1%
20211228 | 1 | < 0.1%
20211230 | 1 | < 0.1%
20211231 | 5 | 0.1%
20220101 | 4 | < 0.1%
20220103 | 14 | 0.1%
20220104 | 12 | 0.1%
20220105 | 5 | 0.1%
20220106 | 12 | 0.1%

Maximum 10 values
Value | Count | Frequency (%)
20220428 | 1 | < 0.1%
20220420 | 1 | < 0.1%
20220407 | 2 | < 0.1%
20220406 | 2 | < 0.1%
20220404 | 2 | < 0.1%
20220401 | 24 | 0.2%
20220331 | 64 | 0.6%
20220330 | 82 | 0.8%
20220329 | 116 | 1.2%
20220328 | 302 | 3.0%

납품시작일자 (delivery start date)
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct: 86
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20220317
Minimum: 20200302
Maximum: 20220501
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 20200302
5-th percentile: 20220207
Q1: 20220301
Median: 20220302
Q3: 20220401
95-th percentile: 20220401
Maximum: 20220501
Range: 20199
Interquartile range (IQR): 100

Descriptive statistics

Standard deviation: 336.24034
Coefficient of variation (CV): 1.6628836 × 10^-5
Kurtosis: 1722.3114
Mean: 20220317
Median Absolute Deviation (MAD): 1
Skewness: -38.015909
Sum: 2.0220317 × 10^11
Variance: 113057.56
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
20220302 | 2991 | 29.9%
20220401 | 2904 | 29.0%
20220301 | 2209 | 22.1%
20220303 | 693 | 6.9%
20220404 | 200 | 2.0%
20220304 | 96 | 1.0%
20220203 | 94 | 0.9%
20220124 | 82 | 0.8%
20220125 | 60 | 0.6%
20220207 | 57 | 0.6%
Other values (76) | 614 | 6.1%

Minimum 10 values
Value | Count | Frequency (%)
20200302 | 1 | < 0.1%
20210301 | 1 | < 0.1%
20210901 | 1 | < 0.1%
20211001 | 1 | < 0.1%
20211201 | 4 | < 0.1%
20211228 | 1 | < 0.1%
20220101 | 5 | 0.1%
20220103 | 15 | 0.1%
20220104 | 2 | < 0.1%
20220105 | 2 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
20220501 | 1 | < 0.1%
20220426 | 1 | < 0.1%
20220425 | 1 | < 0.1%
20220422 | 1 | < 0.1%
20220421 | 2 | < 0.1%
20220418 | 3 | < 0.1%
20220414 | 3 | < 0.1%
20220413 | 1 | < 0.1%
20220412 | 5 | 0.1%
20220411 | 7 | 0.1%

납품종료일자 (delivery end date)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 108
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20220474
Minimum: 20200331
Maximum: 20230331
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 20200331
5-th percentile: 20220225
Q1: 20220331
Median: 20220331
Q3: 20220429
95-th percentile: 20220430
Maximum: 20230331
Range: 30000
Interquartile range (IQR): 98

Descriptive statistics

Standard deviation: 1081.5586
Coefficient of variation (CV): 5.348829 × 10^-5
Kurtosis: 86.248156
Mean: 20220474
Median Absolute Deviation (MAD): 0
Skewness: 7.6583336
Sum: 2.0220474 × 10^11
Variance: 1169768.9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
20220331 | 5213 | 52.1%
20220429 | 1960 | 19.6%
20220430 | 1036 | 10.4%
20220428 | 197 | 2.0%
20220330 | 165 | 1.7%
20230228 | 108 | 1.1%
20220210 | 92 | 0.9%
20220425 | 85 | 0.9%
20220329 | 83 | 0.8%
20220427 | 83 | 0.8%
Other values (98) | 978 | 9.8%

Minimum 10 values
Value | Count | Frequency (%)
20200331 | 1 | < 0.1%
20211029 | 1 | < 0.1%
20211231 | 1 | < 0.1%
20220103 | 3 | < 0.1%
20220104 | 2 | < 0.1%
20220105 | 1 | < 0.1%
20220106 | 6 | 0.1%
20220110 | 1 | < 0.1%
20220111 | 1 | < 0.1%
20220112 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
20230331 | 1 | < 0.1%
20230228 | 108 | 1.1%
20230224 | 2 | < 0.1%
20230215 | 1 | < 0.1%
20230209 | 1 | < 0.1%
20230131 | 1 | < 0.1%
20230102 | 1 | < 0.1%
20221231 | 10 | 0.1%
20221230 | 3 | < 0.1%
20221227 | 1 | < 0.1%

계약금액 (contract amount)
Real number (ℝ)

Distinct: 9683
Distinct (%): 96.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8498883.4
Minimum: 430
Maximum: 2.2934075 × 10^8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 430
5-th percentile: 589982.5
Q1: 2246450
Median: 4920460
Q3: 11254860
95-th percentile: 27597897
Maximum: 2.2934075 × 10^8
Range: 2.2934032 × 10^8
Interquartile range (IQR): 9008410

Descriptive statistics

Standard deviation: 10016175
Coefficient of variation (CV): 1.1785284
Kurtosis: 36.388994
Mean: 8498883.4
Median Absolute Deviation (MAD): 3403260
Skewness: 3.7316049
Sum: 8.4988834 × 10^10
Variance: 1.0032377 × 10^14
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
2113600 | 8 | 0.1%
2642000 | 6 | 0.1%
2245700 | 5 | 0.1%
2377800 | 5 | 0.1%
528400 | 4 | < 0.1%
3170400 | 4 | < 0.1%
1056800 | 4 | < 0.1%
1215320 | 4 | < 0.1%
1966200 | 4 | < 0.1%
3240000 | 4 | < 0.1%
Other values (9673) | 9952 | 99.5%

Minimum 10 values
Value | Count | Frequency (%)
430 | 1 | < 0.1%
470 | 1 | < 0.1%
480 | 4 | < 0.1%
530 | 1 | < 0.1%
1010 | 1 | < 0.1%
24800 | 1 | < 0.1%
25920 | 1 | < 0.1%
28560 | 1 | < 0.1%
28880 | 1 | < 0.1%
31200 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
229340750 | 1 | < 0.1%
143365248 | 1 | < 0.1%
101094000 | 1 | < 0.1%
99533600 | 1 | < 0.1%
93800180 | 1 | < 0.1%
92984080 | 1 | < 0.1%
85985000 | 1 | < 0.1%
85803920 | 1 | < 0.1%
84814700 | 1 | < 0.1%
83894570 | 1 | < 0.1%

구매사명 (purchaser name)
Text

Distinct: 5447
Distinct (%): 54.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 21
Median length: 18
Mean length: 6.7102
Min length: 4

Characters and Unicode

Total characters: 67102
Distinct characters: 422
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2752
Unique (%): 27.5%

Sample

1st row: 시곡중학교
2nd row: 성암국제무역고등학교
3rd row: 문현중학교
4th row: 영동초등학교
5th row: 화홍고등학교

Value | Count | Frequency (%)
옥산초등학교 | 10 | 0.1%
백운초등학교 | 8 | 0.1%
서울원당초등학교 | 8 | 0.1%
성산초등학교 | 8 | 0.1%
송정초등학교 | 8 | 0.1%
교동초등학교 | 8 | 0.1%
안성중학교 | 8 | 0.1%
대덕초등학교 | 7 | 0.1%
남원중학교 | 7 | 0.1%
전주지곡초등학교 | 7 | 0.1%
Other values (5442) | 9932 | 99.2%

Most occurring characters

ValueCountFrequency (%)
9945
 
14.8%
9830
 
14.6%
7378
 
11.0%
5199
 
7.7%
2432
 
3.6%
2353
 
3.5%
1245
 
1.9%
958
 
1.4%
877
 
1.3%
868
 
1.3%
Other values (412) 26017
38.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67074
> 99.9%
Space Separator 11
 
< 0.1%
Close Punctuation 6
 
< 0.1%
Open Punctuation 6
 
< 0.1%
Decimal Number 3
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9945
 
14.8%
9830
 
14.7%
7378
 
11.0%
5199
 
7.8%
2432
 
3.6%
2353
 
3.5%
1245
 
1.9%
958
 
1.4%
877
 
1.3%
868
 
1.3%
Other values (406) 25989
38.7%
Decimal Number
ValueCountFrequency (%)
2 2
66.7%
3 1
33.3%
Space Separator
ValueCountFrequency (%)
11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67074
> 99.9%
Common 26
 
< 0.1%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9945
 
14.8%
9830
 
14.7%
7378
 
11.0%
5199
 
7.8%
2432
 
3.6%
2353
 
3.5%
1245
 
1.9%
958
 
1.4%
877
 
1.3%
868
 
1.3%
Other values (406) 25989
38.7%
Common
ValueCountFrequency (%)
11
42.3%
) 6
23.1%
( 6
23.1%
2 2
 
7.7%
3 1
 
3.8%
Latin
ValueCountFrequency (%)
e 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67074
> 99.9%
ASCII 28
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9945
 
14.8%
9830
 
14.7%
7378
 
11.0%
5199
 
7.8%
2432
 
3.6%
2353
 
3.5%
1245
 
1.9%
958
 
1.4%
877
 
1.3%
868
 
1.3%
Other values (406) 25989
38.7%
ASCII
ValueCountFrequency (%)
11
39.3%
) 6
21.4%
( 6
21.4%
2 2
 
7.1%
e 2
 
7.1%
3 1
 
3.6%

출하자명 (shipper name)
Text

Distinct: 2494
Distinct (%): 24.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 29
Median length: 20
Mean length: 8.5308
Min length: 2

Characters and Unicode

Total characters: 85308
Distinct characters: 525
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1079
Unique (%): 10.8%

Sample

1st row: 해정수산(주)
2nd row: 정원푸드
3rd row: 혜인푸드라인(주)
4th row: 주식회사 드림농산
5th row: 성한

Value | Count | Frequency (%)
주식회사 | 753 | 6.3%
경기도농수산진흥원 | 374 | 3.1%
농업회사법인 | 231 | 1.9%
수협 | 165 | 1.4%
인천가공물류센터 | 165 | 1.4%
화성푸드통합지원센터 | 163 | 1.4%
농협성남유통센터 | 118 | 1.0%
주)농협하나로유통 | 118 | 1.0%
전주푸드통합지원센터 | 101 | 0.8%
재단법인 | 101 | 0.8%
Other values (2528) | 9633 | 80.8%

Most occurring characters

ValueCountFrequency (%)
3716
 
4.4%
2924
 
3.4%
) 2791
 
3.3%
( 2770
 
3.2%
2492
 
2.9%
2428
 
2.8%
2422
 
2.8%
2396
 
2.8%
2206
 
2.6%
2045
 
2.4%
Other values (515) 59118
69.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 77251
90.6%
Close Punctuation 2807
 
3.3%
Open Punctuation 2786
 
3.3%
Space Separator 1922
 
2.3%
Uppercase Letter 338
 
0.4%
Lowercase Letter 76
 
0.1%
Other Punctuation 67
 
0.1%
Decimal Number 57
 
0.1%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3716
 
4.8%
2924
 
3.8%
2492
 
3.2%
2428
 
3.1%
2422
 
3.1%
2396
 
3.1%
2206
 
2.9%
2045
 
2.6%
1925
 
2.5%
1872
 
2.4%
Other values (465) 52825
68.4%
Uppercase Letter
ValueCountFrequency (%)
F 85
25.1%
S 45
13.3%
C 32
 
9.5%
D 29
 
8.6%
B 23
 
6.8%
M 21
 
6.2%
P 17
 
5.0%
O 17
 
5.0%
L 17
 
5.0%
J 14
 
4.1%
Other values (7) 38
11.2%
Lowercase Letter
ValueCountFrequency (%)
o 12
15.8%
s 8
10.5%
a 7
9.2%
e 6
7.9%
c 6
7.9%
r 6
7.9%
m 6
7.9%
d 5
 
6.6%
f 4
 
5.3%
h 3
 
3.9%
Other values (5) 13
17.1%
Decimal Number
ValueCountFrequency (%)
3 18
31.6%
0 9
15.8%
1 7
 
12.3%
6 7
 
12.3%
2 7
 
12.3%
8 4
 
7.0%
5 2
 
3.5%
7 2
 
3.5%
4 1
 
1.8%
Other Punctuation
ValueCountFrequency (%)
& 38
56.7%
. 24
35.8%
/ 5
 
7.5%
Close Punctuation
ValueCountFrequency (%)
) 2791
99.4%
] 16
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 2770
99.4%
[ 16
 
0.6%
Space Separator
ValueCountFrequency (%)
1922
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 77251
90.6%
Common 7643
 
9.0%
Latin 414
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3716
 
4.8%
2924
 
3.8%
2492
 
3.2%
2428
 
3.1%
2422
 
3.1%
2396
 
3.1%
2206
 
2.9%
2045
 
2.6%
1925
 
2.5%
1872
 
2.4%
Other values (465) 52825
68.4%
Latin
ValueCountFrequency (%)
F 85
20.5%
S 45
 
10.9%
C 32
 
7.7%
D 29
 
7.0%
B 23
 
5.6%
M 21
 
5.1%
P 17
 
4.1%
O 17
 
4.1%
L 17
 
4.1%
J 14
 
3.4%
Other values (22) 114
27.5%
Common
ValueCountFrequency (%)
) 2791
36.5%
( 2770
36.2%
1922
25.1%
& 38
 
0.5%
. 24
 
0.3%
3 18
 
0.2%
[ 16
 
0.2%
] 16
 
0.2%
0 9
 
0.1%
1 7
 
0.1%
Other values (8) 32
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 77251
90.6%
ASCII 8057
 
9.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3716
 
4.8%
2924
 
3.8%
2492
 
3.2%
2428
 
3.1%
2422
 
3.1%
2396
 
3.1%
2206
 
2.9%
2045
 
2.6%
1925
 
2.5%
1872
 
2.4%
Other values (465) 52825
68.4%
ASCII
ValueCountFrequency (%)
) 2791
34.6%
( 2770
34.4%
1922
23.9%
F 85
 
1.1%
S 45
 
0.6%
& 38
 
0.5%
C 32
 
0.4%
D 29
 
0.4%
. 24
 
0.3%
B 23
 
0.3%
Other values (40) 298
 
3.7%

구매사시도명 (purchaser province/city name)
Categorical

HIGH CORRELATION 

Distinct: 27
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
경기도: 3156
서울특별시: 1680
전라북도: 1058
부산광역시: 706
경상남도: 605
Other values (22): 2795

Length

Max length: 7
Median length: 5
Mean length: 4.1105
Min length: 2

Unique

Unique: 3
Unique (%): < 0.1%

Sample

1st row: 경기도
2nd row: 서울특별시
3rd row: 서울특별시
4th row: 강원도
5th row: 경기도

Common Values

Value | Count | Frequency (%)
경기도 | 3156 | 31.6%
서울특별시 | 1680 | 16.8%
전라북도 | 1058 | 10.6%
부산광역시 | 706 | 7.1%
경상남도 | 605 | 6.0%
전라남도 | 566 | 5.7%
광주광역시 | 384 | 3.8%
인천광역시 | 353 | 3.5%
대전광역시 | 342 | 3.4%
강원도 | 300 | 3.0%
Other values (17) | 850 | 8.5%

Length

Histogram of lengths of the category

출하자시도명 (shipper province/city name)
Categorical

HIGH CORRELATION 

Distinct: 22
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
경기도: 3446
전라북도: 1107
서울특별시: 1046
부산광역시: 707
전라남도: 668
Other values (17): 3026

Length

Max length: 7
Median length: 5
Mean length: 4.0385
Min length: 2

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: 경기도
2nd row: 서울특별시
3rd row: 서울특별시
4th row: 강원도
5th row: 경기도

Common Values

Value | Count | Frequency (%)
경기도 | 3446 | 34.5%
전라북도 | 1107 | 11.1%
서울특별시 | 1046 | 10.5%
부산광역시 | 707 | 7.1%
전라남도 | 668 | 6.7%
경상남도 | 607 | 6.1%
인천광역시 | 547 | 5.5%
광주광역시 | 370 | 3.7%
대전광역시 | 343 | 3.4%
강원도 | 294 | 2.9%
Other values (12) | 865 | 8.6%

Length

Histogram of lengths of the category


Correlations

 | 계약서일련번호 | 변경계약차수 | 계약일자 | 납품시작일자 | 납품종료일자 | 계약금액 | 구매사시도명 | 출하자시도명
계약서일련번호 | 1.000 | 0.084 | 0.096 | 0.093 | 0.127 | 0.052 | 0.400 | 0.357
변경계약차수 | 0.084 | 1.000 | 0.227 | 0.000 | 0.010 | 0.000 | 0.000 | 0.000
계약일자 | 0.096 | 0.227 | 1.000 | 0.266 | 0.216 | 0.000 | 0.040 | 0.048
납품시작일자 | 0.093 | 0.000 | 0.266 | 1.000 | 0.727 | 0.000 | 0.087 | 0.082
납품종료일자 | 0.127 | 0.010 | 0.216 | 0.727 | 1.000 | 0.133 | 0.233 | 0.197
계약금액 | 0.052 | 0.000 | 0.000 | 0.000 | 0.133 | 1.000 | 0.178 | 0.163
구매사시도명 | 0.400 | 0.000 | 0.040 | 0.087 | 0.233 | 0.178 | 1.000 | 0.983
출하자시도명 | 0.357 | 0.000 | 0.048 | 0.082 | 0.197 | 0.163 | 0.983 | 1.000

 | 변경계약차수 | 출하자시도명 | 구매사시도명
변경계약차수 | 1.000 | 0.000 | 0.000
출하자시도명 | 0.000 | 1.000 | 0.801
구매사시도명 | 0.000 | 0.801 | 1.000

 | 계약서일련번호 | 계약일자 | 납품시작일자 | 납품종료일자 | 계약금액 | 변경계약차수 | 구매사시도명 | 출하자시도명
계약서일련번호 | 1.000 | 0.935 | 0.786 | 0.649 | 0.009 | 0.064 | 0.156 | 0.140
계약일자 | 0.935 | 1.000 | 0.774 | 0.671 | 0.056 | 0.137 | 0.064 | 0.065
납품시작일자 | 0.786 | 0.774 | 1.000 | 0.666 | -0.026 | 0.000 | 0.048 | 0.049
납품종료일자 | 0.649 | 0.671 | 0.666 | 1.000 | 0.205 | 0.014 | 0.096 | 0.092
계약금액 | 0.009 | 0.056 | -0.026 | 0.205 | 1.000 | 0.000 | 0.075 | 0.070
변경계약차수 | 0.064 | 0.137 | 0.000 | 0.014 | 0.000 | 1.000 | 0.000 | 0.000
구매사시도명 | 0.156 | 0.064 | 0.048 | 0.096 | 0.075 | 0.000 | 1.000 | 0.801
출하자시도명 | 0.140 | 0.065 | 0.049 | 0.092 | 0.070 | 0.000 | 0.801 | 1.000
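
One of the matrices above covers only the three categorical variables. A measure commonly used for pairs of categorical variables is Cramér's V, which rescales the chi-squared statistic of the contingency table to the range 0 to 1. The sketch below implements the plain, uncorrected form. Whether this report used Cramér's V, and whether it applied a bias correction, is an assumption; `cramers_v` is a hypothetical helper and the toy labels are not taken from the dataset rows.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V for two equally long sequences of labels."""
    n = len(xs)
    pair_counts = Counter(zip(xs, ys))
    x_counts, y_counts = Counter(xs), Counter(ys)
    k = min(len(x_counts), len(y_counts))
    if n == 0 or k < 2:
        return 0.0  # assumption: undefined cases are reported as 0 in this sketch
    chi2 = 0.0
    for x, nx in x_counts.items():
        for y, ny in y_counts.items():
            expected = nx * ny / n              # count expected under independence
            observed = pair_counts.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * (k - 1)))


# Toy labels (hypothetical): identical columns versus unrelated columns.
buyers = ["경기도", "경기도", "서울특별시", "서울특별시"]
same = cramers_v(buyers, buyers)
unrelated = cramers_v(buyers, ["경기도", "서울특별시", "경기도", "서울특별시"])
print(same, unrelated)
```

Identical columns give the maximum association and unrelated columns give none. A value near 0.8, as listed for 구매사시도명 and 출하자시도명, would indicate that purchaser and shipper are usually, but not always, in the same region.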

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(row index) | 계약서일련번호 | 변경계약차수 | 계약명 | 계약형태명 | 계약일자 | 납품시작일자 | 납품종료일자 | 계약금액 | 구매사명 | 출하자명 | 구매사시도명 | 출하자시도명
35993 | 3191608 | 0 | 2022년 3월 급식 식재료(수산물) 구매 계약 | 총액계약 | 20220301 | 20220301 | 20220331 | 2408800 | 시곡중학교 | 해정수산(주) | 경기도 | 경기도
24071 | 3178993 | 0 | 2022.3월 성암국제무역고등학교 식재료(공산품) 소액수의 | 총액계약 | 20220223 | 20220302 | 20220331 | 30152000 | 성암국제무역고등학교 | 정원푸드 | 서울특별시 | 서울특별시
57486 | 3216612 | 0 | 문현중학교 2022년4월분 학교급식 식재료(공산품) 구매계약 체결 | 총액계약 | 20220325 | 20220401 | 20220429 | 14140000 | 문현중학교 | 혜인푸드라인(주) | 서울특별시 | 서울특별시
63252 | 3222794 | 0 | 명주초 영동초 구정초 가금류 종합계약 소액수의 | 총액계약 | 20220325 | 20220412 | 20220428 | 322830 | 영동초등학교 | 주식회사 드림농산 | 강원도 | 강원도
9538 | 3163354 | 0 | 화홍고 2022년3월분 급식재료(농산물 잡곡) 견적 요청 | 총액계약 | 20220228 | 20220301 | 20220331 | 12311000 | 화홍고등학교 | 성한 | 경기도 | 경기도
30004 | 3185275 | 0 | 군산서해초등학교 공산품류 소액수의 | 총액계약 | 20220224 | 20220302 | 20220331 | 8415200 | 군산서해초등학교 | 전북친환경생산자영농조합법인 | 전라북도 | 전라북도
28302 | 3183487 | 0 | 서울 광남중학교 2022년 3월 학교급식 식재료 구매(소액수의)계약(수산물) | 총액계약 | 20220228 | 20220301 | 20220331 | 4139800 | 광남중학교 | 주식회사국도에프엔비 | 서울특별시 | 경기도
24431 | 3179384 | 0 | 2022년 3월 이북초등학교 학교급식식품(육류) 구입 | 총액계약 | 20220223 | 20220301 | 20220331 | 975250 | 이북초등학교 | 한수유통 | 경상남도 | 경상남도
16202 | 3170642 | 0 | 인천백학초등학교 2022년 3 4월 식재료 계약(축산물) | 총액계약 | 20220223 | 20220301 | 20220430 | 9092700 | 인천백학초등학교 | (주)동부급식 | 인천광역시 | 인천광역시
17634 | 3172226 | 0 | 2022년 3월 거제양정초등학교 급식용 친환경쌀 구입 | 총액계약 | 20220223 | 20220301 | 20220331 | 3132000 | 거제양정초등학교 | 평화영농조합법인 | 경상남도 | 경상남도

(row index) | 계약서일련번호 | 변경계약차수 | 계약명 | 계약형태명 | 계약일자 | 납품시작일자 | 납품종료일자 | 계약금액 | 구매사명 | 출하자명 | 구매사시도명 | 출하자시도명
28380 | 3183570 | 0 | 2022년 3월 중문초등학교 공산품 식재료 견적요청 | 총액계약 | 20220224 | 20220302 | 20220331 | 10247000 | 중문초등학교 | 우성종합유통 | 제주특별자치도 | 제주특별자치도
1498 | 3152659 | 0 | 사남초 당리초 사동초 가금류 종합계약 소액수의 | 총액계약 | 20220117 | 20220125 | 20220218 | 1192860 | 당리초등학교 | 부산에프에스 | 부산광역시 | 부산광역시
13981 | 3168199 | 0 | 2022년 3월 학교급식 식재료(곡류) 구매계약 | 총액계약 | 20220222 | 20220303 | 20220331 | 680000 | 서연중학교 | 영농조합법인 나눔과 섬김 | 서울특별시 | 경상북도
27802 | 3182960 | 0 | 안화초등학교 3월 곡류 식재료 구매계약 | 총액계약 | 20220224 | 20220301 | 20220331 | 2245700 | 안화초등학교 | 화성푸드통합지원센터 | 경기도 | 경기도
19526 | 3174224 | 0 | 한여울초등학교 2022학년도 3월 학교 급식품(공산품) 식재료 구매 계약 | 총액계약 | 20220223 | 20220301 | 20220331 | 11692260 | 한여울초등학교 | 스쿨푸드 | 경기도 | 경기도
11219 | 3165248 | 0 | 2022년 3월 학교급식(NON-GMO가공식품) 구매 계약 | 총액계약 | 20220222 | 20220301 | 20220330 | 884510 | 수원원일중학교 | 참마루협동조합 | 경기도 | 경기도
49350 | 3207808 | 0 | 4월 대전내동초 학교급식 식재료(김치류) 구매 소액수의 계약 | 총액계약 | 20220324 | 20220401 | 20220430 | 3148740 | 대전내동초등학교 | 농업회사법인 호천식품(주) | 대전광역시 | 대전광역시
39782 | 3195565 | 0 | 2022년 3월 성남서초 학교급식물품(수산물) 구매 | 총액계약 | 20220225 | 20220303 | 20220331 | 1787020 | 성남서초등학교 | (주)농협하나로유통 농협성남유통센터 | 경기도 | 경기도
4321 | 3157241 | 0 | 2022년 3~8월 축산물 소액수의 견적제출 안내공고 | 총액계약 | 20220221 | 20220301 | 20220831 | 21586000 | 덕적고등학교 | (주)동부급식 | 인천광역시 | 인천광역시
2255 | 3153885 | 0 | 도포초등학교 일반농 수 공산품 식재료 견적요청 | 총액계약 | 20220121 | 20220207 | 20220210 | 1022540 | 도포초등학교 | 영암농협하나로마트 | 전라남도 | 전라남도

Duplicate rows

Most frequently occurring

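(row index) | 계약서일련번호 | 변경계약차수 | 계약명 | 계약형태명 | 계약일자 | 납품시작일자 | 납품종료일자 | 계약금액 | 구매사명 | 출하자명 | 구매사시도명 | 출하자시도명 | # duplicates
0 | 3155890 | 1 | 회룡초등학교 농산 식재료 견적요청 | 총액계약 | 20220101 | 20220103 | 20220106 | 1966200 | 회룡초등학교 | 경기도농수산진흥원 | 경기도 | 경기도 | 4
1 | 3171172 | 1 | 한양초등학교 1-2월 김치류 식재료 견적요청 | 총액계약 | 20211231 | 20220124 | 20220210 | 1352600 | 한양초등학교 | 수안보농협남한강김치 | 서울특별시 | 서울특별시 | 2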
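
The table lists two row patterns, occurring 4 and 2 times, which is consistent with the "Duplicate rows: 2" figure in the overview if that figure counts distinct repeated patterns rather than surplus rows. The sketch below shows one way to count such patterns with the standard library. It is an illustration under that assumption, with `duplicate_patterns` as a hypothetical helper and the rows trimmed to two columns for brevity.

```python
from collections import Counter


def duplicate_patterns(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}


# Toy rows shaped like (계약서일련번호, 변경계약차수); only two columns are kept here.
rows = [
    (3155890, "1"), (3155890, "1"), (3155890, "1"), (3155890, "1"),
    (3171172, "1"), (3171172, "1"),
    (3191608, "0"),
]
dups = duplicate_patterns(rows)
print(len(dups), sorted(dups.values(), reverse=True))
```

Dropping the surplus copies would remove 3 + 1 = 4 rows here, which matches the gap between 10000 observations and the 9996 distinct values of 계약서일련번호.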