Overview

Dataset statistics

Number of variables6
Number of observations182
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.0 KiB
Average record size in memory50.7 B

Variable types

Text3
Categorical1
Numeric2

Dataset

Description전라남도 소액 경인쇄 발주 전자 추첨 시스템 발주별 현황 (발주명, 발주부서, 낙찰업체명 등)에 관한 데이터를 조회하실 수 있습니다.
Author전라남도
URLhttps://www.data.go.kr/data/15067470/fileData.do

Alerts

기초금액 is highly overall correlated with 계약금액High correlation
계약금액 is highly overall correlated with 기초금액High correlation
발주명 has unique valuesUnique
추첨일 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:46:04.473644
Analysis finished2023-12-12 21:46:05.585934
Duration1.11 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

발주명
Text

UNIQUE 

Distinct182
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-13T06:46:05.796932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length31.5
Mean length22.10989
Min length12

Characters and Unicode

Total characters4024
Distinct characters266
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique182 ?
Unique (%)100.0%

Sample

1st row2010 지방세정 연찬회 지방세 연구과제 모음집
2nd row2010년 사방사업 홍보책자 제작
3rd row2009년 기준 사업체조사 보고서
4th row제2차 가로경관 10개년 계획
5th row"중소기업 이렇게 도와드립니다" 책자 제작
ValueCountFrequency (%)
제작 97
 
11.2%
책자 54
 
6.2%
발간 44
 
5.1%
2019년 18
 
2.1%
전라남도 17
 
2.0%
인쇄 15
 
1.7%
2022년 12
 
1.4%
사업시행지침서 12
 
1.4%
2020년 11
 
1.3%
2020년도 10
 
1.2%
Other values (287) 576
66.5%
2023-12-13T06:46:06.276495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
684
 
17.0%
2 233
 
5.8%
0 175
 
4.3%
134
 
3.3%
119
 
3.0%
113
 
2.8%
1 108
 
2.7%
86
 
2.1%
85
 
2.1%
84
 
2.1%
Other values (256) 2203
54.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2706
67.2%
Space Separator 684
 
17.0%
Decimal Number 597
 
14.8%
Other Punctuation 9
 
0.2%
Open Punctuation 6
 
0.1%
Close Punctuation 6
 
0.1%
Uppercase Letter 6
 
0.1%
Lowercase Letter 6
 
0.1%
Math Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
134
 
5.0%
119
 
4.4%
113
 
4.2%
86
 
3.2%
85
 
3.1%
84
 
3.1%
84
 
3.1%
76
 
2.8%
74
 
2.7%
64
 
2.4%
Other values (231) 1787
66.0%
Decimal Number
ValueCountFrequency (%)
2 233
39.0%
0 175
29.3%
1 108
18.1%
9 31
 
5.2%
8 16
 
2.7%
7 14
 
2.3%
6 8
 
1.3%
5 4
 
0.7%
3 4
 
0.7%
4 4
 
0.7%
Uppercase Letter
ValueCountFrequency (%)
M 2
33.3%
P 1
16.7%
D 1
16.7%
R 1
16.7%
G 1
16.7%
Lowercase Letter
ValueCountFrequency (%)
a 2
33.3%
n 2
33.3%
o 1
16.7%
t 1
16.7%
Other Punctuation
ValueCountFrequency (%)
· 5
55.6%
" 4
44.4%
Space Separator
ValueCountFrequency (%)
684
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2706
67.2%
Common 1306
32.5%
Latin 12
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
134
 
5.0%
119
 
4.4%
113
 
4.2%
86
 
3.2%
85
 
3.1%
84
 
3.1%
84
 
3.1%
76
 
2.8%
74
 
2.7%
64
 
2.4%
Other values (231) 1787
66.0%
Common
ValueCountFrequency (%)
684
52.4%
2 233
 
17.8%
0 175
 
13.4%
1 108
 
8.3%
9 31
 
2.4%
8 16
 
1.2%
7 14
 
1.1%
6 8
 
0.6%
( 6
 
0.5%
) 6
 
0.5%
Other values (6) 25
 
1.9%
Latin
ValueCountFrequency (%)
M 2
16.7%
a 2
16.7%
n 2
16.7%
o 1
8.3%
t 1
8.3%
P 1
8.3%
D 1
8.3%
R 1
8.3%
G 1
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2706
67.2%
ASCII 1313
32.6%
None 5
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
684
52.1%
2 233
 
17.7%
0 175
 
13.3%
1 108
 
8.2%
9 31
 
2.4%
8 16
 
1.2%
7 14
 
1.1%
6 8
 
0.6%
( 6
 
0.5%
) 6
 
0.5%
Other values (14) 32
 
2.4%
Hangul
ValueCountFrequency (%)
134
 
5.0%
119
 
4.4%
113
 
4.2%
86
 
3.2%
85
 
3.1%
84
 
3.1%
84
 
3.1%
76
 
2.8%
74
 
2.7%
64
 
2.4%
Other values (231) 1787
66.0%
None
ValueCountFrequency (%)
· 5
100.0%

발주부서
Categorical

Distinct33
Distinct (%)18.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
농업기술원
47 
세무회계과
23 
회계과
20 
예산담당관
11 
스마트정보담당관
11 
Other values (28)
70 

Length

Max length9
Median length5
Mean length4.989011
Min length3

Unique

Unique11 ?
Unique (%)6.0%

Sample

1st row세무회계과
2nd row세무회계과
3rd row세무회계과
4th row세무회계과
5th row세무회계과

Common Values

ValueCountFrequency (%)
농업기술원 47
25.8%
세무회계과 23
12.6%
회계과 20
11.0%
예산담당관 11
 
6.0%
스마트정보담당관 11
 
6.0%
세정과 6
 
3.3%
사회재난과 6
 
3.3%
법무담당관 5
 
2.7%
동물방역과 5
 
2.7%
농식품유통과 4
 
2.2%
Other values (23) 44
24.2%

Length

2023-12-13T06:46:06.462229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
농업기술원 47
25.8%
세무회계과 23
12.6%
회계과 20
11.0%
예산담당관 11
 
6.0%
스마트정보담당관 11
 
6.0%
세정과 6
 
3.3%
사회재난과 6
 
3.3%
법무담당관 5
 
2.7%
동물방역과 5
 
2.7%
농식품유통과 4
 
2.2%
Other values (23) 44
24.2%

기초금액
Real number (ℝ)

HIGH CORRELATION 

Distinct141
Distinct (%)77.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7051712
Minimum2240000
Maximum21600000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-13T06:46:06.609185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2240000
5-th percentile3104900
Q14176750
median6309000
Q38862000
95-th percentile15844750
Maximum21600000
Range19360000
Interquartile range (IQR)4685250

Descriptive statistics

Standard deviation3867519.4
Coefficient of variation (CV)0.54845113
Kurtosis2.9357711
Mean7051712
Median Absolute Deviation (MAD)2255532.5
Skewness1.6442981
Sum1.2834116 × 109
Variance1.4957707 × 1013
MonotonicityNot monotonic
2023-12-13T06:46:06.776232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5000000 11
 
6.0%
4000000 8
 
4.4%
9000000 7
 
3.8%
6500000 4
 
2.2%
3575000 3
 
1.6%
4500000 3
 
1.6%
6870000 2
 
1.1%
9500000 2
 
1.1%
8717000 2
 
1.1%
8400000 2
 
1.1%
Other values (131) 138
75.8%
ValueCountFrequency (%)
2240000 1
0.5%
2400000 1
0.5%
2470000 1
0.5%
2500000 1
0.5%
2594000 1
0.5%
2837100 1
0.5%
3020000 1
0.5%
3037650 1
0.5%
3078000 1
0.5%
3100000 1
0.5%
ValueCountFrequency (%)
21600000 1
0.5%
20325000 1
0.5%
20000000 2
1.1%
19600000 1
0.5%
18943200 1
0.5%
16762000 1
0.5%
16247000 1
0.5%
16060000 1
0.5%
15845000 1
0.5%
15840000 1
0.5%

계약금액
Real number (ℝ)

HIGH CORRELATION 

Distinct143
Distinct (%)78.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6442121.4
Minimum2016000
Maximum19440000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-13T06:46:06.942768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2016000
5-th percentile2881000
Q13867000
median5767500
Q38065500
95-th percentile14260131
Maximum19440000
Range17424000
Interquartile range (IQR)4198500

Descriptive statistics

Standard deviation3494978
Coefficient of variation (CV)0.54251974
Kurtosis2.8826593
Mean6442121.4
Median Absolute Deviation (MAD)2077650
Skewness1.6316873
Sum1.1724661 × 109
Variance1.2214872 × 1013
MonotonicityNot monotonic
2023-12-13T06:46:07.106539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4500000 10
 
5.5%
8100000 8
 
4.4%
3600000 7
 
3.8%
4050000 3
 
1.6%
5850000 3
 
1.6%
3217500 2
 
1.1%
7650000 2
 
1.1%
8550000 2
 
1.1%
11250000 2
 
1.1%
6183000 2
 
1.1%
Other values (133) 141
77.5%
ValueCountFrequency (%)
2016000 1
0.5%
2223000 1
0.5%
2277000 1
0.5%
2370000 1
0.5%
2594000 1
0.5%
2600000 1
0.5%
2718000 1
0.5%
2770200 1
0.5%
2878200 1
0.5%
2880000 1
0.5%
ValueCountFrequency (%)
19440000 1
0.5%
18292500 1
0.5%
18000000 2
1.1%
17640000 1
0.5%
17048880 1
0.5%
16762000 1
0.5%
14622300 1
0.5%
14454000 1
0.5%
14260500 1
0.5%
14253120 1
0.5%

추첨일
Text

UNIQUE 

Distinct182
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-13T06:46:07.403388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length22
Mean length21.934066
Min length10

Characters and Unicode

Total characters3992
Distinct characters16
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique182 ?
Unique (%)100.0%

Sample

1st row2010-10-26 오전 08:53:38
2nd row2010-12-16 오전 09:57:32
3rd row2010-12-24 오후 10:53:45
4th row2011-02-15 오후 02:51:26
5th row2011-03-04 오후 05:37:20
ValueCountFrequency (%)
오후 110
 
20.2%
오전 71
 
13.1%
2017-12-06 3
 
0.6%
2021-01-12 3
 
0.6%
2021-02-09 3
 
0.6%
2019-06-11 2
 
0.4%
2019-11-15 2
 
0.4%
2017-09-26 2
 
0.4%
2016-12-21 2
 
0.4%
2020-01-30 2
 
0.4%
Other values (335) 344
63.2%
2023-12-13T06:46:07.924787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 678
17.0%
2 517
13.0%
1 484
12.1%
- 364
9.1%
362
9.1%
: 362
9.1%
181
 
4.5%
3 172
 
4.3%
4 161
 
4.0%
5 136
 
3.4%
Other values (6) 575
14.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2542
63.7%
Dash Punctuation 364
 
9.1%
Space Separator 362
 
9.1%
Other Punctuation 362
 
9.1%
Other Letter 362
 
9.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 678
26.7%
2 517
20.3%
1 484
19.0%
3 172
 
6.8%
4 161
 
6.3%
5 136
 
5.4%
9 129
 
5.1%
8 102
 
4.0%
6 88
 
3.5%
7 75
 
3.0%
Other Letter
ValueCountFrequency (%)
181
50.0%
110
30.4%
71
 
19.6%
Dash Punctuation
ValueCountFrequency (%)
- 364
100.0%
Space Separator
ValueCountFrequency (%)
362
100.0%
Other Punctuation
ValueCountFrequency (%)
: 362
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3630
90.9%
Hangul 362
 
9.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 678
18.7%
2 517
14.2%
1 484
13.3%
- 364
10.0%
362
10.0%
: 362
10.0%
3 172
 
4.7%
4 161
 
4.4%
5 136
 
3.7%
9 129
 
3.6%
Other values (3) 265
 
7.3%
Hangul
ValueCountFrequency (%)
181
50.0%
110
30.4%
71
 
19.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3630
90.9%
Hangul 362
 
9.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 678
18.7%
2 517
14.2%
1 484
13.3%
- 364
10.0%
362
10.0%
: 362
10.0%
3 172
 
4.7%
4 161
 
4.4%
5 136
 
3.7%
9 129
 
3.6%
Other values (3) 265
 
7.3%
Hangul
ValueCountFrequency (%)
181
50.0%
110
30.4%
71
 
19.6%
Distinct78
Distinct (%)42.9%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-13T06:46:08.234685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length5.1813187
Min length3

Characters and Unicode

Total characters943
Distinct characters129
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)13.7%

Sample

1st row옥당인쇄소
2nd row반도문화
3rd row일신인쇄사
4th row반도문화
5th row옥당인쇄소
ValueCountFrequency (%)
반도문화 9
 
4.9%
성훈인쇄 6
 
3.2%
평화인쇄 5
 
2.7%
동방인쇄광고 5
 
2.7%
동성기획 5
 
2.7%
성민종합기획 5
 
2.7%
중앙그래픽 5
 
2.7%
지구인쇄출판사 4
 
2.2%
제일기획 4
 
2.2%
시보사 4
 
2.2%
Other values (70) 133
71.9%
2023-12-13T06:46:08.678977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
87
 
9.2%
56
 
5.9%
39
 
4.1%
39
 
4.1%
38
 
4.0%
35
 
3.7%
29
 
3.1%
29
 
3.1%
27
 
2.9%
23
 
2.4%
Other values (119) 541
57.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 896
95.0%
Close Punctuation 19
 
2.0%
Open Punctuation 16
 
1.7%
Lowercase Letter 4
 
0.4%
Uppercase Letter 4
 
0.4%
Space Separator 3
 
0.3%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
87
 
9.7%
56
 
6.2%
39
 
4.4%
39
 
4.4%
38
 
4.2%
35
 
3.9%
29
 
3.2%
29
 
3.2%
27
 
3.0%
23
 
2.6%
Other values (109) 494
55.1%
Uppercase Letter
ValueCountFrequency (%)
P 1
25.0%
C 1
25.0%
M 1
25.0%
D 1
25.0%
Lowercase Letter
ValueCountFrequency (%)
m 2
50.0%
d 2
50.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 897
95.1%
Common 38
 
4.0%
Latin 8
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
87
 
9.7%
56
 
6.2%
39
 
4.3%
39
 
4.3%
38
 
4.2%
35
 
3.9%
29
 
3.2%
29
 
3.2%
27
 
3.0%
23
 
2.6%
Other values (110) 495
55.2%
Latin
ValueCountFrequency (%)
m 2
25.0%
d 2
25.0%
P 1
12.5%
C 1
12.5%
M 1
12.5%
D 1
12.5%
Common
ValueCountFrequency (%)
) 19
50.0%
( 16
42.1%
3
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 896
95.0%
ASCII 46
 
4.9%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
87
 
9.7%
56
 
6.2%
39
 
4.4%
39
 
4.4%
38
 
4.2%
35
 
3.9%
29
 
3.2%
29
 
3.2%
27
 
3.0%
23
 
2.6%
Other values (109) 494
55.1%
ASCII
ValueCountFrequency (%)
) 19
41.3%
( 16
34.8%
3
 
6.5%
m 2
 
4.3%
d 2
 
4.3%
P 1
 
2.2%
C 1
 
2.2%
M 1
 
2.2%
D 1
 
2.2%
None
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-13T06:46:05.160634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:46:04.917910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:46:05.276087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:46:05.040068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:46:08.784462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발주부서기초금액계약금액낙찰업체명
발주부서1.0000.7940.8050.000
기초금액0.7941.0000.9980.735
계약금액0.8050.9981.0000.769
낙찰업체명0.0000.7350.7691.000
2023-12-13T06:46:08.892019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기초금액계약금액발주부서
기초금액1.0000.9860.397
계약금액0.9861.0000.409
발주부서0.3970.4091.000

Missing values

2023-12-13T06:46:05.402212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:46:05.533975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

발주명발주부서기초금액계약금액추첨일낙찰업체명
02010 지방세정 연찬회 지방세 연구과제 모음집세무회계과319800028782002010-10-26 오전 08:53:38옥당인쇄소
12010년 사방사업 홍보책자 제작세무회계과900000081000002010-12-16 오전 09:57:32반도문화
22009년 기준 사업체조사 보고서세무회계과570000051300002010-12-24 오후 10:53:45일신인쇄사
3제2차 가로경관 10개년 계획세무회계과454677040920002011-02-15 오후 02:51:26반도문화
4"중소기업 이렇게 도와드립니다" 책자 제작세무회계과880800079300002011-03-04 오후 05:37:20옥당인쇄소
52010년 주민등록 인구통계 책자 제작세무회계과371900033470002011-03-10 오후 05:38:24반도문화
62011년 자연재난 표준행동 매뉴얼세무회계과423600038130002011-04-06 오후 04:52:43지구인쇄출판사
7"사랑의 푸드뱅크" 리플렛 제작세무회계과471117143340002011-05-08 오후 09:48:41성민종합기획
82011년도 공공사업 계약심사 사례집세무회계과405338536500002011-07-22 오후 02:09:23평화인쇄
9남도 치유의 숲 조성계획 책자 제작세무회계과470363243200002011-08-29 오전 07:49:00한국인쇄
발주명발주부서기초금액계약금액추첨일낙찰업체명
1722021 전남통계연보 발간스마트정보담당관11600000104400002022-03-04 오후 05:10:07유한회사 호남광고산업
1732021회계연도 성과보고서 제작예산담당관실13000000123500002022-03-09 오후 04:02:33은하수
1742022년 지방도 설계시공 길라잡이 발간도로교통과925000083250002022-03-25 오후 02:27:02(주)장강신문
1752021년도 농촌지도사업보고서 발간농업기술원602745055020002022-04-12 오전 10:20:12디자인아트
176코로나19 예방접종 예진표 인쇄감염병관리과12847000115623002022-04-22 오전 10:05:55(주)프리비
1772021년 현장활용 농업기술 우수성과 책자발간농업기술원394931036000002022-04-25 오전 09:29:31애드필디자인기획
1782022년 농업과학기술 연구개발사업 과제계획서 유인농업기술원657400060500002022-05-11 오전 11:15:07유아이디자인
1792022년 제3회 임용 필기시험 실시계획 책자제작총무과530000050000002022-05-18 오전 09:31:06제일인쇄기획
1802021년도 시험연구보고서 책자 발간농업기술원653000058770002022-07-05 오전 10:08:40정문사
1812020년 기준 전라남도 사업체조사 책자 인쇄스마트정보담당관900000081000002022-11-29㈜엠에스미디어