Overview

Dataset statistics

Number of variables4
Number of observations478
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)0.4%
Total size in memory15.5 KiB
Average record size in memory33.3 B

Variable types

DateTime1
Categorical1
Numeric1
Text1

Dataset

Description시흥도시공사에서 구매한 장애인 생산품의 구매실적입니다. 구매업체, 금액, 품목 등이 기재되어 있습니다.본 데이터는 집계상황에 따라 업데이트가 다소 지연될 수 있습니다.
Author시흥도시공사
URLhttps://www.data.go.kr/data/15098880/fileData.do

Alerts

Dataset has 2 (0.4%) duplicate rowsDuplicates
구매처 is highly imbalanced (72.6%)Imbalance

Reproduction

Analysis started2024-03-14 20:17:46.629263
Analysis finished2024-03-14 20:17:47.628590
Duration1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct297
Distinct (%)62.1%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
Minimum2021-01-12 00:00:00
Maximum2023-12-21 00:00:00
2024-03-15T05:17:47.856403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T05:17:48.504819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

구매처
Categorical

IMBALANCE 

Distinct21
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
경기장애인생산품판매시설
398 
어우리 터
 
27
늘푸른직업재활원
 
13
위메이드보호작업장
 
9
희망일굼터
 
6
Other values (16)
 
25

Length

Max length26
Median length12
Mean length11.502092
Min length2

Unique

Unique11 ?
Unique (%)2.3%

Sample

1st row경기장애인생산품판매시설
2nd row경기장애인생산품판매시설
3rd row어우리 터
4th row성남시한가람보호작업장
5th row경기장애인생산품판매시설

Common Values

ValueCountFrequency (%)
경기장애인생산품판매시설 398
83.3%
어우리 터 27
 
5.6%
늘푸른직업재활원 13
 
2.7%
위메이드보호작업장 9
 
1.9%
희망일굼터 6
 
1.3%
인천장애인생산품판매시설 4
 
0.8%
내음공간 3
 
0.6%
(사)장애인생산품판매지원협회 아름다운동행 사업단 3
 
0.6%
강원복지회(태백장애인근로사업장) 2
 
0.4%
(사)대한문화체육교육협회 가구사업부 2
 
0.4%
Other values (11) 11
 
2.3%

Length

2024-03-15T05:17:48.917670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기장애인생산품판매시설 398
77.0%
어우리 27
 
5.2%
27
 
5.2%
늘푸른직업재활원 13
 
2.5%
위메이드보호작업장 9
 
1.7%
희망일굼터 6
 
1.2%
사업단 4
 
0.8%
인천장애인생산품판매시설 4
 
0.8%
내음공간 3
 
0.6%
사)장애인생산품판매지원협회 3
 
0.6%
Other values (18) 23
 
4.4%

우선구매 실적
Real number (ℝ)

Distinct284
Distinct (%)59.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1159524.2
Minimum22900
Maximum49056600
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2024-03-15T05:17:49.309911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum22900
5-th percentile65000
Q1215750
median533000
Q31372500
95-th percentile2923485
Maximum49056600
Range49033700
Interquartile range (IQR)1156750

Descriptive statistics

Standard deviation2962512.1
Coefficient of variation (CV)2.5549377
Kurtosis151.34865
Mean1159524.2
Median Absolute Deviation (MAD)371250
Skewness10.621996
Sum5.5425258 × 108
Variance8.776478 × 1012
MonotonicityNot monotonic
2024-03-15T05:17:49.780159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
220000 10
 
2.1%
1936000 10
 
2.1%
380000 9
 
1.9%
840000 9
 
1.9%
73000 9
 
1.9%
1662500 8
 
1.7%
565750 8
 
1.7%
250800 8
 
1.7%
1567500 8
 
1.7%
1913300 7
 
1.5%
Other values (274) 392
82.0%
ValueCountFrequency (%)
22900 1
 
0.2%
24000 1
 
0.2%
25000 1
 
0.2%
28000 1
 
0.2%
31000 1
 
0.2%
33300 1
 
0.2%
34500 1
 
0.2%
36000 1
 
0.2%
46000 1
 
0.2%
51000 3
0.6%
ValueCountFrequency (%)
49056600 1
0.2%
21710510 1
0.2%
17700000 1
0.2%
14067000 1
0.2%
13274500 1
0.2%
11310000 1
0.2%
10986000 1
0.2%
8999500 1
0.2%
8097000 1
0.2%
7980000 1
0.2%
Distinct92
Distinct (%)19.2%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2024-03-15T05:17:50.596195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length86
Median length76
Mean length9.2029289
Min length5

Characters and Unicode

Total characters4399
Distinct characters131
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)9.4%

Sample

1st row시흥도시공사
2nd row시흥도시공사 대외협력팀
3rd row시흥도시공사 교통사업팀
4th row시흥도시공사
5th row시흥도시공사
ValueCountFrequency (%)
시흥도시공사 350
53.4%
abc행복학습타운 23
 
3.5%
공공체육시설 16
 
2.4%
갯골생태공원 16
 
2.4%
체육시설3부 13
 
2.0%
공원레저부 11
 
1.7%
체육시설1부 10
 
1.5%
국민체육센터 10
 
1.5%
능곡어울림센터 10
 
1.5%
교통사업팀 10
 
1.5%
Other values (67) 186
28.4%
2024-03-15T05:17:51.669462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
772
17.5%
436
 
9.9%
386
 
8.8%
371
 
8.4%
366
 
8.3%
366
 
8.3%
83
 
1.9%
66
 
1.5%
66
 
1.5%
54
 
1.2%
Other values (121) 1433
32.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3869
88.0%
Space Separator 371
 
8.4%
Uppercase Letter 66
 
1.5%
Other Punctuation 47
 
1.1%
Decimal Number 40
 
0.9%
Lowercase Letter 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
772
20.0%
436
 
11.3%
386
 
10.0%
366
 
9.5%
366
 
9.5%
83
 
2.1%
66
 
1.7%
66
 
1.7%
54
 
1.4%
53
 
1.4%
Other values (109) 1221
31.6%
Decimal Number
ValueCountFrequency (%)
3 15
37.5%
1 15
37.5%
2 9
22.5%
4 1
 
2.5%
Uppercase Letter
ValueCountFrequency (%)
C 22
33.3%
B 22
33.3%
A 22
33.3%
Lowercase Letter
ValueCountFrequency (%)
c 2
33.3%
b 2
33.3%
a 2
33.3%
Space Separator
ValueCountFrequency (%)
371
100.0%
Other Punctuation
ValueCountFrequency (%)
, 47
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3869
88.0%
Common 458
 
10.4%
Latin 72
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
772
20.0%
436
 
11.3%
386
 
10.0%
366
 
9.5%
366
 
9.5%
83
 
2.1%
66
 
1.7%
66
 
1.7%
54
 
1.4%
53
 
1.4%
Other values (109) 1221
31.6%
Common
ValueCountFrequency (%)
371
81.0%
, 47
 
10.3%
3 15
 
3.3%
1 15
 
3.3%
2 9
 
2.0%
4 1
 
0.2%
Latin
ValueCountFrequency (%)
C 22
30.6%
B 22
30.6%
A 22
30.6%
c 2
 
2.8%
b 2
 
2.8%
a 2
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3869
88.0%
ASCII 530
 
12.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
772
20.0%
436
 
11.3%
386
 
10.0%
366
 
9.5%
366
 
9.5%
83
 
2.1%
66
 
1.7%
66
 
1.7%
54
 
1.4%
53
 
1.4%
Other values (109) 1221
31.6%
ASCII
ValueCountFrequency (%)
371
70.0%
, 47
 
8.9%
C 22
 
4.2%
B 22
 
4.2%
A 22
 
4.2%
3 15
 
2.8%
1 15
 
2.8%
2 9
 
1.7%
c 2
 
0.4%
b 2
 
0.4%
Other values (2) 3
 
0.6%

Interactions

2024-03-15T05:17:46.861048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T05:17:51.879014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구매처우선구매 실적거래처
구매처1.0000.7430.000
우선구매 실적0.7431.0000.000
거래처0.0000.0001.000
2024-03-15T05:17:52.121415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
우선구매 실적구매처
우선구매 실적1.0000.441
구매처0.4411.000

Missing values

2024-03-15T05:17:47.207129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T05:17:47.516220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구입일자구매처우선구매 실적거래처
02021-12-24경기장애인생산품판매시설7980000시흥도시공사
12021-12-20경기장애인생산품판매시설1000000시흥도시공사 대외협력팀
22021-12-14어우리 터1567500시흥도시공사 교통사업팀
32021-12-13성남시한가람보호작업장1750000시흥도시공사
42021-12-09경기장애인생산품판매시설1134000시흥도시공사
52021-12-09경기장애인생산품판매시설5312000시흥도시공사
62021-12-09어우리 터250800시흥도시공사 공공체육시설
72021-12-09어우리 터1662500시흥도시공사 ABC행복학습타운
82021-12-07늘푸른직업재활원1936000시흥도시공사
92021-12-07경기장애인생산품판매시설186000시흥도시공사 스포츠시설팀
구입일자구매처우선구매 실적거래처
4682023-11-29경기장애인생산품판매시설402000시흥도시공사 체육시설3부
4692023-12-06경기장애인생산품판매시설3165000시흥도시공사
4702023-12-06경기장애인생산품판매시설1065000시흥도시공사 체육시설1부
4712023-12-12경기장애인생산품판매시설8999500시흥도시공사 경영지원부
4722023-12-14경기장애인생산품판매시설171900시흥도시공사 기획예산부
4732023-12-15경기장애인생산품판매시설739000시흥도시공사 체육시설1부
4742023-12-15내음공간49056600시흥도시공사
4752023-12-18경기장애인생산품판매시설420000시흥도시공사 환경관리부
4762023-12-19희망일굼터240000시흥도시공사
4772023-12-21경기장애인생산품판매시설1236000시흥도시공사 재무관리부

Duplicate rows

Most frequently occurring

구입일자구매처우선구매 실적거래처# duplicates
02021-07-14경기장애인생산품판매시설189000시흥도시공사2
12022-08-22경기장애인생산품판매시설598400시흥도시공사2