Overview

Dataset statistics

Number of variables4
Number of observations41
Missing cells1
Missing cells (%)0.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory36.1 B

Variable types

Text2
Numeric1
Categorical1

Dataset

Description창원시설공단이 관리하는 창원시 적현 매립장 폐기물 처리 및 운송업체의 업체명, 대표자명, 사업자등록번호, 사업장 종류 현황
Author창원시설공단
URLhttps://www.data.go.kr/data/3077815/fileData.do

Alerts

사업장종류 is highly imbalanced (71.9%)Imbalance
대표자명 has 1 (2.4%) missing valuesMissing
업체명 has unique valuesUnique

Reproduction

Analysis started2024-03-15 00:08:42.110501
Analysis finished2024-03-15 00:08:43.147837
Duration1.04 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체명
Text

UNIQUE 

Distinct41
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size456.0 B
2024-03-15T09:08:43.756311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length15
Mean length8.3902439
Min length3

Characters and Unicode

Total characters344
Distinct characters111
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)100.0%

Sample

1st row(주)그린자원
2nd row지테크환경(주)
3rd row창원시 생활폐기물 재활용처리 종합단지
4th row(주)에코비트워터(동부맑은물재생센터)
5th row주식회사 공환경
ValueCountFrequency (%)
주식회사 4
 
8.2%
주)그린자원 1
 
2.0%
주일산업 1
 
2.0%
강산환경 1
 
2.0%
미래환경(김해 1
 
2.0%
주)우리산업 1
 
2.0%
다한다환경 1
 
2.0%
주)지구환경 1
 
2.0%
태경환경 1
 
2.0%
주)만수 1
 
2.0%
Other values (36) 36
73.5%
2024-03-15T09:08:45.018690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26
 
7.6%
( 25
 
7.3%
) 25
 
7.3%
16
 
4.7%
15
 
4.4%
12
 
3.5%
12
 
3.5%
9
 
2.6%
9
 
2.6%
6
 
1.7%
Other values (101) 189
54.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 280
81.4%
Open Punctuation 25
 
7.3%
Close Punctuation 25
 
7.3%
Space Separator 9
 
2.6%
Uppercase Letter 5
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
9.3%
16
 
5.7%
15
 
5.4%
12
 
4.3%
12
 
4.3%
9
 
3.2%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.8%
Other values (93) 167
59.6%
Uppercase Letter
ValueCountFrequency (%)
K 1
20.0%
C 1
20.0%
O 1
20.0%
N 1
20.0%
E 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 280
81.4%
Common 59
 
17.2%
Latin 5
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
9.3%
16
 
5.7%
15
 
5.4%
12
 
4.3%
12
 
4.3%
9
 
3.2%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.8%
Other values (93) 167
59.6%
Latin
ValueCountFrequency (%)
K 1
20.0%
C 1
20.0%
O 1
20.0%
N 1
20.0%
E 1
20.0%
Common
ValueCountFrequency (%)
( 25
42.4%
) 25
42.4%
9
 
15.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 280
81.4%
ASCII 64
 
18.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
26
 
9.3%
16
 
5.7%
15
 
5.4%
12
 
4.3%
12
 
4.3%
9
 
3.2%
6
 
2.1%
6
 
2.1%
6
 
2.1%
5
 
1.8%
Other values (93) 167
59.6%
ASCII
ValueCountFrequency (%)
( 25
39.1%
) 25
39.1%
9
 
14.1%
K 1
 
1.6%
C 1
 
1.6%
O 1
 
1.6%
N 1
 
1.6%
E 1
 
1.6%

대표자명
Text

MISSING 

Distinct39
Distinct (%)97.5%
Missing1
Missing (%)2.4%
Memory size456.0 B
2024-03-15T09:08:45.802936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length3
Mean length3.225
Min length3

Characters and Unicode

Total characters129
Distinct characters78
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)95.0%

Sample

1st row김상효
2nd row박재규
3rd row김인석
4th row조영복
5th row양문석
ValueCountFrequency (%)
김인석 2
 
4.9%
김상효 1
 
2.4%
박준규 1
 
2.4%
박철규 1
 
2.4%
신원식 1
 
2.4%
추민형 1
 
2.4%
하진화 1
 
2.4%
이진환 1
 
2.4%
김종원 1
 
2.4%
배유한 1
 
2.4%
Other values (30) 30
73.2%
2024-03-15T09:08:46.691135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
7.0%
8
 
6.2%
5
 
3.9%
5
 
3.9%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
2
 
1.6%
Other values (68) 85
65.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 126
97.7%
Space Separator 1
 
0.8%
Open Punctuation 1
 
0.8%
Close Punctuation 1
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
7.1%
8
 
6.3%
5
 
4.0%
5
 
4.0%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
Other values (65) 82
65.1%
Space Separator
ValueCountFrequency (%)
1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 126
97.7%
Common 3
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
7.1%
8
 
6.3%
5
 
4.0%
5
 
4.0%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
Other values (65) 82
65.1%
Common
ValueCountFrequency (%)
1
33.3%
( 1
33.3%
) 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 126
97.7%
ASCII 3
 
2.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
7.1%
8
 
6.3%
5
 
4.0%
5
 
4.0%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
Other values (65) 82
65.1%
ASCII
ValueCountFrequency (%)
1
33.3%
( 1
33.3%
) 1
33.3%

사업자등록번호
Real number (ℝ)

Distinct40
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.9713189 × 109
Minimum2.1401178 × 109
Maximum7.8186012 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size497.0 B
2024-03-15T09:08:46.926723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.1401178 × 109
5-th percentile3.8786003 × 109
Q16.0881151 × 109
median6.0924754 × 109
Q36.151407 × 109
95-th percentile7.4005012 × 109
Maximum7.8186012 × 109
Range5.6784834 × 109
Interquartile range (IQR)63291908

Descriptive statistics

Standard deviation9.9676248 × 108
Coefficient of variation (CV)0.16692501
Kurtosis6.6634378
Mean5.9713189 × 109
Median Absolute Deviation (MAD)24363098
Skewness-2.0449053
Sum2.4482408 × 1011
Variance9.9353545 × 1017
MonotonicityNot monotonic
2024-03-15T09:08:47.178680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
6778601176 2
 
4.9%
6158191108 1
 
2.4%
6088139327 1
 
2.4%
7688702425 1
 
2.4%
6098154883 1
 
2.4%
6080424196 1
 
2.4%
6098128876 1
 
2.4%
7818601209 1
 
2.4%
7400501233 1
 
2.4%
2140117766 1
 
2.4%
Other values (30) 30
73.2%
ValueCountFrequency (%)
2140117766 1
2.4%
3028508322 1
2.4%
3878600274 1
2.4%
5018701901 1
2.4%
5068104518 1
2.4%
6038149567 1
2.4%
6038173941 1
2.4%
6060684789 1
2.4%
6068112278 1
2.4%
6080424196 1
2.4%
ValueCountFrequency (%)
7818601209 1
2.4%
7688702425 1
2.4%
7400501233 1
2.4%
6778601176 2
4.9%
6220214778 1
2.4%
6178124449 1
2.4%
6158193445 1
2.4%
6158191108 1
2.4%
6151503502 1
2.4%
6151407044 1
2.4%

사업장종류
Categorical

IMBALANCE 

Distinct2
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size456.0 B
운반업체
39 
위탁처리업체
 
2

Length

Max length6
Median length4
Mean length4.097561
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row운반업체
2nd row운반업체
3rd row운반업체
4th row운반업체
5th row운반업체

Common Values

ValueCountFrequency (%)
운반업체 39
95.1%
위탁처리업체 2
 
4.9%

Length

2024-03-15T09:08:47.442377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T09:08:47.766458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
운반업체 39
95.1%
위탁처리업체 2
 
4.9%

Interactions

2024-03-15T09:08:42.415103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T09:08:47.888274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명대표자명사업자등록번호사업장종류
업체명1.0001.0001.0001.000
대표자명1.0001.0001.0001.000
사업자등록번호1.0001.0001.0000.291
사업장종류1.0001.0000.2911.000
2024-03-15T09:08:48.146538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자등록번호사업장종류
사업자등록번호1.0000.245
사업장종류0.2451.000

Missing values

2024-03-15T09:08:42.749207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T09:08:43.035309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명대표자명사업자등록번호사업장종류
0(주)그린자원김상효6158191108운반업체
1지테크환경(주)박재규6038173941운반업체
2창원시 생활폐기물 재활용처리 종합단지<NA>6098305751운반업체
3(주)에코비트워터(동부맑은물재생센터)김인석6778601176운반업체
4주식회사 공환경조영복6088172175운반업체
5(주)엠에스양문석3878600274운반업체
6선명개발양용석6151503502운반업체
7영진자원김연희6220214778운반업체
8(주)에코원류정환6088194470운반업체
9(주)에코비트워터(진해물재생센터)김인석6778601176운반업체
업체명대표자명사업자등록번호사업장종류
31월드자원박철규6090251289운반업체
32(주)서진인바이러테크김태균6088139327운반업체
33(주)교남환경안승길6098146762운반업체
34(주)두리환경정채경5018701901운반업체
35고은상사손형달6090981986운반업체
36우신이엔브이(주)서주환6098117796운반업체
37(주)동아유화김익수6068112278운반업체
38(주)오케이환경이화수6038149567운반업체
39KC환경서비스(주)창원사업부이강욱6088115136위탁처리업체
40동양에코주식회사류용탁5068104518위탁처리업체