Overview

Dataset statistics

Number of variables: 4
Number of observations: 63
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.2 KiB
Average record size in memory: 35.1 B

Variable types

Numeric: 1
Categorical: 1
Text: 2

Dataset

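Description: Provides general information (district, company name, representative, and location) on the 63 excreta collection and transport companies operating in Incheon Metropolitan City under Article 45 of the Sewerage Act.
Author: Incheon Metropolitan City
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=3045502&srcSe=7661IVAWM27C61E190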

Alerts

연번 is highly overall correlated with 지역 (High correlation)
지역 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)

Reproduction

Analysis started: 2024-01-28 10:38:12.361554
Analysis finished: 2024-01-28 10:38:12.835307
Duration: 0.47 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 63
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 32
Minimum: 1
Maximum: 63
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 699.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.1
Q1: 16.5
Median: 32
Q3: 47.5
95-th percentile: 59.9
Maximum: 63
Range: 62
Interquartile range (IQR): 31

Descriptive statistics

Standard deviation: 18.330303
Coefficient of variation (CV): 0.57282196
Kurtosis: -1.2
Mean: 32
Median Absolute Deviation (MAD): 16
Skewness: 0
Sum: 2016
Variance: 336
Monotonicity: Strictly increasing
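
The figures above are consistent with 연번 being the run of integers 1 to 63. The following is a minimal sketch, using only the Python standard library, that recomputes several of them under that assumption. The `quantile` helper is hypothetical and uses linear interpolation between ranks, which is an assumption about how the percentiles were derived, not something the report states.

```python
import math
import statistics

# Assumption: 연번 holds the integers 1..63 (Minimum 1, Maximum 63, 63 distinct values).
values = list(range(1, 64))


def quantile(sorted_values, q):
    """Hypothetical helper: quantile by linear interpolation between closest ranks."""
    position = q * (len(sorted_values) - 1)
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (divisor n - 1)
std_dev = math.sqrt(variance)
cv = std_dev / mean                     # coefficient of variation
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation
iqr = quantile(values, 0.75) - quantile(values, 0.25)

print("sum:", sum(values))
print("mean:", mean, "median:", median)
print("variance:", variance, "std:", round(std_dev, 6), "cv:", round(cv, 8))
print("mad:", mad, "iqr:", iqr)
print("p5:", round(quantile(values, 0.05), 1), "p95:", round(quantile(values, 0.95), 1))
```

Under these assumptions the sketch reproduces the reported sum, mean, variance, standard deviation, CV, MAD, quartiles and IQR. Kurtosis and skewness are left out because their exact estimator is not given in the report.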
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 1.6%
2 | 1 | 1.6%
35 | 1 | 1.6%
36 | 1 | 1.6%
37 | 1 | 1.6%
38 | 1 | 1.6%
39 | 1 | 1.6%
40 | 1 | 1.6%
41 | 1 | 1.6%
42 | 1 | 1.6%
Other values (53) | 53 | 84.1%
Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.6%
2 | 1 | 1.6%
3 | 1 | 1.6%
4 | 1 | 1.6%
5 | 1 | 1.6%
6 | 1 | 1.6%
7 | 1 | 1.6%
8 | 1 | 1.6%
9 | 1 | 1.6%
10 | 1 | 1.6%
Maximum 10 values
Value | Count | Frequency (%)
63 | 1 | 1.6%
62 | 1 | 1.6%
61 | 1 | 1.6%
60 | 1 | 1.6%
59 | 1 | 1.6%
58 | 1 | 1.6%
57 | 1 | 1.6%
56 | 1 | 1.6%
55 | 1 | 1.6%
54 | 1 | 1.6%

지역
Categorical

HIGH CORRELATION 

Distinct: 10
Distinct (%): 15.9%
Missing: 0
Missing (%): 0.0%
Memory size: 636.0 B

Length

Max length: 4
Median length: 3
Mean length: 2.9365079
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 중구
2nd row: 중구
3rd row: 중구
4th row: 중구
5th row: 중구

Common Values

Value | Count | Frequency (%)
미추홀구 | 15 | 23.8%
남동구 | 12 | 19.0%
서구 | 11 | 17.5%
중구 | 6 | 9.5%
계양구 | 6 | 9.5%
부평구 | 4 | 6.3%
강화군 | 3 | 4.8%
동구 | 2 | 3.2%
연수구 | 2 | 3.2%
옹진군 | 2 | 3.2%
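
The percentages and length statistics for 지역 follow directly from the counts in this table. As a rough sketch (standard-library Python, with the counts copied by hand from the table rather than read from the dataset), the frequencies, the distinct share and the mean string length can be recomputed like this:

```python
from collections import Counter

# Counts copied from the Common Values table for 지역 (not loaded from the source file).
district_counts = Counter({
    "미추홀구": 15, "남동구": 12, "서구": 11, "중구": 6, "계양구": 6,
    "부평구": 4, "강화군": 3, "동구": 2, "연수구": 2, "옹진군": 2,
})

total = sum(district_counts.values())

# Relative frequency of each district, formatted to one decimal place.
frequencies = {name: f"{count / total:.1%}" for name, count in district_counts.items()}

# Share of distinct values and the count-weighted mean length in characters.
distinct_share = len(district_counts) / total
mean_length = sum(len(name) * count for name, count in district_counts.items()) / total

for name, count in district_counts.most_common():
    print(name, count, frequencies[name])
print("total:", total)
print("distinct (%):", f"{distinct_share:.1%}")
print("mean length:", round(mean_length, 7))
```

The counts sum to 63, matching the number of observations, and the weighted mean length agrees with the reported 2.9365079.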

Length

Histogram of lengths of the category


업체명
Text

Distinct: 58
Distinct (%): 92.1%
Missing: 0
Missing (%): 0.0%
Memory size: 636.0 B

Length

Max length: 10
Median length: 9
Mean length: 5.2380952
Min length: 4

Characters and Unicode

Total characters: 330
Distinct characters: 80
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
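
As an illustration of such character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for two company names taken from the sample rows. The `CATEGORY_NAMES` mapping and the `category_counts` helper are hypothetical names introduced here, and this is not necessarily how the profiling tool derives its own figures.

```python
import unicodedata
from collections import Counter

# Hypothetical mapping from two-letter general category codes to the labels used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "So": "Other Symbol",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
}


def category_counts(values):
    """Count the Unicode general category of every character in the given strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Two 업체명 values from the sample rows: one uses the enclosed symbol, one uses ASCII parentheses.
names = ["㈜새천년인천환경", "(주)한구환경건설"]
counts = category_counts(names)
for label, count in counts.most_common():
    print(label, count)
```

The enclosed symbol counts as a single Other Symbol, while the spelled-out form contributes one Open Punctuation, one Other Letter and one Close Punctuation, which is why both kinds of category appear in the tables for this variable.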

Unique

Unique: 54
Unique (%): 85.7%

Sample

1st row: 경동환경
2nd row: 그린환경
3rd row: ㈜새천년인천환경
4th row: 영종환경
5th row: 용유환경
Value | Count | Frequency (%)
그린환경 | 3 | 4.8%
정진환경 | 2 | 3.2%
신명환경 | 2 | 3.2%
㈜새천년인천환경 | 2 | 3.2%
영부환경 | 1 | 1.6%
경동환경 | 1 | 1.6%
영일환경 | 1 | 1.6%
주은환경㈜ | 1 | 1.6%
푸른환경 | 1 | 1.6%
㈜현대환경 | 1 | 1.6%
Other values (48) | 48 | 76.2%

Most occurring characters

Value | Count | Frequency (%)
| 53 | 16.1%
| 52 | 15.8%
| 25 | 7.6%
| 13 | 3.9%
| 12 | 3.6%
| 10 | 3.0%
| 8 | 2.4%
| 6 | 1.8%
| 6 | 1.8%
| 5 | 1.5%
Other values (70) | 140 | 42.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 295 | 89.4%
Other Symbol | 25 | 7.6%
Close Punctuation | 5 | 1.5%
Open Punctuation | 5 | 1.5%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
| 53 | 18.0%
| 52 | 17.6%
| 13 | 4.4%
| 12 | 4.1%
| 10 | 3.4%
| 8 | 2.7%
| 6 | 2.0%
| 6 | 2.0%
| 5 | 1.7%
| 5 | 1.7%
Other values (67) | 125 | 42.4%

Other Symbol
Value | Count | Frequency (%)
| 25 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 5 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 5 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 320 | 97.0%
Common | 10 | 3.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
| 53 | 16.6%
| 52 | 16.2%
| 25 | 7.8%
| 13 | 4.1%
| 12 | 3.8%
| 10 | 3.1%
| 8 | 2.5%
| 6 | 1.9%
| 6 | 1.9%
| 5 | 1.6%
Other values (68) | 130 | 40.6%

Common
Value | Count | Frequency (%)
) | 5 | 50.0%
( | 5 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 295 | 89.4%
None | 25 | 7.6%
ASCII | 10 | 3.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
| 53 | 18.0%
| 52 | 17.6%
| 13 | 4.4%
| 12 | 4.1%
| 10 | 3.4%
| 8 | 2.7%
| 6 | 2.0%
| 6 | 2.0%
| 5 | 1.7%
| 5 | 1.7%
Other values (67) | 125 | 42.4%

None
Value | Count | Frequency (%)
| 25 | 100.0%

ASCII
Value | Count | Frequency (%)
) | 5 | 50.0%
( | 5 | 50.0%

대표자
Text

Distinct: 56
Distinct (%): 88.9%
Missing: 0
Missing (%): 0.0%
Memory size: 636.0 B

Length

Max length: 7
Median length: 3
Mean length: 3.1111111
Min length: 3

Characters and Unicode

Total characters: 196
Distinct characters: 87
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 50
Unique (%): 79.4%
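
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts only the values that occur exactly once. A minimal sketch of the difference, using the 대표자 values of the first ten sample rows as read from this report (an illustrative subset, not the full column):

```python
from collections import Counter

# 대표자 values of the first ten sample rows (illustrative subset of the 63 records).
representatives = [
    "김효정", "김상렬", "서기자", "김동주", "추화미",
    "홍시영", "황안상", "이신용", "서기자", "박희경",
]

counts = Counter(representatives)

distinct = len(counts)                                   # number of different values
unique = sum(1 for n in counts.values() if n == 1)       # values that occur exactly once
repeated = {name: n for name, n in counts.items() if n > 1}

print("distinct:", distinct)
print("unique:", unique)
print("repeated:", repeated)
```

In this subset 서기자 appears twice, so it counts toward distinct but not toward unique. The same logic explains why the full column reports 56 distinct values but only 50 unique ones.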

Sample

1st row: 김효정
2nd row: 김상렬
3rd row: 서기자
4th row: 김동주
5th row: 추화미
Value | Count | Frequency (%)
서기자 | 3 | 4.5%
조미선 | 2 | 3.0%
| 2 | 3.0%
홍지선 | 2 | 3.0%
1 | 2 | 3.0%
김신영 | 2 | 3.0%
황안상 | 2 | 3.0%
홍시영 | 2 | 3.0%
김효정 | 1 | 1.5%
김도연 | 1 | 1.5%
Other values (48) | 48 | 71.6%

Most occurring characters

Value | Count | Frequency (%)
| 14 | 7.1%
| 8 | 4.1%
| 7 | 3.6%
| 7 | 3.6%
| 6 | 3.1%
| 6 | 3.1%
| 5 | 2.6%
| 5 | 2.6%
| 5 | 2.6%
| 5 | 2.6%
Other values (77) | 128 | 65.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 190 | 96.9%
Space Separator | 4 | 2.0%
Decimal Number | 2 | 1.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
| 14 | 7.4%
| 8 | 4.2%
| 7 | 3.7%
| 7 | 3.7%
| 6 | 3.2%
| 6 | 3.2%
| 5 | 2.6%
| 5 | 2.6%
| 5 | 2.6%
| 5 | 2.6%
Other values (75) | 122 | 64.2%

Space Separator
Value | Count | Frequency (%)
| 4 | 100.0%

Decimal Number
Value | Count | Frequency (%)
1 | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 190 | 96.9%
Common | 6 | 3.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
| 14 | 7.4%
| 8 | 4.2%
| 7 | 3.7%
| 7 | 3.7%
| 6 | 3.2%
| 6 | 3.2%
| 5 | 2.6%
| 5 | 2.6%
| 5 | 2.6%
| 5 | 2.6%
Other values (75) | 122 | 64.2%

Common
Value | Count | Frequency (%)
| 4 | 66.7%
1 | 2 | 33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 190 | 96.9%
ASCII | 6 | 3.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
| 14 | 7.4%
| 8 | 4.2%
| 7 | 3.7%
| 7 | 3.7%
| 6 | 3.2%
| 6 | 3.2%
| 5 | 2.6%
| 5 | 2.6%
| 5 | 2.6%
| 5 | 2.6%
Other values (75) | 122 | 64.2%

ASCII
Value | Count | Frequency (%)
| 4 | 66.7%
1 | 2 | 33.3%

Interactions


Correlations

| 연번 | 지역 | 업체명 | 대표자
연번 | 1.000 | 0.961 | 0.710 | 0.784
지역 | 0.961 | 1.000 | 0.914 | 0.814
업체명 | 0.710 | 0.914 | 1.000 | 0.969
대표자 | 0.784 | 0.814 | 0.969 | 1.000

| 연번 | 지역
연번 | 1.000 | 0.660
지역 | 0.660 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

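First rows
| 연번 | 지역 | 업체명 | 대표자
0 | 1 | 중구 | 경동환경 | 김효정
1 | 2 | 중구 | 그린환경 | 김상렬
2 | 3 | 중구 | ㈜새천년인천환경 | 서기자
3 | 4 | 중구 | 영종환경 | 김동주
4 | 5 | 중구 | 용유환경 | 추화미
5 | 6 | 중구 | 중부환경 | 홍시영
6 | 7 | 동구 | 대우환경㈜ | 황안상
7 | 8 | 동구 | 동구정화조 | 이신용
8 | 9 | 미추홀구 | ㈜인천환경개발공사 | 서기자
9 | 10 | 미추홀구 | 금수정화조 | 박희경

Last rows
| 연번 | 지역 | 업체명 | 대표자
53 | 54 | 서구 | 서인천환경㈜ | 윤태복
54 | 55 | 서구 | 세정실업 | 엄유리
55 | 56 | 서구 | 연희환경 | 홍주형
56 | 57 | 서구 | 정진환경 | 강신도
57 | 58 | 서구 | 청명환경 | 임세진
58 | 59 | 옹진군 | (주)한구환경건설 | 정영미
59 | 60 | 옹진군 | 덕적면사무소 | 김태식
60 | 61 | 강화군 | (주)강화부일환경 | 금철연
61 | 62 | 강화군 | 삼산환경(주) | 한의탁
62 | 63 | 강화군 | 강화환경 | 도경덕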