Overview

Dataset statistics

Number of variables6
Number of observations106
Missing cells12
Missing cells (%)1.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.3 KiB
Average record size in memory51.2 B

Variable types

Numeric1
Text4
Categorical1

Dataset

Description충청남도 보령시의 대기배출시설현황(업체명, 소재지 주소, 업종, 종별, 전화번호 등의 항목을 제공)에 대한 데이터입니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=337&beforeMenuCd=DOM_000000201001001000&publicdatapk=15083824

Alerts

전화번호 has 12 (11.3%) missing valuesMissing
연번 has unique valuesUnique
소재지 has unique valuesUnique

Reproduction

Analysis started2024-01-09 20:03:17.223741
Analysis finished2024-01-09 20:03:18.392340
Duration1.17 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct106
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean53.5
Minimum1
Maximum106
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-01-10T05:03:18.502366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.25
Q127.25
median53.5
Q379.75
95-th percentile100.75
Maximum106
Range105
Interquartile range (IQR)52.5

Descriptive statistics

Standard deviation30.743563
Coefficient of variation (CV)0.57464604
Kurtosis-1.2
Mean53.5
Median Absolute Deviation (MAD)26.5
Skewness0
Sum5671
Variance945.16667
MonotonicityStrictly increasing
2024-01-10T05:03:18.746060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
81 1
 
0.9%
79 1
 
0.9%
78 1
 
0.9%
77 1
 
0.9%
76 1
 
0.9%
75 1
 
0.9%
74 1
 
0.9%
73 1
 
0.9%
72 1
 
0.9%
Other values (96) 96
90.6%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
106 1
0.9%
105 1
0.9%
104 1
0.9%
103 1
0.9%
102 1
0.9%
101 1
0.9%
100 1
0.9%
99 1
0.9%
98 1
0.9%
97 1
0.9%
Distinct104
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size980.0 B
2024-01-10T05:03:19.049167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length19
Mean length9.6226415
Min length3

Characters and Unicode

Total characters1020
Distinct characters210
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique102 ?
Unique (%)96.2%

Sample

1st row영보연탄
2nd row보령자동차공업사
3rd row㈜보령레미콘
4th row보령플라이애쉬시멘트산업㈜ 주교지점
5th row해태스치로폴
ValueCountFrequency (%)
주식회사 4
 
3.1%
만세보령농협쌀조합공동사업법인 2
 
1.6%
보령시 2
 
1.6%
한국가스공사 2
 
1.6%
전북지역본부 2
 
1.6%
삼원환경산업㈜ 2
 
1.6%
주택관리공단㈜ 1
 
0.8%
영보연탄 1
 
0.8%
에이지티㈜ 1
 
0.8%
㈜대천리조트 1
 
0.8%
Other values (110) 110
85.9%
2024-01-10T05:03:19.524639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52
 
5.1%
35
 
3.4%
32
 
3.1%
31
 
3.0%
30
 
2.9%
27
 
2.6%
1 24
 
2.4%
22
 
2.2%
22
 
2.2%
21
 
2.1%
Other values (200) 724
71.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 831
81.5%
Decimal Number 82
 
8.0%
Other Symbol 52
 
5.1%
Space Separator 22
 
2.2%
Dash Punctuation 10
 
1.0%
Close Punctuation 10
 
1.0%
Open Punctuation 10
 
1.0%
Uppercase Letter 2
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
4.2%
32
 
3.9%
31
 
3.7%
30
 
3.6%
27
 
3.2%
22
 
2.6%
21
 
2.5%
18
 
2.2%
17
 
2.0%
17
 
2.0%
Other values (182) 581
69.9%
Decimal Number
ValueCountFrequency (%)
1 24
29.3%
0 19
23.2%
9 8
 
9.8%
6 8
 
9.8%
4 7
 
8.5%
7 6
 
7.3%
5 6
 
7.3%
2 4
 
4.9%
Close Punctuation
ValueCountFrequency (%)
) 9
90.0%
] 1
 
10.0%
Open Punctuation
ValueCountFrequency (%)
( 9
90.0%
[ 1
 
10.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
50.0%
F 1
50.0%
Other Symbol
ValueCountFrequency (%)
52
100.0%
Space Separator
ValueCountFrequency (%)
22
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 883
86.6%
Common 135
 
13.2%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
5.9%
35
 
4.0%
32
 
3.6%
31
 
3.5%
30
 
3.4%
27
 
3.1%
22
 
2.5%
21
 
2.4%
18
 
2.0%
17
 
1.9%
Other values (183) 598
67.7%
Common
ValueCountFrequency (%)
1 24
17.8%
22
16.3%
0 19
14.1%
- 10
7.4%
) 9
 
6.7%
( 9
 
6.7%
9 8
 
5.9%
6 8
 
5.9%
4 7
 
5.2%
7 6
 
4.4%
Other values (5) 13
9.6%
Latin
ValueCountFrequency (%)
A 1
50.0%
F 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 831
81.5%
ASCII 137
 
13.4%
None 52
 
5.1%

Most frequent character per block

None
ValueCountFrequency (%)
52
100.0%
Hangul
ValueCountFrequency (%)
35
 
4.2%
32
 
3.9%
31
 
3.7%
30
 
3.6%
27
 
3.2%
22
 
2.6%
21
 
2.5%
18
 
2.2%
17
 
2.0%
17
 
2.0%
Other values (182) 581
69.9%
ASCII
ValueCountFrequency (%)
1 24
17.5%
22
16.1%
0 19
13.9%
- 10
7.3%
) 9
 
6.6%
( 9
 
6.6%
9 8
 
5.8%
6 8
 
5.8%
4 7
 
5.1%
7 6
 
4.4%
Other values (7) 15
10.9%

소재지
Text

UNIQUE 

Distinct106
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size980.0 B
2024-01-10T05:03:19.908141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length36
Mean length20.688679
Min length13

Characters and Unicode

Total characters2193
Distinct characters130
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique106 ?
Unique (%)100.0%

Sample

1st row충남 보령시 청라면 의평리 238-1
2nd row충남 보령시 동대동 835-5
3rd row충남 보령시 미산면 도화담 310-2
4th row충남 보령시 주교면 은포리 산57-6
5th row충남 보령시 청소면 야현리 35-4
ValueCountFrequency (%)
보령시 106
20.4%
충남 86
 
16.5%
충청남도 20
 
3.8%
웅천읍 13
 
2.5%
남포면 12
 
2.3%
주교면 11
 
2.1%
천북면 9
 
1.7%
오천면 9
 
1.7%
청소면 8
 
1.5%
관창리 6
 
1.2%
Other values (190) 240
46.2%
2024-01-10T05:03:20.449604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
468
21.3%
124
 
5.7%
115
 
5.2%
112
 
5.1%
108
 
4.9%
106
 
4.8%
1 72
 
3.3%
2 68
 
3.1%
63
 
2.9%
- 61
 
2.8%
Other values (120) 896
40.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1236
56.4%
Space Separator 468
 
21.3%
Decimal Number 409
 
18.7%
Dash Punctuation 61
 
2.8%
Other Punctuation 7
 
0.3%
Open Punctuation 6
 
0.3%
Close Punctuation 6
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
124
 
10.0%
115
 
9.3%
112
 
9.1%
108
 
8.7%
106
 
8.6%
63
 
5.1%
51
 
4.1%
50
 
4.0%
34
 
2.8%
27
 
2.2%
Other values (105) 446
36.1%
Decimal Number
ValueCountFrequency (%)
1 72
17.6%
2 68
16.6%
3 48
11.7%
5 39
9.5%
4 37
9.0%
0 36
8.8%
9 32
7.8%
7 27
 
6.6%
8 26
 
6.4%
6 24
 
5.9%
Space Separator
ValueCountFrequency (%)
468
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 61
100.0%
Other Punctuation
ValueCountFrequency (%)
, 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1236
56.4%
Common 957
43.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
124
 
10.0%
115
 
9.3%
112
 
9.1%
108
 
8.7%
106
 
8.6%
63
 
5.1%
51
 
4.1%
50
 
4.0%
34
 
2.8%
27
 
2.2%
Other values (105) 446
36.1%
Common
ValueCountFrequency (%)
468
48.9%
1 72
 
7.5%
2 68
 
7.1%
- 61
 
6.4%
3 48
 
5.0%
5 39
 
4.1%
4 37
 
3.9%
0 36
 
3.8%
9 32
 
3.3%
7 27
 
2.8%
Other values (5) 69
 
7.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1236
56.4%
ASCII 957
43.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
468
48.9%
1 72
 
7.5%
2 68
 
7.1%
- 61
 
6.4%
3 48
 
5.0%
5 39
 
4.1%
4 37
 
3.9%
0 36
 
3.8%
9 32
 
3.3%
7 27
 
2.8%
Other values (5) 69
 
7.2%
Hangul
ValueCountFrequency (%)
124
 
10.0%
115
 
9.3%
112
 
9.1%
108
 
8.7%
106
 
8.6%
63
 
5.1%
51
 
4.1%
50
 
4.0%
34
 
2.8%
27
 
2.2%
Other values (105) 446
36.1%

업종
Text

Distinct73
Distinct (%)68.9%
Missing0
Missing (%)0.0%
Memory size980.0 B
2024-01-10T05:03:20.718633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length27
Mean length16.339623
Min length3

Characters and Unicode

Total characters1732
Distinct characters160
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)55.7%

Sample

1st row25)코크스 및 연탄제조시설
2nd row54)기타 비금속 광물제품 제조시설
3rd row34)비료 및 질소화합물제조시설
4th row81)운수장비 수선 및 세차또는 세척시설
5th row81)운수장비 수선 및 세차또는 세척시설
ValueCountFrequency (%)
43
 
14.2%
세척시설 14
 
4.6%
수선 12
 
4.0%
81)운수장비 11
 
3.6%
세차또는 11
 
3.6%
제조시설 7
 
2.3%
제조업 7
 
2.3%
6
 
2.0%
광물제품 5
 
1.7%
기타 5
 
1.7%
Other values (120) 182
60.1%
2024-01-10T05:03:21.161061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
197
 
11.4%
81
 
4.7%
72
 
4.2%
) 63
 
3.6%
1 61
 
3.5%
53
 
3.1%
48
 
2.8%
( 39
 
2.3%
39
 
2.3%
2 37
 
2.1%
Other values (150) 1042
60.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1188
68.6%
Decimal Number 223
 
12.9%
Space Separator 197
 
11.4%
Close Punctuation 67
 
3.9%
Open Punctuation 43
 
2.5%
Other Punctuation 14
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
81
 
6.8%
72
 
6.1%
53
 
4.5%
48
 
4.0%
39
 
3.3%
37
 
3.1%
34
 
2.9%
33
 
2.8%
32
 
2.7%
31
 
2.6%
Other values (133) 728
61.3%
Decimal Number
ValueCountFrequency (%)
1 61
27.4%
2 37
16.6%
3 32
14.3%
5 22
 
9.9%
9 18
 
8.1%
0 18
 
8.1%
8 16
 
7.2%
6 9
 
4.0%
4 7
 
3.1%
7 3
 
1.3%
Close Punctuation
ValueCountFrequency (%)
) 63
94.0%
] 4
 
6.0%
Open Punctuation
ValueCountFrequency (%)
( 39
90.7%
[ 4
 
9.3%
Other Punctuation
ValueCountFrequency (%)
, 12
85.7%
. 2
 
14.3%
Space Separator
ValueCountFrequency (%)
197
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1188
68.6%
Common 544
31.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
81
 
6.8%
72
 
6.1%
53
 
4.5%
48
 
4.0%
39
 
3.3%
37
 
3.1%
34
 
2.9%
33
 
2.8%
32
 
2.7%
31
 
2.6%
Other values (133) 728
61.3%
Common
ValueCountFrequency (%)
197
36.2%
) 63
 
11.6%
1 61
 
11.2%
( 39
 
7.2%
2 37
 
6.8%
3 32
 
5.9%
5 22
 
4.0%
9 18
 
3.3%
0 18
 
3.3%
8 16
 
2.9%
Other values (7) 41
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1188
68.6%
ASCII 544
31.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
197
36.2%
) 63
 
11.6%
1 61
 
11.2%
( 39
 
7.2%
2 37
 
6.8%
3 32
 
5.9%
5 22
 
4.0%
9 18
 
3.3%
0 18
 
3.3%
8 16
 
2.9%
Other values (7) 41
 
7.5%
Hangul
ValueCountFrequency (%)
81
 
6.8%
72
 
6.1%
53
 
4.5%
48
 
4.0%
39
 
3.3%
37
 
3.1%
34
 
2.9%
33
 
2.8%
32
 
2.7%
31
 
2.6%
Other values (133) 728
61.3%

종별
Categorical

Distinct3
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size980.0 B
5
55 
4
48 
3
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5
2nd row5
3rd row4
4th row4
5th row4

Common Values

ValueCountFrequency (%)
5 55
51.9%
4 48
45.3%
3 3
 
2.8%

Length

2024-01-10T05:03:21.309438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:03:21.436195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 55
51.9%
4 48
45.3%
3 3
 
2.8%

전화번호
Text

MISSING 

Distinct91
Distinct (%)96.8%
Missing12
Missing (%)11.3%
Memory size980.0 B
2024-01-10T05:03:21.678593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1128
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)94.7%

Sample

1st row041-932-8050
2nd row041-933-3454
3rd row041-933-6413
4th row041-931-5114
5th row041-934-2568
ValueCountFrequency (%)
041-931-1345 3
 
3.2%
041-931-2513 2
 
2.1%
041-936-0653 1
 
1.1%
041-549-0311 1
 
1.1%
041-930-9862 1
 
1.1%
041-931-6320 1
 
1.1%
041-932-9994 1
 
1.1%
041-932-2212 1
 
1.1%
041-934-0142 1
 
1.1%
041-931-2476 1
 
1.1%
Other values (81) 81
86.2%
2024-01-10T05:03:22.125149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 188
16.7%
1 172
15.2%
4 146
12.9%
0 144
12.8%
3 140
12.4%
9 120
10.6%
2 55
 
4.9%
5 53
 
4.7%
6 44
 
3.9%
8 43
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 940
83.3%
Dash Punctuation 188
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 172
18.3%
4 146
15.5%
0 144
15.3%
3 140
14.9%
9 120
12.8%
2 55
 
5.9%
5 53
 
5.6%
6 44
 
4.7%
8 43
 
4.6%
7 23
 
2.4%
Dash Punctuation
ValueCountFrequency (%)
- 188
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1128
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 188
16.7%
1 172
15.2%
4 146
12.9%
0 144
12.8%
3 140
12.4%
9 120
10.6%
2 55
 
4.9%
5 53
 
4.7%
6 44
 
3.9%
8 43
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1128
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 188
16.7%
1 172
15.2%
4 146
12.9%
0 144
12.8%
3 140
12.4%
9 120
10.6%
2 55
 
4.9%
5 53
 
4.7%
6 44
 
3.9%
8 43
 
3.8%

Interactions

2024-01-10T05:03:18.015280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:03:22.249473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종종별전화번호
연번1.0000.9050.2430.974
업종0.9051.0000.9000.998
종별0.2430.9001.0000.848
전화번호0.9740.9980.8481.000
2024-01-10T05:03:22.364314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번종별
연번1.0000.107
종별0.1071.000

Missing values

2024-01-10T05:03:18.186417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:03:18.318912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명소재지업종종별전화번호
01영보연탄충남 보령시 청라면 의평리 238-125)코크스 및 연탄제조시설5041-932-8050
12보령자동차공업사충남 보령시 동대동 835-554)기타 비금속 광물제품 제조시설5041-933-3454
23㈜보령레미콘충남 보령시 미산면 도화담 310-234)비료 및 질소화합물제조시설4041-933-6413
34보령플라이애쉬시멘트산업㈜ 주교지점충남 보령시 주교면 은포리 산57-681)운수장비 수선 및 세차또는 세척시설4041-931-5114
45해태스치로폴충남 보령시 청소면 야현리 35-481)운수장비 수선 및 세차또는 세척시설4041-934-2568
56현대시멘트㈜충남 보령시 주산면 창암리 산4981)운수장비 수선 및 세차또는 세척시설5041-934-4665
67한국다기화학충남 보령시 웅천읍 대천리 47-181)운수장비 수선 및 세차또는 세척시설5041-933-9308
78제일레미콘㈜충남 보령시 오천면 교성리 산18-1(221-6)15)비알콜성 음료 및 얼음 제조시설5041-931-2231
89㈜모헨즈보령공장충남 보령시 청소면 장곡리 759-4(750-4)81)운수장비 수선 및 세차또는 세척시설4041-934-7135
910주식회사 한솔기업㈜충남 보령시 명천동 산5-181)운수장비 수선 및 세차또는 세척시설4041-934-7660
연번업체명소재지업종종별전화번호
9697만세보령농협쌀조합공동사업법인-청라(164511-0009197)충남 보령시 청라면 황룡리 832-8곡물 도정업[10611]5041-931-1345
9798㈜대창골재충남 보령시 웅천읍 대창리 215-9981. 운수장비 수선 및 세차 또는 세척시설5041-933-0085
9899㈜보난자마이닝충남 보령시 웅천읍 석재단지길 54그 밖의 비금속광물제품 제조업5<NA>
99100중앙중공업충남 보령시 청룡굴길 46자동차종합수리업(95211)5041-932-3735
100101보령시 하수처리시설충남 보령시 대천방조제로 322하수처리4041-930-3114
101102엠에스산업충남 보령시 성주면 성주산로 778폐기물종합재활용업4<NA>
102103㈜케이디에프충남 보령시 주교면 관창공단길 129합성고무, 플라스틱물질 및 플라스틱제품 제조시설 등4041-549-0311
103104주택관리공단㈜ 명천2관리소충남 보령시 주공로 50아파트(난방)4041-936-0653
104105서울시교육청학생교육원충남 보령시 해수욕장3길 26부동산업(임대)5041-931-2513
105106충청북도해양교육원충남 보령시 해수욕장13길 14-17부동산업(임대)4041-931-2513