Overview

Dataset statistics

Number of variables6
Number of observations37
Missing cells34
Missing cells (%)15.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory52.6 B

Variable types

Numeric1
Text3
Unsupported2

Dataset

Description환경전문공사업등록현황
Author전라북도
URLhttps://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=202409

Alerts

연번 has 1 (2.7%) missing valuesMissing
업소명 has 1 (2.7%) missing valuesMissing
등록번호 has 26 (70.3%) missing valuesMissing
Unnamed: 3 has 4 (10.8%) missing valuesMissing
영업소 소재지 has 1 (2.7%) missing valuesMissing
전화번호 has 1 (2.7%) missing valuesMissing
등록번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-14 02:28:04.326646
Analysis finished2024-03-14 02:28:05.167804
Duration0.84 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

MISSING 

Distinct36
Distinct (%)100.0%
Missing1
Missing (%)2.7%
Infinite0
Infinite (%)0.0%
Mean18.5
Minimum1
Maximum36
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size465.0 B
2024-03-14T11:28:05.218496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.75
Q19.75
median18.5
Q327.25
95-th percentile34.25
Maximum36
Range35
Interquartile range (IQR)17.5

Descriptive statistics

Standard deviation10.535654
Coefficient of variation (CV)0.5694948
Kurtosis-1.2
Mean18.5
Median Absolute Deviation (MAD)9
Skewness0
Sum666
Variance111
MonotonicityStrictly increasing
2024-03-14T11:28:05.316825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
1 1
 
2.7%
20 1
 
2.7%
22 1
 
2.7%
23 1
 
2.7%
24 1
 
2.7%
25 1
 
2.7%
26 1
 
2.7%
27 1
 
2.7%
28 1
 
2.7%
29 1
 
2.7%
Other values (26) 26
70.3%
ValueCountFrequency (%)
1 1
2.7%
2 1
2.7%
3 1
2.7%
4 1
2.7%
5 1
2.7%
6 1
2.7%
7 1
2.7%
8 1
2.7%
9 1
2.7%
10 1
2.7%
ValueCountFrequency (%)
36 1
2.7%
35 1
2.7%
34 1
2.7%
33 1
2.7%
32 1
2.7%
31 1
2.7%
30 1
2.7%
29 1
2.7%
28 1
2.7%
27 1
2.7%

업소명
Text

MISSING 

Distinct36
Distinct (%)100.0%
Missing1
Missing (%)2.7%
Memory size428.0 B
2024-03-14T11:28:05.501439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length9.5
Mean length7.3888889
Min length3

Characters and Unicode

Total characters266
Distinct characters87
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row광진환경㈜
2nd row㈜유성테크
3rd row(유)지구엔비텍
4th row(유)호일엔지니어링
5th row(주)이앤에스테크
ValueCountFrequency (%)
광진환경㈜ 1
 
2.7%
유)은하환경산업 1
 
2.7%
유)부광건설 1
 
2.7%
유)푸른이엔텍 1
 
2.7%
유)일토씨엔엠 1
 
2.7%
유)미래건설 1
 
2.7%
주)파인리포먼스 1
 
2.7%
유)대신환경개발 1
 
2.7%
유)대금환경 1
 
2.7%
주)대한 1
 
2.7%
Other values (27) 27
73.0%
2024-03-14T11:28:05.789915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 24
 
9.0%
) 24
 
9.0%
19
 
7.1%
11
 
4.1%
10
 
3.8%
8
 
3.0%
8
 
3.0%
8
 
3.0%
7
 
2.6%
7
 
2.6%
Other values (77) 140
52.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 206
77.4%
Open Punctuation 24
 
9.0%
Close Punctuation 24
 
9.0%
Other Symbol 11
 
4.1%
Space Separator 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
9.2%
10
 
4.9%
8
 
3.9%
8
 
3.9%
8
 
3.9%
7
 
3.4%
7
 
3.4%
7
 
3.4%
6
 
2.9%
6
 
2.9%
Other values (73) 120
58.3%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Other Symbol
ValueCountFrequency (%)
11
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 217
81.6%
Common 49
 
18.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
8.8%
11
 
5.1%
10
 
4.6%
8
 
3.7%
8
 
3.7%
8
 
3.7%
7
 
3.2%
7
 
3.2%
7
 
3.2%
6
 
2.8%
Other values (74) 126
58.1%
Common
ValueCountFrequency (%)
( 24
49.0%
) 24
49.0%
1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 206
77.4%
ASCII 49
 
18.4%
None 11
 
4.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 24
49.0%
) 24
49.0%
1
 
2.0%
Hangul
ValueCountFrequency (%)
19
 
9.2%
10
 
4.9%
8
 
3.9%
8
 
3.9%
8
 
3.9%
7
 
3.4%
7
 
3.4%
7
 
3.4%
6
 
2.9%
6
 
2.9%
Other values (73) 120
58.3%
None
ValueCountFrequency (%)
11
100.0%

등록번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing26
Missing (%)70.3%
Memory size428.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)10.8%
Memory size428.0 B

영업소 소재지
Text

MISSING 

Distinct36
Distinct (%)100.0%
Missing1
Missing (%)2.7%
Memory size428.0 B
2024-03-14T11:28:06.020470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length26
Mean length21.777778
Min length14

Characters and Unicode

Total characters784
Distinct characters121
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row전주시 덕진구 호반3길 7 (덕진동2가)
2nd row전주시 완산구 송정중앙로 29 (효자동1가)
3rd row전주시 덕진구 비석날로 99 (팔복동2가)
4th row익산시 춘포면 궁성로 272
5th row전주시 완산구 서곡6길 23-4 (효자동3가)
ValueCountFrequency (%)
전주시 22
 
13.2%
덕진구 14
 
8.4%
완산구 8
 
4.8%
익산시 5
 
3.0%
군산시 5
 
3.0%
우아동3가 4
 
2.4%
효자동1가 2
 
1.2%
101호 2
 
1.2%
효자동3가 2
 
1.2%
11 2
 
1.2%
Other values (100) 101
60.5%
2024-03-14T11:28:06.364033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
131
 
16.7%
33
 
4.2%
1 31
 
4.0%
31
 
4.0%
( 29
 
3.7%
) 29
 
3.7%
24
 
3.1%
23
 
2.9%
2 23
 
2.9%
22
 
2.8%
Other values (111) 408
52.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 437
55.7%
Decimal Number 141
 
18.0%
Space Separator 131
 
16.7%
Open Punctuation 29
 
3.7%
Close Punctuation 29
 
3.7%
Dash Punctuation 11
 
1.4%
Other Punctuation 6
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
7.6%
31
 
7.1%
24
 
5.5%
23
 
5.3%
22
 
5.0%
22
 
5.0%
22
 
5.0%
17
 
3.9%
17
 
3.9%
17
 
3.9%
Other values (96) 209
47.8%
Decimal Number
ValueCountFrequency (%)
1 31
22.0%
2 23
16.3%
3 20
14.2%
7 14
9.9%
0 12
 
8.5%
4 11
 
7.8%
5 10
 
7.1%
8 10
 
7.1%
9 6
 
4.3%
6 4
 
2.8%
Space Separator
ValueCountFrequency (%)
131
100.0%
Open Punctuation
ValueCountFrequency (%)
( 29
100.0%
Close Punctuation
ValueCountFrequency (%)
) 29
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Other Punctuation
ValueCountFrequency (%)
, 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 437
55.7%
Common 347
44.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
7.6%
31
 
7.1%
24
 
5.5%
23
 
5.3%
22
 
5.0%
22
 
5.0%
22
 
5.0%
17
 
3.9%
17
 
3.9%
17
 
3.9%
Other values (96) 209
47.8%
Common
ValueCountFrequency (%)
131
37.8%
1 31
 
8.9%
( 29
 
8.4%
) 29
 
8.4%
2 23
 
6.6%
3 20
 
5.8%
7 14
 
4.0%
0 12
 
3.5%
4 11
 
3.2%
- 11
 
3.2%
Other values (5) 36
 
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 437
55.7%
ASCII 347
44.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
131
37.8%
1 31
 
8.9%
( 29
 
8.4%
) 29
 
8.4%
2 23
 
6.6%
3 20
 
5.8%
7 14
 
4.0%
0 12
 
3.5%
4 11
 
3.2%
- 11
 
3.2%
Other values (5) 36
 
10.4%
Hangul
ValueCountFrequency (%)
33
 
7.6%
31
 
7.1%
24
 
5.5%
23
 
5.3%
22
 
5.0%
22
 
5.0%
22
 
5.0%
17
 
3.9%
17
 
3.9%
17
 
3.9%
Other values (96) 209
47.8%

전화번호
Text

MISSING 

Distinct36
Distinct (%)100.0%
Missing1
Missing (%)2.7%
Memory size428.0 B
2024-03-14T11:28:06.554163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length12
Mean length12.777778
Min length12

Characters and Unicode

Total characters460
Distinct characters18
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row063-254-8150
2nd row063-222-7968
3rd row063-211-8001
4th row063-832-5271
5th row063-273-5251
ValueCountFrequency (%)
063-254-8150 1
 
2.6%
063-858-5111 1
 
2.6%
fax 1
 
2.6%
245-0986 1
 
2.6%
063-452-1367 1
 
2.6%
063-631-5050 1
 
2.6%
063-236-3777 1
 
2.6%
063-451-0552 1
 
2.6%
063-242-7532 1
 
2.6%
063-223-0153 1
 
2.6%
Other values (29) 29
74.4%
2024-03-14T11:28:06.916774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 75
16.3%
3 66
14.3%
0 63
13.7%
6 57
12.4%
2 51
11.1%
5 33
7.2%
4 31
6.7%
1 29
 
6.3%
8 20
 
4.3%
7 18
 
3.9%
Other values (8) 17
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 377
82.0%
Dash Punctuation 75
 
16.3%
Lowercase Letter 3
 
0.7%
Control 2
 
0.4%
Open Punctuation 1
 
0.2%
Space Separator 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 66
17.5%
0 63
16.7%
6 57
15.1%
2 51
13.5%
5 33
8.8%
4 31
8.2%
1 29
7.7%
8 20
 
5.3%
7 18
 
4.8%
9 9
 
2.4%
Lowercase Letter
ValueCountFrequency (%)
f 1
33.3%
a 1
33.3%
x 1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 75
100.0%
Control
ValueCountFrequency (%)
2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 457
99.3%
Latin 3
 
0.7%

Most frequent character per script

Common
ValueCountFrequency (%)
- 75
16.4%
3 66
14.4%
0 63
13.8%
6 57
12.5%
2 51
11.2%
5 33
7.2%
4 31
6.8%
1 29
 
6.3%
8 20
 
4.4%
7 18
 
3.9%
Other values (5) 14
 
3.1%
Latin
ValueCountFrequency (%)
f 1
33.3%
a 1
33.3%
x 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 460
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 75
16.3%
3 66
14.3%
0 63
13.7%
6 57
12.4%
2 51
11.1%
5 33
7.2%
4 31
6.7%
1 29
 
6.3%
8 20
 
4.3%
7 18
 
3.9%
Other values (8) 17
 
3.7%

Interactions

2024-03-14T11:28:04.534277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T11:28:07.124874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업소명영업소 소재지전화번호
연번1.0001.0001.0001.000
업소명1.0001.0001.0001.000
영업소 소재지1.0001.0001.0001.000
전화번호1.0001.0001.0001.000

Missing values

2024-03-14T11:28:04.815665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T11:28:04.970834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T11:28:05.099715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번업소명등록번호Unnamed: 3영업소 소재지전화번호
0<NA><NA>대기수질<NA><NA>
11광진환경㈜NaN67전주시 덕진구 호반3길 7 (덕진동2가)063-254-8150
22㈜유성테크1810전주시 완산구 송정중앙로 29 (효자동1가)063-222-7968
33(유)지구엔비텍2285전주시 덕진구 비석날로 99 (팔복동2가)063-211-8001
44(유)호일엔지니어링245익산시 춘포면 궁성로 272063-832-5271
55(주)이앤에스테크27106전주시 완산구 서곡6길 23-4 (효자동3가)063-273-5251
66㈜청송이앤티25NaN전주시 덕진구 동부대로 727 (우아동3가)063-242-4690
77㈜광산테크28NaN익산시 석암로13길 71 (팔봉동)063-835-2823
88바다정수산업㈜NaN14전주시 덕진구 원만성로 5 (만성동)063-211-4331
99(유)도영종합건설NaN38전주시 완산구 메너머2길 21-10 (중화산동2가)063-229-5025
연번업소명등록번호Unnamed: 3영업소 소재지전화번호
2727(유)대금환경NaN129전주시 덕진구 산재마을길 59 (금상동)063-242-7532
2828(주)대한NaN124익산시 익산대로26길 17-1 (남중동)063-858-5111
2929(유)금영이엔텍NaN132전주시 완산구 용머리로 73, 효자프라자 122호 (효자동1가)063-223-0153
3030(유)라인환경기술141134전주시 덕진구 용덕길 3-30063-288-7001
3131(주)이레엔텍NaN135군산시 미성로 417번지 (101호)063-464-9256
3232(주)국성건설엔지니어링NaN136전주시 완산구 서신로 57(서신동)063-251-3330
3333㈜보원건설산업NaN140익산시 황등면 후정4길 58-36063-858-0850
3434(유)도일이엔지NaN142완주군 삼례읍 용전길 7(하리)063-253-6136
3535(유)블루그린링크NaN143군산시 가도로 81 (오식도동)063-225-7407 063-471-7407
3636에너원주식회사144NaN군산시 외항로 884 (소룡동)063-466-7700