Overview

Dataset statistics

Number of variables6
Number of observations761
Missing cells18
Missing cells (%)0.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory36.5 KiB
Average record size in memory49.2 B

Variable types

Numeric1
Text3
Categorical2

Dataset

Description부산광역시 소방시설업 현황에 대한 데이터로 번호, 업체명, 업체구분, 위치, 등록구분, 연락처 항목정보를 제공합니다.
Author부산광역시
URLhttps://www.data.go.kr/data/3056593/fileData.do

Alerts

번호 is highly overall correlated with 업종구분High correlation
업종구분 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
등록구분 is highly overall correlated with 업종구분High correlation
연락처 has 18 (2.4%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2024-03-14 12:21:42.096806
Analysis finished2024-03-14 12:21:43.411696
Duration1.31 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct761
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean381
Minimum1
Maximum761
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.8 KiB
2024-03-14T21:21:43.539690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile39
Q1191
median381
Q3571
95-th percentile723
Maximum761
Range760
Interquartile range (IQR)380

Descriptive statistics

Standard deviation219.82607
Coefficient of variation (CV)0.57697131
Kurtosis-1.2
Mean381
Median Absolute Deviation (MAD)190
Skewness0
Sum289941
Variance48323.5
MonotonicityStrictly increasing
2024-03-14T21:21:44.019708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
512 1
 
0.1%
503 1
 
0.1%
504 1
 
0.1%
505 1
 
0.1%
506 1
 
0.1%
507 1
 
0.1%
508 1
 
0.1%
509 1
 
0.1%
510 1
 
0.1%
Other values (751) 751
98.7%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
761 1
0.1%
760 1
0.1%
759 1
0.1%
758 1
0.1%
757 1
0.1%
756 1
0.1%
755 1
0.1%
754 1
0.1%
753 1
0.1%
752 1
0.1%
Distinct619
Distinct (%)81.3%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
2024-03-14T21:21:44.929730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length15
Mean length8.1156373
Min length3

Characters and Unicode

Total characters6176
Distinct characters256
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique507 ?
Unique (%)66.6%

Sample

1st row성보전기공업(주)
2nd row(주)정관소방
3rd row태성이엔지
4th row금영전기(주)
5th row(주)효승이엔지
ValueCountFrequency (%)
주식회사 124
 
13.8%
주)한국소방 4
 
0.4%
주)중앙기술단 4
 
0.4%
주)대영소방이엔지 4
 
0.4%
주)남경이엔지 4
 
0.4%
주)정엔지니어링 4
 
0.4%
주)더베스트이앤씨 4
 
0.4%
주)골드이엔지 4
 
0.4%
주)근우이앤지 3
 
0.3%
이엔지 3
 
0.3%
Other values (620) 741
82.4%
2024-03-14T21:21:46.047537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
674
 
10.9%
( 541
 
8.8%
) 541
 
8.8%
291
 
4.7%
199
 
3.2%
189
 
3.1%
176
 
2.8%
153
 
2.5%
152
 
2.5%
138
 
2.2%
Other values (246) 3122
50.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4907
79.5%
Open Punctuation 541
 
8.8%
Close Punctuation 541
 
8.8%
Space Separator 138
 
2.2%
Uppercase Letter 32
 
0.5%
Other Punctuation 9
 
0.1%
Lowercase Letter 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
674
 
13.7%
291
 
5.9%
199
 
4.1%
189
 
3.9%
176
 
3.6%
153
 
3.1%
152
 
3.1%
131
 
2.7%
129
 
2.6%
123
 
2.5%
Other values (229) 2690
54.8%
Lowercase Letter
ValueCountFrequency (%)
o 2
25.0%
e 1
12.5%
s 1
12.5%
a 1
12.5%
k 1
12.5%
m 1
12.5%
n 1
12.5%
Uppercase Letter
ValueCountFrequency (%)
E 11
34.4%
G 8
25.0%
N 7
21.9%
C 5
15.6%
F 1
 
3.1%
Other Punctuation
ValueCountFrequency (%)
. 6
66.7%
& 3
33.3%
Open Punctuation
ValueCountFrequency (%)
( 541
100.0%
Close Punctuation
ValueCountFrequency (%)
) 541
100.0%
Space Separator
ValueCountFrequency (%)
138
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4907
79.5%
Common 1229
 
19.9%
Latin 40
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
674
 
13.7%
291
 
5.9%
199
 
4.1%
189
 
3.9%
176
 
3.6%
153
 
3.1%
152
 
3.1%
131
 
2.7%
129
 
2.6%
123
 
2.5%
Other values (229) 2690
54.8%
Latin
ValueCountFrequency (%)
E 11
27.5%
G 8
20.0%
N 7
17.5%
C 5
12.5%
o 2
 
5.0%
e 1
 
2.5%
s 1
 
2.5%
a 1
 
2.5%
k 1
 
2.5%
m 1
 
2.5%
Other values (2) 2
 
5.0%
Common
ValueCountFrequency (%)
( 541
44.0%
) 541
44.0%
138
 
11.2%
. 6
 
0.5%
& 3
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4907
79.5%
ASCII 1269
 
20.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
674
 
13.7%
291
 
5.9%
199
 
4.1%
189
 
3.9%
176
 
3.6%
153
 
3.1%
152
 
3.1%
131
 
2.7%
129
 
2.6%
123
 
2.5%
Other values (229) 2690
54.8%
ASCII
ValueCountFrequency (%)
( 541
42.6%
) 541
42.6%
138
 
10.9%
E 11
 
0.9%
G 8
 
0.6%
N 7
 
0.6%
. 6
 
0.5%
C 5
 
0.4%
& 3
 
0.2%
o 2
 
0.2%
Other values (7) 7
 
0.6%

업종구분
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
공사업
448 
설계업
98 
방염업
85 
관리업
78 
감리업
52 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공사업
2nd row공사업
3rd row방염업
4th row공사업
5th row공사업

Common Values

ValueCountFrequency (%)
공사업 448
58.9%
설계업 98
 
12.9%
방염업 85
 
11.2%
관리업 78
 
10.2%
감리업 52
 
6.8%

Length

2024-03-14T21:21:46.277215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T21:21:46.472531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사업 448
58.9%
설계업 98
 
12.9%
방염업 85
 
11.2%
관리업 78
 
10.2%
감리업 52
 
6.8%

위치
Text

Distinct667
Distinct (%)87.6%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
2024-03-14T21:21:47.870149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length55
Mean length38.145861
Min length16

Characters and Unicode

Total characters29029
Distinct characters326
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique591 ?
Unique (%)77.7%

Sample

1st row(46020) 부산광역시 기장군 정관읍 정관로 929-9 .
2nd row(46024) 부산광역시 기장군 정관읍 달산2길 2 1층
3rd row(46024) 부산광역시 기장군 정관읍 달산3길 7-6 , 303호
4th row(46033) 부산광역시 기장군 장안읍 좌천1길 34 ()
5th row(46033) 부산광역시 기장군 장안읍 좌천로 83 .
ValueCountFrequency (%)
부산광역시 755
 
14.3%
309
 
5.9%
부산진구 109
 
2.1%
동래구 94
 
1.8%
연제구 79
 
1.5%
강서구 64
 
1.2%
금정구 60
 
1.1%
남구 59
 
1.1%
사상구 55
 
1.0%
해운대구 50
 
0.9%
Other values (1713) 3632
69.0%
2024-03-14T21:21:49.481700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4713
 
16.2%
( 1340
 
4.6%
) 1340
 
4.6%
4 1168
 
4.0%
1 1066
 
3.7%
998
 
3.4%
990
 
3.4%
934
 
3.2%
2 924
 
3.2%
788
 
2.7%
Other values (316) 14768
50.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13593
46.8%
Decimal Number 7337
25.3%
Space Separator 4713
 
16.2%
Open Punctuation 1340
 
4.6%
Close Punctuation 1340
 
4.6%
Other Punctuation 526
 
1.8%
Dash Punctuation 168
 
0.6%
Uppercase Letter 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
998
 
7.3%
990
 
7.3%
934
 
6.9%
788
 
5.8%
787
 
5.8%
784
 
5.8%
758
 
5.6%
658
 
4.8%
346
 
2.5%
341
 
2.5%
Other values (289) 6209
45.7%
Decimal Number
ValueCountFrequency (%)
4 1168
15.9%
1 1066
14.5%
2 924
12.6%
7 737
10.0%
0 689
9.4%
3 627
8.5%
6 577
7.9%
5 562
7.7%
8 547
7.5%
9 440
 
6.0%
Uppercase Letter
ValueCountFrequency (%)
A 3
25.0%
O 1
 
8.3%
T 1
 
8.3%
V 1
 
8.3%
B 1
 
8.3%
W 1
 
8.3%
E 1
 
8.3%
I 1
 
8.3%
K 1
 
8.3%
S 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
, 372
70.7%
. 153
29.1%
/ 1
 
0.2%
Space Separator
ValueCountFrequency (%)
4713
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1340
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1340
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 168
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 15424
53.1%
Hangul 13593
46.8%
Latin 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
998
 
7.3%
990
 
7.3%
934
 
6.9%
788
 
5.8%
787
 
5.8%
784
 
5.8%
758
 
5.6%
658
 
4.8%
346
 
2.5%
341
 
2.5%
Other values (289) 6209
45.7%
Common
ValueCountFrequency (%)
4713
30.6%
( 1340
 
8.7%
) 1340
 
8.7%
4 1168
 
7.6%
1 1066
 
6.9%
2 924
 
6.0%
7 737
 
4.8%
0 689
 
4.5%
3 627
 
4.1%
6 577
 
3.7%
Other values (7) 2243
14.5%
Latin
ValueCountFrequency (%)
A 3
25.0%
O 1
 
8.3%
T 1
 
8.3%
V 1
 
8.3%
B 1
 
8.3%
W 1
 
8.3%
E 1
 
8.3%
I 1
 
8.3%
K 1
 
8.3%
S 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15436
53.2%
Hangul 13593
46.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4713
30.5%
( 1340
 
8.7%
) 1340
 
8.7%
4 1168
 
7.6%
1 1066
 
6.9%
2 924
 
6.0%
7 737
 
4.8%
0 689
 
4.5%
3 627
 
4.1%
6 577
 
3.7%
Other values (17) 2255
14.6%
Hangul
ValueCountFrequency (%)
998
 
7.3%
990
 
7.3%
934
 
6.9%
788
 
5.8%
787
 
5.8%
784
 
5.8%
758
 
5.6%
658
 
4.8%
346
 
2.5%
341
 
2.5%
Other values (289) 6209
45.7%

등록구분
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
전문
487 
일반(기계, 전기)
82 
<NA>
78 
합판목재류
62 
합성수지류
 
21
Other values (4)
 
31

Length

Max length10
Median length2
Mean length3.5584757
Min length2

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row일반(전기)
2nd row전문
3rd row합판목재류
4th row전문
5th row전문

Common Values

ValueCountFrequency (%)
전문 487
64.0%
일반(기계, 전기) 82
 
10.8%
<NA> 78
 
10.2%
합판목재류 62
 
8.1%
합성수지류 21
 
2.8%
일반(전기) 18
 
2.4%
일반(기계) 11
 
1.4%
합성수지류, 섬유류 1
 
0.1%
섬유류 1
 
0.1%

Length

2024-03-14T21:21:49.734547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T21:21:49.996122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전문 487
57.7%
일반(기계 93
 
11.0%
전기 82
 
9.7%
na 78
 
9.2%
합판목재류 62
 
7.3%
합성수지류 22
 
2.6%
일반(전기 18
 
2.1%
섬유류 2
 
0.2%

연락처
Text

MISSING 

Distinct567
Distinct (%)76.3%
Missing18
Missing (%)2.4%
Memory size6.1 KiB
2024-03-14T21:21:51.113413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.009421
Min length9

Characters and Unicode

Total characters8923
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique434 ?
Unique (%)58.4%

Sample

1st row051-529-0001
2nd row051-727-4119
3rd row051-727-5804
4th row051-555-7050
5th row051-727-9055
ValueCountFrequency (%)
051-759-5442 6
 
0.8%
051-818-7341 5
 
0.7%
051-557-4245 4
 
0.5%
051-703-1599 4
 
0.5%
051-817-3119 4
 
0.5%
070-8819-1119 4
 
0.5%
051-644-9119 4
 
0.5%
051-507-0119 4
 
0.5%
051-317-3119 4
 
0.5%
051-959-1119 4
 
0.5%
Other values (557) 700
94.2%
2024-03-14T21:21:52.682386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 1675
18.8%
- 1485
16.6%
5 1287
14.4%
0 1267
14.2%
9 556
 
6.2%
2 475
 
5.3%
3 462
 
5.2%
7 461
 
5.2%
8 442
 
5.0%
6 422
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7438
83.4%
Dash Punctuation 1485
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1675
22.5%
5 1287
17.3%
0 1267
17.0%
9 556
 
7.5%
2 475
 
6.4%
3 462
 
6.2%
7 461
 
6.2%
8 442
 
5.9%
6 422
 
5.7%
4 391
 
5.3%
Dash Punctuation
ValueCountFrequency (%)
- 1485
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8923
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 1675
18.8%
- 1485
16.6%
5 1287
14.4%
0 1267
14.2%
9 556
 
6.2%
2 475
 
5.3%
3 462
 
5.2%
7 461
 
5.2%
8 442
 
5.0%
6 422
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8923
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1675
18.8%
- 1485
16.6%
5 1287
14.4%
0 1267
14.2%
9 556
 
6.2%
2 475
 
5.3%
3 462
 
5.2%
7 461
 
5.2%
8 442
 
5.0%
6 422
 
4.7%

Interactions

2024-03-14T21:21:42.766636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T21:21:52.949542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업종구분등록구분
번호1.0000.9380.481
업종구분0.9381.0000.959
등록구분0.4810.9591.000
2024-03-14T21:21:53.190048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록구분업종구분
등록구분1.0000.731
업종구분0.7311.000
2024-03-14T21:21:53.421896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업종구분등록구분
번호1.0000.6630.260
업종구분0.6631.0000.731
등록구분0.2600.7311.000

Missing values

2024-03-14T21:21:43.145401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T21:21:43.339216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호업체명업종구분위치등록구분연락처
01성보전기공업(주)공사업(46020) 부산광역시 기장군 정관읍 정관로 929-9 .일반(전기)051-529-0001
12(주)정관소방공사업(46024) 부산광역시 기장군 정관읍 달산2길 2 1층전문051-727-4119
23태성이엔지방염업(46024) 부산광역시 기장군 정관읍 달산3길 7-6 , 303호합판목재류<NA>
34금영전기(주)공사업(46033) 부산광역시 기장군 장안읍 좌천1길 34 ()전문051-727-5804
45(주)효승이엔지공사업(46033) 부산광역시 기장군 장안읍 좌천로 83 .전문051-555-7050
56천광전업(주)공사업(46033) 부산광역시 기장군 좌천5길 9 .전문051-727-9055
67에스엠씨이앤아이티 주식회사공사업(46036) 부산광역시 기장군 장안읍 길천2길 33 1층전문051-461-0388
78대지전설 주식회사공사업(46040) 부산광역시 기장군 일광읍 일광로 549-67 .전문051-724-3096
89삼일전설(주)공사업(46044) 부산광역시 기장군 일광면 삼성2길 6 (삼양빌라 301호)전문051-722-3104
910(주)금강이엔에프공사업(46046) 부산광역시 기장군 일광읍 화용길 172 , 1층전문051-727-3961
번호업체명업종구분위치등록구분연락처
751752세이프엔지니어링(주)관리업부산광역시 연제구 연산동 135<NA>051-714-6119
752753엘에스방재(주)관리업부산광역시 사상구 모라동 1115 - 4<NA>051-304-0206
753754장안기업(주)관리업부산광역시 동래구 사직동 122 - 1<NA>051-817-5253
754755제일소방(주)관리업부산광역시 부산진구 개금동 333<NA>051-711-2119
755756주식회사 동성소방관리업부산광역시 연제구 연산동 398<NA>051-755-9111
756757주식회사 시민소방관리업부산광역시 사하구 장림동 585<NA>051-755-9111
757758주식회사 에스비소방이엔씨관리업부산광역시 수영구 광안동 1641<NA>051-502-1190
758759주식회사 에프이피관리업부산광역시 연제구 연산동 794<NA>051-865-1195
759760주식회사 월드소방관리업부산광역시 연제구 연산동 405<NA>051-759-1198
760761주식회사국제소방이엔지관리업부산광역시 금정구 구서동 178<NA>051-517-6119