Overview

Dataset statistics

Number of variables: 5
Number of observations: 192
Missing cells: 3
Missing cells (%): 0.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 7.8 KiB
Average record size in memory: 41.7 B
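
The percentages and sizes in this block follow from the raw counts. As a minimal sketch (the variable names are illustrative and not part of the report), the missing-cell share and the total memory footprint can be rechecked like this:

```python
# Figures copied from the dataset statistics above.
n_variables = 5
n_observations = 192
missing_cells = 3
avg_record_bytes = 41.7  # already rounded in the report

# A "cell" is one value in one column of one row.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Total size is roughly records * average record size, shown here in KiB.
total_kib = n_observations * avg_record_bytes / 1024

print(f"{total_cells} cells, {missing_pct:.1f}% missing, about {total_kib:.1f} KiB")
```

Because the average record size is itself rounded, the recomputed total only agrees with the reported 7.8 KiB to one decimal place.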

Variable types

Numeric: 1
Text: 4

Dataset

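Description: Provides the status of information and communications construction businesses in Ulsan Metropolitan City (울산광역시), including registration number, representative name, business name, phone number, road-name address, and lot-number address.
Author: Ulsan Metropolitan City (울산광역시)
URL: https://www.data.go.kr/data/3081170/fileData.do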

Alerts

전화번호 has 3 (1.6%) missing values [Missing]
등록번호 has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 16:06:08.405964
Analysis finished: 2023-12-12 16:06:09.051570
Duration: 0.65 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

등록번호
Real number (ℝ)

UNIQUE 

Distinct: 192
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 428306.36
Minimum: 111573
Maximum: 610239
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 111573
5-th percentile: 120199.1
Q1: 520001.75
median: 520082.5
Q3: 520136.75
95-th percentile: 520179.45
Maximum: 610239
Range: 498666
Interquartile range (IQR): 135
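
The range and IQR are simple differences of the quantiles listed above. The sketch below rechecks them and, purely as an illustration that is not part of the report, applies the common 1.5 × IQR rule to show how far the minimum lies from the bulk of the registration numbers:

```python
# Quantiles copied from the report for the 등록번호 column.
minimum = 111573
q1 = 520001.75
q3 = 520136.75
maximum = 610239

value_range = maximum - minimum   # 498666
iqr = q3 - q1                     # 135.0

# Tukey-style fences; the 1.5 multiplier is a convention, not a report setting.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr)
print("minimum below lower fence:", minimum < lower_fence)
print("maximum above upper fence:", maximum > upper_fence)
```

Since 등록번호 is an identifier rather than a measurement, values outside the fences most likely reflect different numbering series, not errors.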

Descriptive statistics

Standard deviation: 168385.48
Coefficient of variation (CV): 0.39314261
Kurtosis: -0.40772688
Mean: 428306.36
Median Absolute Deviation (MAD): 64.5
Skewness: -1.2427667
Sum: 82234821
Variance: 2.835367 × 10^10
Monotonicity: Not monotonic
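
Several of these statistics are derived from one another. As a quick consistency check (a sketch using the rounded figures printed above, so the results agree only up to rounding), the mean, coefficient of variation, and variance can be recomputed from the sum, the count, and the standard deviation:

```python
import math

# Figures copied from the report.
total = 82234821
n = 192
std = 168385.48  # rounded to two decimals in the report

mean = total / n       # sum divided by the number of observations
cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the squared standard deviation

print(round(mean, 2), round(cv, 6), f"{variance:.6e}")
```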
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
520183 | 1 | 0.5%
520077 | 1 | 0.5%
520037 | 1 | 0.5%
520035 | 1 | 0.5%
520034 | 1 | 0.5%
520033 | 1 | 0.5%
520032 | 1 | 0.5%
520031 | 1 | 0.5%
520029 | 1 | 0.5%
520026 | 1 | 0.5%
Other values (182) | 182 | 94.8%
Value | Count | Frequency (%)
111573 | 1 | 0.5%
113275 | 1 | 0.5%
120004 | 1 | 0.5%
120008 | 1 | 0.5%
120053 | 1 | 0.5%
120098 | 1 | 0.5%
120099 | 1 | 0.5%
120127 | 1 | 0.5%
120154 | 1 | 0.5%
120165 | 1 | 0.5%
Value | Count | Frequency (%)
610239 | 1 | 0.5%
610089 | 1 | 0.5%
550299 | 1 | 0.5%
550213 | 1 | 0.5%
550172 | 1 | 0.5%
530063 | 1 | 0.5%
520183 | 1 | 0.5%
520182 | 1 | 0.5%
520181 | 1 | 0.5%
520180 | 1 | 0.5%

상호
Text

Distinct: 191
Distinct (%): 99.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 38
Median length: 28
Mean length: 9.1979167
Min length: 4

Characters and Unicode

Total characters: 1766
Distinct characters: 216
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 190
Unique (%): 99.0%
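
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. The sketch below uses hypothetical placeholder names (not the real business names) arranged so that one name appears twice, which reproduces the 191 and 190 reported for this column:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Hypothetical data: 190 names that occur once plus one name that occurs twice.
names = [f"company_{i}" for i in range(190)] + ["duplicate_co", "duplicate_co"]

distinct, unique = distinct_and_unique(names)
n = len(names)
print(distinct, f"{100 * distinct / n:.1f}%")
print(unique, f"{100 * unique / n:.1f}%")
```

With 192 rows, a single repeated value is enough to lower the distinct count by one and the unique count by two.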

Sample

1st row: 다원아이티
2nd row: 동건산업전설 주식회사
3rd row: 주식회사 태영전기
4th row: 주식회사 노바테크(NOVA Technology Co.,Ltd.)
5th row: 주식회사 리얼시큐
Value | Count | Frequency (%)
주식회사 | 73 | 26.0%
co.,ltd | 3 | 1.1%
세기정보통신 | 2 | 0.7%
inc | 2 | 0.7%
가람정보통신 | 1 | 0.4%
주)에스케이범통신 | 1 | 0.4%
주)울산통신공사 | 1 | 0.4%
주)에스아이인포콤 | 1 | 0.4%
신성아이시티(주 | 1 | 0.4%
주)극동자동화 | 1 | 0.4%
Other values (195) | 195 | 69.4%

Most occurring characters

ValueCountFrequency (%)
166
 
9.4%
) 101
 
5.7%
( 101
 
5.7%
89
 
5.0%
83
 
4.7%
77
 
4.4%
77
 
4.4%
61
 
3.5%
45
 
2.5%
42
 
2.4%
Other values (206) 924
52.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1348
76.3%
Close Punctuation 101
 
5.7%
Open Punctuation 101
 
5.7%
Space Separator 89
 
5.0%
Uppercase Letter 76
 
4.3%
Lowercase Letter 34
 
1.9%
Other Punctuation 17
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
166
 
12.3%
83
 
6.2%
77
 
5.7%
77
 
5.7%
61
 
4.5%
45
 
3.3%
42
 
3.1%
32
 
2.4%
31
 
2.3%
29
 
2.2%
Other values (169) 705
52.3%
Uppercase Letter
ValueCountFrequency (%)
E 9
11.8%
O 9
11.8%
C 8
10.5%
L 8
10.5%
T 7
9.2%
N 7
9.2%
I 6
7.9%
A 5
6.6%
S 5
6.6%
R 3
 
3.9%
Other values (7) 9
11.8%
Lowercase Letter
ValueCountFrequency (%)
o 6
17.6%
c 4
11.8%
t 4
11.8%
d 4
11.8%
l 3
8.8%
n 3
8.8%
y 2
 
5.9%
e 2
 
5.9%
h 1
 
2.9%
g 1
 
2.9%
Other values (4) 4
11.8%
Other Punctuation
ValueCountFrequency (%)
. 10
58.8%
, 5
29.4%
& 2
 
11.8%
Close Punctuation
ValueCountFrequency (%)
) 101
100.0%
Open Punctuation
ValueCountFrequency (%)
( 101
100.0%
Space Separator
ValueCountFrequency (%)
89
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1348
76.3%
Common 308
 
17.4%
Latin 110
 
6.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
166
 
12.3%
83
 
6.2%
77
 
5.7%
77
 
5.7%
61
 
4.5%
45
 
3.3%
42
 
3.1%
32
 
2.4%
31
 
2.3%
29
 
2.2%
Other values (169) 705
52.3%
Latin
ValueCountFrequency (%)
E 9
 
8.2%
O 9
 
8.2%
C 8
 
7.3%
L 8
 
7.3%
T 7
 
6.4%
N 7
 
6.4%
o 6
 
5.5%
I 6
 
5.5%
A 5
 
4.5%
S 5
 
4.5%
Other values (21) 40
36.4%
Common
ValueCountFrequency (%)
) 101
32.8%
( 101
32.8%
89
28.9%
. 10
 
3.2%
, 5
 
1.6%
& 2
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1348
76.3%
ASCII 418
 
23.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
166
 
12.3%
83
 
6.2%
77
 
5.7%
77
 
5.7%
61
 
4.5%
45
 
3.3%
42
 
3.1%
32
 
2.4%
31
 
2.3%
29
 
2.2%
Other values (169) 705
52.3%
ASCII
ValueCountFrequency (%)
) 101
24.2%
( 101
24.2%
89
21.3%
. 10
 
2.4%
E 9
 
2.2%
O 9
 
2.2%
C 8
 
1.9%
L 8
 
1.9%
T 7
 
1.7%
N 7
 
1.7%
Other values (27) 69
16.5%
Distinct: 189
Distinct (%): 98.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.9895833
Min length: 2

Characters and Unicode

Total characters: 574
Distinct characters: 129
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 186
Unique (%): 96.9%

Sample

1st row: 김문용
2nd row: 이상관
3rd row: 안영희
4th row: 송동석
5th row: 김재선
Value | Count | Frequency (%)
이창희 | 2 | 1.0%
김기현 | 2 | 1.0%
이민규 | 2 | 1.0%
이형규 | 1 | 0.5%
신영권 | 1 | 0.5%
이상열 | 1 | 0.5%
김은진 | 1 | 0.5%
김문용 | 1 | 0.5%
허정훈 | 1 | 0.5%
김원덕 | 1 | 0.5%
Other values (179) | 179 | 93.2%

Most occurring characters

ValueCountFrequency (%)
50
 
8.7%
32
 
5.6%
19
 
3.3%
18
 
3.1%
14
 
2.4%
13
 
2.3%
12
 
2.1%
12
 
2.1%
11
 
1.9%
11
 
1.9%
Other values (119) 382
66.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 574
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
 
8.7%
32
 
5.6%
19
 
3.3%
18
 
3.1%
14
 
2.4%
13
 
2.3%
12
 
2.1%
12
 
2.1%
11
 
1.9%
11
 
1.9%
Other values (119) 382
66.6%

Most occurring scripts

ValueCountFrequency (%)
Hangul 574
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
 
8.7%
32
 
5.6%
19
 
3.3%
18
 
3.1%
14
 
2.4%
13
 
2.3%
12
 
2.1%
12
 
2.1%
11
 
1.9%
11
 
1.9%
Other values (119) 382
66.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 574
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
50
 
8.7%
32
 
5.6%
19
 
3.3%
18
 
3.1%
14
 
2.4%
13
 
2.3%
12
 
2.1%
12
 
2.1%
11
 
1.9%
11
 
1.9%
Other values (119) 382
66.6%

전화번호
Text

MISSING 

Distinct: 188
Distinct (%): 99.5%
Missing: 3
Missing (%): 1.6%
Memory size: 1.6 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.058201
Min length: 11

Characters and Unicode

Total characters: 2279
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 187
Unique (%): 98.9%
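
For this column the reported figures are consistent with the 3 missing values being excluded from every denominator except the missing percentage itself. A minimal sketch of that arithmetic, with variable names chosen only for illustration:

```python
# Figures copied from the report for the 전화번호 column.
n_observations = 192
n_missing = 3
total_characters = 2279
n_distinct = 188
n_unique = 187

# Rows that actually hold a phone number.
n_present = n_observations - n_missing

mean_length = total_characters / n_present
distinct_pct = 100 * n_distinct / n_present
unique_pct = 100 * n_unique / n_present
missing_pct = 100 * n_missing / n_observations  # missing share uses all rows

print(n_present, round(mean_length, 6))
print(f"{distinct_pct:.1f}% distinct, {unique_pct:.1f}% unique, {missing_pct:.1f}% missing")
```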

Sample

1st row: 052-247-6114
2nd row: 055-381-4474
3rd row: 052-285-0301
4th row: 052-225-8582
5th row: 052-716-5115
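
The sample rows share a hyphen-separated layout (area code, exchange, four-digit line number), and the reported lengths run from 11 to 13 characters. A downstream check could look like the sketch below; the pattern and the helper name `is_plausible_phone` are assumptions for illustration, not something the report defines, and real data may need a looser rule:

```python
import re

# Assumed layout: 2-4 digit prefix, 3-4 digit exchange, 4 digit line number.
PHONE_PATTERN = re.compile(r"\d{2,4}-\d{3,4}-\d{4}")

def is_plausible_phone(value):
    """Return True when value is a string matching the assumed layout."""
    if not isinstance(value, str):
        return False  # covers missing entries such as None or NaN
    return PHONE_PATTERN.fullmatch(value.strip()) is not None

samples = ["052-247-6114", "055-381-4474", "0522476114", None]
for s in samples:
    print(repr(s), is_plausible_phone(s))
```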
Value | Count | Frequency (%)
052-254-9247 | 2 | 1.1%
052-226-9800 | 1 | 0.5%
052-211-0994 | 1 | 0.5%
052-247-6114 | 1 | 0.5%
052-903-1616 | 1 | 0.5%
052-227-5599 | 1 | 0.5%
052-260-7007 | 1 | 0.5%
052-260-1811 | 1 | 0.5%
052-900-7100 | 1 | 0.5%
052-257-0001 | 1 | 0.5%
Other values (178) | 178 | 94.2%

Most occurring characters

ValueCountFrequency (%)
2 411
18.0%
0 382
16.8%
- 378
16.6%
5 283
12.4%
1 134
 
5.9%
7 130
 
5.7%
4 128
 
5.6%
8 118
 
5.2%
6 112
 
4.9%
3 102
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1901
83.4%
Dash Punctuation 378
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 411
21.6%
0 382
20.1%
5 283
14.9%
1 134
 
7.0%
7 130
 
6.8%
4 128
 
6.7%
8 118
 
6.2%
6 112
 
5.9%
3 102
 
5.4%
9 101
 
5.3%
Dash Punctuation
ValueCountFrequency (%)
- 378
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2279
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 411
18.0%
0 382
16.8%
- 378
16.6%
5 283
12.4%
1 134
 
5.9%
7 130
 
5.7%
4 128
 
5.6%
8 118
 
5.2%
6 112
 
4.9%
3 102
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2279
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 411
18.0%
0 382
16.8%
- 378
16.6%
5 283
12.4%
1 134
 
5.9%
7 130
 
5.7%
4 128
 
5.6%
8 118
 
5.2%
6 112
 
4.9%
3 102
 
4.5%

주소
Text

Distinct: 190
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 54
Median length: 37
Mean length: 26.15625
Min length: 14

Characters and Unicode

Total characters: 5022
Distinct characters: 225
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 188
Unique (%): 97.9%

Sample

1st row: 울산광역시 울주군 청량읍 상남길 15-21, 2동 1층
2nd row: 울산광역시 울주군 삼남읍 하방중들길 9
3rd row: 울산광역시 울주군 서생면 에너지산업1로 45, 6블럭-4
4th row: 울산광역시 동구 보성길 73 (일산동)
5th row: 울산광역시 남구 대학로 93, 울산대학교산학협동관 403호 (무거동)
Value | Count | Frequency (%)
울산 | 107 | 9.9%
울산광역시 | 85 | 7.8%
남구 | 63 | 5.8%
울주군 | 57 | 5.2%
중구 | 43 | 4.0%
북구 | 27 | 2.5%
서생면 | 18 | 1.7%
[value missing in source] | 15 | 1.4%
범서읍 | 15 | 1.4%
1층 | 12 | 1.1%
Other values (408) | 644 | 59.3%
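
The word counts show the city written two ways: the short form 울산 (107 occurrences) and the full form 울산광역시 (85 occurrences). If addresses need to be grouped or matched later, one possible cleaning step is to rewrite the leading token to a single spelling. The helper `normalize_city` below is a hypothetical sketch, not part of the report or the source dataset:

```python
SHORT_NAME = "울산"
FULL_NAME = "울산광역시"

def normalize_city(address: str) -> str:
    """Rewrite a leading short city name to the full form.

    Note: splitting and re-joining also collapses repeated whitespace.
    """
    tokens = address.split()
    if tokens and tokens[0] == SHORT_NAME:
        tokens[0] = FULL_NAME
    return " ".join(tokens)

print(normalize_city("울산 북구 진장24길 11"))
print(normalize_city("울산광역시 동구 보성길 73 (일산동)"))
```

Only the first token is touched, so later occurrences such as 울산대학교산학협동관 inside a building name are left alone.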

Most occurring characters

ValueCountFrequency (%)
894
 
17.8%
258
 
5.1%
256
 
5.1%
1 219
 
4.4%
151
 
3.0%
2 142
 
2.8%
139
 
2.8%
( 137
 
2.7%
) 136
 
2.7%
, 130
 
2.6%
Other values (215) 2560
51.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2803
55.8%
Space Separator 894
 
17.8%
Decimal Number 867
 
17.3%
Open Punctuation 137
 
2.7%
Close Punctuation 136
 
2.7%
Other Punctuation 131
 
2.6%
Dash Punctuation 49
 
1.0%
Uppercase Letter 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
258
 
9.2%
256
 
9.1%
151
 
5.4%
139
 
5.0%
114
 
4.1%
110
 
3.9%
87
 
3.1%
86
 
3.1%
85
 
3.0%
82
 
2.9%
Other values (195) 1435
51.2%
Decimal Number
ValueCountFrequency (%)
1 219
25.3%
2 142
16.4%
3 93
10.7%
0 79
 
9.1%
6 68
 
7.8%
5 65
 
7.5%
4 64
 
7.4%
8 48
 
5.5%
7 46
 
5.3%
9 43
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
C 2
40.0%
B 1
20.0%
K 1
20.0%
T 1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 130
99.2%
/ 1
 
0.8%
Space Separator
ValueCountFrequency (%)
894
100.0%
Open Punctuation
ValueCountFrequency (%)
( 137
100.0%
Close Punctuation
ValueCountFrequency (%)
) 136
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 49
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2803
55.8%
Common 2214
44.1%
Latin 5
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
258
 
9.2%
256
 
9.1%
151
 
5.4%
139
 
5.0%
114
 
4.1%
110
 
3.9%
87
 
3.1%
86
 
3.1%
85
 
3.0%
82
 
2.9%
Other values (195) 1435
51.2%
Common
ValueCountFrequency (%)
894
40.4%
1 219
 
9.9%
2 142
 
6.4%
( 137
 
6.2%
) 136
 
6.1%
, 130
 
5.9%
3 93
 
4.2%
0 79
 
3.6%
6 68
 
3.1%
5 65
 
2.9%
Other values (6) 251
 
11.3%
Latin
ValueCountFrequency (%)
C 2
40.0%
B 1
20.0%
K 1
20.0%
T 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2803
55.8%
ASCII 2219
44.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
894
40.3%
1 219
 
9.9%
2 142
 
6.4%
( 137
 
6.2%
) 136
 
6.1%
, 130
 
5.9%
3 93
 
4.2%
0 79
 
3.6%
6 68
 
3.1%
5 65
 
2.9%
Other values (10) 256
 
11.5%
Hangul
ValueCountFrequency (%)
258
 
9.2%
256
 
9.1%
151
 
5.4%
139
 
5.0%
114
 
4.1%
110
 
3.9%
87
 
3.1%
86
 
3.1%
85
 
3.0%
82
 
2.9%
Other values (195) 1435
51.2%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 등록번호 | 상호 | 대표자 | 전화번호 | 주소
0 | 520183 | 다원아이티 | 김문용 | 052-247-6114 | 울산광역시 울주군 청량읍 상남길 15-21, 2동 1층
1 | 520182 | 동건산업전설 주식회사 | 이상관 | 055-381-4474 | 울산광역시 울주군 삼남읍 하방중들길 9
2 | 520181 | 주식회사 태영전기 | 안영희 | 052-285-0301 | 울산광역시 울주군 서생면 에너지산업1로 45, 6블럭-4
3 | 520180 | 주식회사 노바테크(NOVA Technology Co.,Ltd.) | 송동석 | 052-225-8582 | 울산광역시 동구 보성길 73 (일산동)
4 | 520179 | 주식회사 리얼시큐 | 김재선 | 052-716-5115 | 울산광역시 남구 대학로 93, 울산대학교산학협동관 403호 (무거동)
5 | 520178 | CS미디어 | 김형진 | 052-234-3120 | 울산광역시 동구 화암10길 6, 명선가의새아침 1층 (방어동)
6 | 520177 | 주식회사 더원 | 이대한 | 055-910-7824 | 울산광역시 울주군 삼남읍 방기로 43
7 | 520176 | 주식회사 에프아이티 | 김규태 | 052-716-1195 | 울산광역시 중구 옥교8길 31, 1층 (학산동)
8 | 520175 | 주식회사 인사이트온 | 박승래 | 052-248-5188 | 울산광역시 중구 종가5길 21 (유곡동)
9 | 520173 | 주식회사 테라바이오(TERRABIO.CO.,Ltd) | 노준혁 | 052-223-9957 | 울산광역시 울주군 언양읍 반천반송산업로 137-13

Index | 등록번호 | 상호 | 대표자 | 전화번호 | 주소
182 | 150318 | 주식회사 이로 | 김영택 | 053-213-9818 | 울산 울주군 청량읍 덕하로 133
183 | 120227 | 주식회사 디투제이 | 김수만 | 052-266-9092 | 울산 울주군 삼남읍 신화리로 29, 4호
184 | 111573 | 주식회사 대한 | 오기영 | 052-276-9169 | 울산 울주군 서생면 천산로 293, 4층
185 | 120008 | (주)태광통신 | 이성원 | 052-245-2340 | 울산 중구 성안2길 41 ,(성안동)
186 | 120053 | (주)삼우통신 | 이민규 | 052-271-3400 | 울산 북구 진장24길 11(진장동)
187 | 120099 | (주)현대종합이앤지 | 문임용 | 052-224-3636 | 울산 울주군 서생면 해맞이로 990 ,3층
188 | 120165 | 갑을통신(주) | 이형규 | 052-287-1300 | 울산 북구 진장24길 11
189 | 120127 | 주식회사 혜정정보기술 | 배기환 | 052-244-9100 | 울산 울주군 서생면 진하6길 3, 1306호
190 | 120098 | (주)청원통신 | 심상권 | 052-227-2600 | 울산 남구 돋질로306번길 19 ,(삼산동)
191 | 120154 | 유창통신(주) | 방태열 | 052-244-8000 | 울산 중구 명륜로 54 ,(우정동)