Overview

Dataset statistics

Number of variables5
Number of observations248
Missing cells1
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.1 KiB
Average record size in memory41.5 B

Variable types

Numeric1
Text4

Dataset

Description경상북도 건설업 현황정보에 대한 요청자료이나
Author경상북도 칠곡군
URLhttps://www.data.go.kr/data/15084299/fileData.do

Alerts

순번 has unique valuesUnique
상호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:15:04.583136
Analysis finished2023-12-12 12:15:05.081056
Duration0.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct248
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean124.5
Minimum1
Maximum248
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-12T21:15:05.148278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.35
Q162.75
median124.5
Q3186.25
95-th percentile235.65
Maximum248
Range247
Interquartile range (IQR)123.5

Descriptive statistics

Standard deviation71.735626
Coefficient of variation (CV)0.57618976
Kurtosis-1.2
Mean124.5
Median Absolute Deviation (MAD)62
Skewness0
Sum30876
Variance5146
MonotonicityStrictly increasing
2023-12-12T21:15:05.279459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
172 1
 
0.4%
159 1
 
0.4%
160 1
 
0.4%
161 1
 
0.4%
162 1
 
0.4%
163 1
 
0.4%
164 1
 
0.4%
165 1
 
0.4%
166 1
 
0.4%
Other values (238) 238
96.0%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
248 1
0.4%
247 1
0.4%
246 1
0.4%
245 1
0.4%
244 1
0.4%
243 1
0.4%
242 1
0.4%
241 1
0.4%
240 1
0.4%
239 1
0.4%

상호
Text

UNIQUE 

Distinct248
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T21:15:05.574340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length7.1935484
Min length3

Characters and Unicode

Total characters1784
Distinct characters231
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique248 ?
Unique (%)100.0%

Sample

1st row(유)성지건설
2nd row(주)거양건설
3rd row(주)거해건설
4th row(주)건명
5th row(주)건흥건설
ValueCountFrequency (%)
유)성지건설 1
 
0.4%
서린eng 1
 
0.4%
에스엠건설(주 1
 
0.4%
보성가스텍 1
 
0.4%
부원건설(주 1
 
0.4%
북삼현대서비스 1
 
0.4%
비젼이엔지(주 1
 
0.4%
삼성sd솔루텍 1
 
0.4%
삼성기전(주 1
 
0.4%
삼원건설(주 1
 
0.4%
Other values (238) 238
96.0%
2023-12-12T21:15:05.997258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
193
 
10.8%
( 170
 
9.5%
) 170
 
9.5%
92
 
5.2%
90
 
5.0%
38
 
2.1%
30
 
1.7%
30
 
1.7%
30
 
1.7%
28
 
1.6%
Other values (221) 913
51.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1431
80.2%
Open Punctuation 170
 
9.5%
Close Punctuation 170
 
9.5%
Uppercase Letter 11
 
0.6%
Lowercase Letter 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
193
 
13.5%
92
 
6.4%
90
 
6.3%
38
 
2.7%
30
 
2.1%
30
 
2.1%
30
 
2.1%
28
 
2.0%
28
 
2.0%
26
 
1.8%
Other values (209) 846
59.1%
Uppercase Letter
ValueCountFrequency (%)
G 2
18.2%
N 2
18.2%
E 2
18.2%
S 1
9.1%
D 1
9.1%
T 1
9.1%
R 1
9.1%
K 1
9.1%
Open Punctuation
ValueCountFrequency (%)
( 170
100.0%
Close Punctuation
ValueCountFrequency (%)
) 170
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1431
80.2%
Common 341
 
19.1%
Latin 12
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
193
 
13.5%
92
 
6.4%
90
 
6.3%
38
 
2.7%
30
 
2.1%
30
 
2.1%
30
 
2.1%
28
 
2.0%
28
 
2.0%
26
 
1.8%
Other values (209) 846
59.1%
Latin
ValueCountFrequency (%)
G 2
16.7%
N 2
16.7%
E 2
16.7%
S 1
8.3%
D 1
8.3%
T 1
8.3%
e 1
8.3%
R 1
8.3%
K 1
8.3%
Common
ValueCountFrequency (%)
( 170
49.9%
) 170
49.9%
- 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1431
80.2%
ASCII 353
 
19.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
193
 
13.5%
92
 
6.4%
90
 
6.3%
38
 
2.7%
30
 
2.1%
30
 
2.1%
30
 
2.1%
28
 
2.0%
28
 
2.0%
26
 
1.8%
Other values (209) 846
59.1%
ASCII
ValueCountFrequency (%)
( 170
48.2%
) 170
48.2%
G 2
 
0.6%
N 2
 
0.6%
E 2
 
0.6%
S 1
 
0.3%
D 1
 
0.3%
T 1
 
0.3%
e 1
 
0.3%
- 1
 
0.3%
Other values (2) 2
 
0.6%
Distinct223
Distinct (%)89.9%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T21:15:06.419671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.108871
Min length2

Characters and Unicode

Total characters771
Distinct characters122
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique202 ?
Unique (%)81.5%

Sample

1st row정*주
2nd row박*호
3rd row김*철
4th row손*락
5th row박*우
ValueCountFrequency (%)
이*숙 3
 
1.2%
이*철 3
 
1.2%
박*규 3
 
1.2%
김*호 3
 
1.2%
이*구 2
 
0.8%
김*식 2
 
0.8%
김*훈 2
 
0.8%
이*순 2
 
0.8%
이*현 2
 
0.8%
김*영 2
 
0.8%
Other values (213) 224
90.3%
2023-12-12T21:15:07.015983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 255
33.1%
46
 
6.0%
44
 
5.7%
23
 
3.0%
14
 
1.8%
14
 
1.8%
13
 
1.7%
12
 
1.6%
11
 
1.4%
10
 
1.3%
Other values (112) 329
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 509
66.0%
Other Punctuation 262
34.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
 
9.0%
44
 
8.6%
23
 
4.5%
14
 
2.8%
14
 
2.8%
13
 
2.6%
12
 
2.4%
11
 
2.2%
10
 
2.0%
10
 
2.0%
Other values (110) 312
61.3%
Other Punctuation
ValueCountFrequency (%)
* 255
97.3%
, 7
 
2.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 509
66.0%
Common 262
34.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
 
9.0%
44
 
8.6%
23
 
4.5%
14
 
2.8%
14
 
2.8%
13
 
2.6%
12
 
2.4%
11
 
2.2%
10
 
2.0%
10
 
2.0%
Other values (110) 312
61.3%
Common
ValueCountFrequency (%)
* 255
97.3%
, 7
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 509
66.0%
ASCII 262
34.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 255
97.3%
, 7
 
2.7%
Hangul
ValueCountFrequency (%)
46
 
9.0%
44
 
8.6%
23
 
4.5%
14
 
2.8%
14
 
2.8%
13
 
2.6%
12
 
2.4%
11
 
2.2%
10
 
2.0%
10
 
2.0%
Other values (110) 312
61.3%
Distinct105
Distinct (%)42.3%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T21:15:07.282106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length7
Mean length7.3064516
Min length2

Characters and Unicode

Total characters1812
Distinct characters14
Distinct categories5 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)20.6%

Sample

1st row[39880 ]
2nd row[39906]
3rd row[39859]
4th row[39862]
5th row[39857]
ValueCountFrequency (%)
86
25.9%
39864 13
 
3.9%
39859 12
 
3.6%
39852 11
 
3.3%
39915 10
 
3.0%
39850 9
 
2.7%
39868 9
 
2.7%
39906 8
 
2.4%
39870 8
 
2.4%
39909 8
 
2.4%
Other values (66) 158
47.6%
2023-12-12T21:15:07.784243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 343
18.9%
3 264
14.6%
[ 248
13.7%
] 248
13.7%
8 224
12.4%
84
 
4.6%
0 80
 
4.4%
5 79
 
4.4%
6 62
 
3.4%
4 49
 
2.7%
Other values (4) 131
 
7.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1231
67.9%
Open Punctuation 248
 
13.7%
Close Punctuation 248
 
13.7%
Space Separator 84
 
4.6%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 343
27.9%
3 264
21.4%
8 224
18.2%
0 80
 
6.5%
5 79
 
6.4%
6 62
 
5.0%
4 49
 
4.0%
1 47
 
3.8%
2 42
 
3.4%
7 41
 
3.3%
Open Punctuation
ValueCountFrequency (%)
[ 248
100.0%
Close Punctuation
ValueCountFrequency (%)
] 248
100.0%
Space Separator
ValueCountFrequency (%)
84
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1812
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
9 343
18.9%
3 264
14.6%
[ 248
13.7%
] 248
13.7%
8 224
12.4%
84
 
4.6%
0 80
 
4.4%
5 79
 
4.4%
6 62
 
3.4%
4 49
 
2.7%
Other values (4) 131
 
7.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1812
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 343
18.9%
3 264
14.6%
[ 248
13.7%
] 248
13.7%
8 224
12.4%
84
 
4.6%
0 80
 
4.4%
5 79
 
4.4%
6 62
 
3.4%
4 49
 
2.7%
Other values (4) 131
 
7.2%
Distinct234
Distinct (%)94.7%
Missing1
Missing (%)0.4%
Memory size2.1 KiB
2023-12-12T21:15:08.293551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length35
Mean length22.417004
Min length17

Characters and Unicode

Total characters5537
Distinct characters115
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique222 ?
Unique (%)89.9%

Sample

1st row경상북도 칠곡군 왜관읍 석전로3길 31
2nd row경상북도 칠곡군 왜관읍 구상길 130-11
3rd row경상북도 칠곡군 동명면 금암중앙길 40
4th row경상북도 칠곡군 동명면 봉암3길 2
5th row경상북도 칠곡군 동명면 한티로 32
ValueCountFrequency (%)
칠곡군 246
19.1%
경상북도 241
18.7%
왜관읍 78
 
6.0%
지천면 38
 
2.9%
가산면 35
 
2.7%
동명면 33
 
2.6%
2층 19
 
1.5%
약목면 19
 
1.5%
기산면 16
 
1.2%
석적읍 15
 
1.2%
Other values (344) 551
42.7%
2023-12-12T21:15:08.899634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1044
18.9%
276
 
5.0%
261
 
4.7%
260
 
4.7%
253
 
4.6%
252
 
4.6%
251
 
4.5%
1 250
 
4.5%
248
 
4.5%
2 159
 
2.9%
Other values (105) 2283
41.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3414
61.7%
Space Separator 1044
 
18.9%
Decimal Number 961
 
17.4%
Dash Punctuation 98
 
1.8%
Other Punctuation 13
 
0.2%
Open Punctuation 3
 
0.1%
Close Punctuation 3
 
0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
276
 
8.1%
261
 
7.6%
260
 
7.6%
253
 
7.4%
252
 
7.4%
251
 
7.4%
248
 
7.3%
147
 
4.3%
144
 
4.2%
141
 
4.1%
Other values (88) 1181
34.6%
Decimal Number
ValueCountFrequency (%)
1 250
26.0%
2 159
16.5%
3 83
 
8.6%
8 81
 
8.4%
7 75
 
7.8%
6 72
 
7.5%
4 68
 
7.1%
9 61
 
6.3%
5 58
 
6.0%
0 54
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 8
61.5%
5
38.5%
Space Separator
ValueCountFrequency (%)
1044
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 98
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3414
61.7%
Common 2122
38.3%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
276
 
8.1%
261
 
7.6%
260
 
7.6%
253
 
7.4%
252
 
7.4%
251
 
7.4%
248
 
7.3%
147
 
4.3%
144
 
4.2%
141
 
4.1%
Other values (88) 1181
34.6%
Common
ValueCountFrequency (%)
1044
49.2%
1 250
 
11.8%
2 159
 
7.5%
- 98
 
4.6%
3 83
 
3.9%
8 81
 
3.8%
7 75
 
3.5%
6 72
 
3.4%
4 68
 
3.2%
9 61
 
2.9%
Other values (6) 131
 
6.2%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3414
61.7%
ASCII 2118
38.3%
None 5
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1044
49.3%
1 250
 
11.8%
2 159
 
7.5%
- 98
 
4.6%
3 83
 
3.9%
8 81
 
3.8%
7 75
 
3.5%
6 72
 
3.4%
4 68
 
3.2%
9 61
 
2.9%
Other values (6) 127
 
6.0%
Hangul
ValueCountFrequency (%)
276
 
8.1%
261
 
7.6%
260
 
7.6%
253
 
7.4%
252
 
7.4%
251
 
7.4%
248
 
7.3%
147
 
4.3%
144
 
4.2%
141
 
4.1%
Other values (88) 1181
34.6%
None
ValueCountFrequency (%)
5
100.0%

Interactions

2023-12-12T21:15:04.857769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T21:15:04.956971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:15:05.044801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번상호대표자우편번호(도로명주소)영업소재지(도로명주소)
01(유)성지건설정*주[39880 ]경상북도 칠곡군 왜관읍 석전로3길 31
12(주)거양건설박*호[39906]경상북도 칠곡군 왜관읍 구상길 130-11
23(주)거해건설김*철[39859]경상북도 칠곡군 동명면 금암중앙길 40
34(주)건명손*락[39862]경상북도 칠곡군 동명면 봉암3길 2
45(주)건흥건설박*우[39857]경상북도 칠곡군 동명면 한티로 32
56(주)경원이*원[39848]경상북도 칠곡군 석적읍 중지3길 48-27
67(주)경원테크손*원[39852]경상북도 칠곡군 가산면 경북대로 1695
78(주)공간조경정*경[39914]경상북도 칠곡군 기산면 평복1길 84-21,2층
89(주)금강이엔지정*조[39864]경상북도 칠곡군 지천면 신동로2길 97-1
910(주)금오석면환경길*범[39907]경상북도 칠곡군 왜관읍 삼청2길 22-1
순번상호대표자우편번호(도로명주소)영업소재지(도로명주소)
238239한국종합엔지니어링(주)양*구[39868]경상북도 칠곡군 지천면 금호로1길 32 2층
239240한일건설(주)권*안[39871]경상북도 칠곡군 왜관읍 현대로 75
240241현대가스김*호[39825]경상북도 칠곡군 약목면 약목로7길 10
241242현대설비공사심*석[39907]경상북도 칠곡군 왜관읍 삼청2길 22-1 1층
242243협동조경(주)조*제[39864 ]경상북도 칠곡군 지천면 신동로 160
243244혜인설비정*철[39823]경상북도 칠곡군 약목면 복성13길 18
244245호이팜영농조합법인박*국[39828]경상북도 칠곡군 약목면 칠곡대로 878-7
245246호진건설주식회사김*호[39825]경상북도 칠곡군 약목면 약목로5길 8
246247화인건설(주)최*순[39910]경상북도 칠곡군 왜관읍 공단로11길 68
247248효성종합개발박*만[39805]경상북도 칠곡군 북삼읍 금오대로 75