Overview

Dataset statistics

Number of variables: 3
Number of observations: 50
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.3 KiB
Average record size in memory: 26.6 B

Variable types

Unsupported: 1
Text: 2

Alerts

Unnamed: 1 has unique values (Unique)
Unnamed: 2 has unique values (Unique)
공공측량업 등록현황 is an unsupported type; check whether it needs cleaning or further analysis (Unsupported)
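
The column names Unnamed: 1 and Unnamed: 2, together with the Unsupported alert, suggest that the sheet title was read as the header while the real column labels (연번, 등록번호, 업 체 명) sit in the first data row, as the Sample section indicates. The following is a minimal sketch of one possible cleaning step, assuming the table is already available as a list of rows. The helper name promote_first_row_to_header is hypothetical and not part of ydata-profiling.

```python
def promote_first_row_to_header(rows):
    """Treat the first row as column labels and return (header, records).

    Assumption: `rows` is a list of equal-length lists of strings,
    with the real column labels stored in rows[0].
    """
    if not rows:
        return [], []
    header = [str(cell).strip() for cell in rows[0]]
    records = [dict(zip(header, row)) for row in rows[1:]]
    return header, records


# Three rows copied from the Sample section of this report.
rows = [
    ["연번", "등록번호", "업 체 명"],
    ["1", "03-000746", "(유) 세종건설기술"],
    ["2", "05-03-1034", "(유)대도엔지니어링"],
]
header, records = promote_first_row_to_header(rows)
print(header)
print(records[0]["등록번호"])
```

After such a step the header row no longer counts as an observation, so a re-run of the profile would likely report 49 observations instead of 50.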

Reproduction

Analysis started: 2024-03-14 00:29:05.804509
Analysis finished: 2024-03-14 00:29:06.033325
Duration: 0.23 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

공공측량업 등록현황 (Public Surveying Business Registration Status)
Unsupported

REJECTED, UNSUPPORTED

Missing: 0
Missing (%): 0.0%
Memory size: 532.0 B

Unnamed: 1
Text

UNIQUE 

Distinct: 50
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 532.0 B

Length

Max length: 11
Median length: 10
Mean length: 8.9
Min length: 4
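
The length figures above are summary statistics over the character count of each value. The sketch below shows how such figures can be computed, assuming the values are plain Python strings; the function name length_stats is illustrative only. It runs on the five sample values listed in this report, so its results differ from the figures reported for all 50 rows.

```python
from statistics import mean, median


def length_stats(values):
    """Return max, median, mean and min of the string lengths in `values`."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


# The first five values of Unnamed: 1, as listed under Sample.
sample = ["등록번호", "03-000746", "05-03-1034", "05-1088", "05-03-1025"]
stats = length_stats(sample)
print(stats)
```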

Characters and Unicode

Total characters: 445
Distinct characters: 15
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
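
As an illustration of such character properties, the sketch below counts Unicode general categories with Python's standard unicodedata module. It is not the profiler's own implementation. The CATEGORY_NAMES mapping covers only the three categories reported for this variable, and script and block lookups are omitted because unicodedata does not expose them.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this variable.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Lo": "Other Letter",
}


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


counts = category_counts(["등록번호", "03-000746", "05-03-1034"])
for name, n in counts.most_common():
    print(name, n)
```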

Unique

Unique: 50
Unique (%): 100.0%
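
Distinct and Unique are reported separately. A sketch of the usual distinction, assuming Distinct counts the different values present and Unique counts the values that occur exactly once; the function name distinct_and_unique is hypothetical:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Every value differs, so both counts equal the number of values.
print(distinct_and_unique(["03-000746", "05-03-1034", "05-1088"]))
# A repeated value still counts as distinct but no longer as unique.
print(distinct_and_unique(["a", "b", "b"]))
```

Under that reading, a column where both figures equal the number of observations, as here, has no repeated value at all.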

Sample

1st row: 등록번호 (registration number)
2nd row: 03-000746
3rd row: 05-03-1034
4th row: 05-1088
5th row: 05-03-1025

Value              Count  Frequency (%)
등록번호           1      2.0%
05-1036            1      2.0%
051050             1      2.0%
05-03-1029         1      2.0%
05-03-1037         1      2.0%
03-000632          1      2.0%
05-03-1038         1      2.0%
05-03-1032         1      2.0%
05-03-1107         1      2.0%
03-000749          1      2.0%
Other values (40)  40     80.0%

Most occurring characters

Value             Count  Frequency (%)
0                 159    35.7%
-                 66     14.8%
3                 56     12.6%
1                 56     12.6%
5                 47     10.6%
7                 14     3.1%
2                 13     2.9%
8                 8      1.8%
4                 8      1.8%
6                 8      1.8%
Other values (5)  10     2.2%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    375    84.3%
Dash Punctuation  66     14.8%
Other Letter      4      0.9%

Most frequent character per category

Decimal Number

Value  Count  Frequency (%)
0      159    42.4%
3      56     14.9%
1      56     14.9%
5      47     12.5%
7      14     3.7%
2      13     3.5%
8      8      2.1%
4      8      2.1%
6      8      2.1%
9      6      1.6%

Other Letter

Value  Count  Frequency (%)
등     1      25.0%
록     1      25.0%
번     1      25.0%
호     1      25.0%

Dash Punctuation

Value  Count  Frequency (%)
-      66     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  441    99.1%
Hangul  4      0.9%

Most frequent character per script

Common

Value  Count  Frequency (%)
0      159    36.1%
-      66     15.0%
3      56     12.7%
1      56     12.7%
5      47     10.7%
7      14     3.2%
2      13     2.9%
8      8      1.8%
4      8      1.8%
6      8      1.8%

Hangul

Value  Count  Frequency (%)
등     1      25.0%
록     1      25.0%
번     1      25.0%
호     1      25.0%

Most occurring blocks

Value   Count  Frequency (%)
ASCII   441    99.1%
Hangul  4      0.9%

Most frequent character per block

ASCII

Value  Count  Frequency (%)
0      159    36.1%
-      66     15.0%
3      56     12.7%
1      56     12.7%
5      47     10.7%
7      14     3.2%
2      13     2.9%
8      8      1.8%
4      8      1.8%
6      8      1.8%

Hangul

Value  Count  Frequency (%)
등     1      25.0%
록     1      25.0%
번     1      25.0%
호     1      25.0%

Unnamed: 2
Text

UNIQUE 

Distinct: 50
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 532.0 B

Length

Max length: 17
Median length: 12
Mean length: 9.16
Min length: 5

Characters and Unicode

Total characters: 458
Distinct characters: 92
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 50
Unique (%): 100.0%

Sample

1st row: 업 체 명 (company name)
2nd row: (유) 세종건설기술
3rd row: (유)대도엔지니어링
4th row: (유)백제기술공사
5th row: (유)승우엔지니어링

Value                             Count  Frequency (%)
주식회사                          5      7.9%
엔지니어링                        3      4.8%
천우                              1      1.6%
주)한신                           1      1.6%
한아                              1      1.6%
주)용성엔지니어링                 1      1.6%
주)우리기술단                     1      1.6%
주)유건앤지리정보센터             1      1.6%
주)유앤디엔지니어링건축사사무소   1      1.6%
주)유일종합기술단                 1      1.6%
Other values (47)                 47     74.6%

Most occurring characters

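Value                      Count  Frequency (%)
(                          42     9.2%
)                          42     9.2%
(character not preserved)  34     7.4%
(character not preserved)  21     4.6%
(character not preserved)  21     4.6%
(character not preserved)  19     4.1%
(character not preserved)  17     3.7%
(character not preserved)  17     3.7%
(character not preserved)  17     3.7%
(character not preserved)  13     2.8%
Other values (82)          215    46.9%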

Most occurring categories

Value              Count  Frequency (%)
Other Letter       361    78.8%
Open Punctuation   42     9.2%
Close Punctuation  42     9.2%
Space Separator    13     2.8%

Most frequent character per category

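Other Letter

Value                      Count  Frequency (%)
(character not preserved)  34     9.4%
(character not preserved)  21     5.8%
(character not preserved)  21     5.8%
(character not preserved)  19     5.3%
(character not preserved)  17     4.7%
(character not preserved)  17     4.7%
(character not preserved)  17     4.7%
(character not preserved)  13     3.6%
(character not preserved)  13     3.6%
(character not preserved)  13     3.6%
Other values (79)          176    48.8%

Open Punctuation

Value  Count  Frequency (%)
(      42     100.0%

Close Punctuation

Value  Count  Frequency (%)
)      42     100.0%

Space Separator

Value    Count  Frequency (%)
(space)  13     100.0%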

Most occurring scripts

Value   Count  Frequency (%)
Hangul  361    78.8%
Common  97     21.2%

Most frequent character per script

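Hangul

Value                      Count  Frequency (%)
(character not preserved)  34     9.4%
(character not preserved)  21     5.8%
(character not preserved)  21     5.8%
(character not preserved)  19     5.3%
(character not preserved)  17     4.7%
(character not preserved)  17     4.7%
(character not preserved)  17     4.7%
(character not preserved)  13     3.6%
(character not preserved)  13     3.6%
(character not preserved)  13     3.6%
Other values (79)          176    48.8%

Common

Value    Count  Frequency (%)
(        42     43.3%
)        42     43.3%
(space)  13     13.4%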

Most occurring blocks

Value   Count  Frequency (%)
Hangul  361    78.8%
ASCII   97     21.2%

Most frequent character per block

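ASCII

Value    Count  Frequency (%)
(        42     43.3%
)        42     43.3%
(space)  13     13.4%

Hangul

Value                      Count  Frequency (%)
(character not preserved)  34     9.4%
(character not preserved)  21     5.8%
(character not preserved)  21     5.8%
(character not preserved)  19     5.3%
(character not preserved)  17     4.7%
(character not preserved)  17     4.7%
(character not preserved)  17     4.7%
(character not preserved)  13     3.6%
(character not preserved)  13     3.6%
(character not preserved)  13     3.6%
Other values (79)          176    48.8%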

Correlations

            Unnamed: 1  Unnamed: 2
Unnamed: 1  1.000       1.000
Unnamed: 2  1.000       1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
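
To make the two nullity views concrete, the sketch below builds a boolean presence matrix and a per-column missing count from a list of rows. It assumes that None and the empty string mark a missing cell, which is a simplification of how a dataframe library decides nullity. The names nullity_matrix and missing_per_column are illustrative and not taken from the profiler.

```python
def is_missing(cell):
    """Assumption: None and the empty string count as missing."""
    return cell is None or cell == ""


def nullity_matrix(rows):
    """Return one list of booleans per row; True means the cell is present."""
    return [[not is_missing(cell) for cell in row] for row in rows]


def missing_per_column(rows):
    """Return the number of missing cells in each column."""
    if not rows:
        return []
    return [sum(is_missing(row[i]) for row in rows) for i in range(len(rows[0]))]


rows = [
    ["1", "03-000746", "(유) 세종건설기술"],
    ["2", None, "(유)대도엔지니어링"],  # hypothetical gap, for illustration only
]
print(nullity_matrix(rows))
print(missing_per_column(rows))
```

For the dataset profiled here every count would be zero, matching the Missing cells figure of 0 in the overview.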

Sample

    공공측량업 등록현황  Unnamed: 1  Unnamed: 2
0   연번                 등록번호    업 체 명
1   1                    03-000746   (유) 세종건설기술
2   2                    05-03-1034  (유)대도엔지니어링
3   3                    05-1088     (유)백제기술공사
4   4                    05-03-1025  (유)승우엔지니어링
5   5                    05-03-1019  (유)신우엔지니어링
6   6                    05-1093     (유)신한개발기술단
7   7                    05-03-1018  (유)여울건설엔지니어링
8   8                    05-03-1024  (유)이지이앤씨
9   9                    05-1054     (유)장흥건설기술공사

    공공측량업 등록현황  Unnamed: 1  Unnamed: 2
40  40                   05031011    (주)현성 엔지니어링
41  41                   05031010    성원기술개발(주)
42  42                   03-000752   유한회사 새움
43  43                   03-000729   유한회사일등엔지니어링
44  44                   051057      제이씨엔(주)
45  45                   03-000731   주식회사 성광
46  46                   03-000704   주식회사 씨앤에스 비전
47  47                   05031021    주식회사 천우
48  48                   051050      주식회사 한아
49  49                   05031006    주식회사 현산이엔씨