Overview

Dataset statistics

Number of variables3
Number of observations73
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory25.8 B

Variable types

Unsupported1
Text2

Alerts

Unnamed: 1 has unique valuesUnique
Unnamed: 2 has unique valuesUnique
일반측량업 등록현황 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-14 02:53:06.583505
Analysis finished2024-03-14 02:53:06.855709
Duration0.27 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일반측량업 등록현황
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size716.0 B

Unnamed: 1
Text

UNIQUE 

Distinct73
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size716.0 B
2024-03-14T11:53:07.028860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length7.630137
Min length4

Characters and Unicode

Total characters557
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)100.0%

Sample

1st row등록번호
2nd row05032027
3rd row04-002763
4th row05032084
5th row05032083
ValueCountFrequency (%)
등록번호 1
 
1.4%
05032082 1
 
1.4%
05032089 1
 
1.4%
05032051 1
 
1.4%
05032057 1
 
1.4%
05032067 1
 
1.4%
05032061 1
 
1.4%
05032065 1
 
1.4%
052116 1
 
1.4%
052093 1
 
1.4%
Other values (63) 63
86.3%
2024-03-14T11:53:07.350236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 188
33.8%
2 79
14.2%
5 77
13.8%
3 61
 
11.0%
1 33
 
5.9%
4 27
 
4.8%
8 23
 
4.1%
7 18
 
3.2%
6 18
 
3.2%
9 18
 
3.2%
Other values (5) 15
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 542
97.3%
Dash Punctuation 11
 
2.0%
Other Letter 4
 
0.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 188
34.7%
2 79
14.6%
5 77
14.2%
3 61
 
11.3%
1 33
 
6.1%
4 27
 
5.0%
8 23
 
4.2%
7 18
 
3.3%
6 18
 
3.3%
9 18
 
3.3%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 553
99.3%
Hangul 4
 
0.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 188
34.0%
2 79
14.3%
5 77
13.9%
3 61
 
11.0%
1 33
 
6.0%
4 27
 
4.9%
8 23
 
4.2%
7 18
 
3.3%
6 18
 
3.3%
9 18
 
3.3%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 553
99.3%
Hangul 4
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 188
34.0%
2 79
14.3%
5 77
13.9%
3 61
 
11.0%
1 33
 
6.0%
4 27
 
4.9%
8 23
 
4.2%
7 18
 
3.3%
6 18
 
3.3%
9 18
 
3.3%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 2
Text

UNIQUE 

Distinct73
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size716.0 B
2024-03-14T11:53:07.538615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length8.7945205
Min length3

Characters and Unicode

Total characters642
Distinct characters113
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)100.0%

Sample

1st row업 체 명
2nd row(유)공간건설엔지니어링
3rd row(유)대건엔지니어링
4th row(유)대성이엔씨
5th row(유)동방엔지니어링
ValueCountFrequency (%)
유한회사 8
 
9.2%
주식회사 3
 
3.4%
거성건설 1
 
1.1%
우리측량토목설계 1
 
1.1%
송하측량토목설계공사 1
 
1.1%
새터이엔지 1
 
1.1%
삼일토목기술단 1
 
1.1%
부경주식회사 1
 
1.1%
동아측량토목설계공사 1
 
1.1%
도시측량설계공사 1
 
1.1%
Other values (68) 68
78.2%
2024-03-14T11:53:07.835635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35
 
5.5%
( 34
 
5.3%
) 34
 
5.3%
33
 
5.1%
28
 
4.4%
24
 
3.7%
22
 
3.4%
21
 
3.3%
21
 
3.3%
21
 
3.3%
Other values (103) 369
57.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 556
86.6%
Open Punctuation 34
 
5.3%
Close Punctuation 34
 
5.3%
Space Separator 14
 
2.2%
Uppercase Letter 3
 
0.5%
Other Symbol 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
6.3%
33
 
5.9%
28
 
5.0%
24
 
4.3%
22
 
4.0%
21
 
3.8%
21
 
3.8%
21
 
3.8%
19
 
3.4%
15
 
2.7%
Other values (96) 317
57.0%
Uppercase Letter
ValueCountFrequency (%)
E 1
33.3%
N 1
33.3%
G 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 34
100.0%
Close Punctuation
ValueCountFrequency (%)
) 34
100.0%
Space Separator
ValueCountFrequency (%)
14
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 557
86.8%
Common 82
 
12.8%
Latin 3
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
6.3%
33
 
5.9%
28
 
5.0%
24
 
4.3%
22
 
3.9%
21
 
3.8%
21
 
3.8%
21
 
3.8%
19
 
3.4%
15
 
2.7%
Other values (97) 318
57.1%
Common
ValueCountFrequency (%)
( 34
41.5%
) 34
41.5%
14
17.1%
Latin
ValueCountFrequency (%)
E 1
33.3%
N 1
33.3%
G 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 556
86.6%
ASCII 85
 
13.2%
None 1
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
35
 
6.3%
33
 
5.9%
28
 
5.0%
24
 
4.3%
22
 
4.0%
21
 
3.8%
21
 
3.8%
21
 
3.8%
19
 
3.4%
15
 
2.7%
Other values (96) 317
57.0%
ASCII
ValueCountFrequency (%)
( 34
40.0%
) 34
40.0%
14
16.5%
E 1
 
1.2%
N 1
 
1.2%
G 1
 
1.2%
None
ValueCountFrequency (%)
1
100.0%

Correlations

2024-03-14T11:53:07.911009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 1Unnamed: 2
Unnamed: 11.0001.000
Unnamed: 21.0001.000

Missing values

2024-03-14T11:53:06.713607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T11:53:06.827159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일반측량업 등록현황Unnamed: 1Unnamed: 2
0연번등록번호업 체 명
1105032027(유)공간건설엔지니어링
2204-002763(유)대건엔지니어링
3305032084(유)대성이엔씨
4405032083(유)동방엔지니어링
55052133(유)미래에스엔씨
6605032063(유)백두엔지니어링
77052182(유)범한
8805032076(유)삼교건설엔지니어링
9905032055(유)삼안엔지니어링
일반측량업 등록현황Unnamed: 1Unnamed: 2
6363052123제일측량설계공사
646404-002914주식회사 고산
656504-003103주식회사 고원
666604-002964주식회사 지유엔지니어링
676705032196지오측량설계사무소
6868052048청구토목측량설계공사
6969052176토지측량설계공사
707005032185하늘측량토목설계
7171052094현대측량설계공사
7272052145호남측량설계공사