Overview

Dataset statistics

Number of variables6
Number of observations36
Missing cells37
Missing cells (%)17.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory51.7 B

Variable types

Text4
Categorical2

Dataset

Description전라북도 정읍시 소재 고물상 현황(상호명, 소재지도로명주소, 소재지지번, 전화번호, 담당부서 등) 의 자료를 제공합니다.
Author전라북도 정읍시
URLhttps://www.data.go.kr/data/15034685/fileData.do

Alerts

담당부서 has constant value ""Constant
데이터기준일자 has constant value ""Constant
소재지도로명주소 has 7 (19.4%) missing valuesMissing
전화번호 has 30 (83.3%) missing valuesMissing
상호명 has unique valuesUnique
소재지지번주소 has unique valuesUnique

Reproduction

Analysis started2023-12-16 15:47:54.069274
Analysis finished2023-12-16 15:47:56.409724
Duration2.34 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호명
Text

UNIQUE 

Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-16T15:47:56.872426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length4
Mean length4.3333333
Min length4

Characters and Unicode

Total characters156
Distinct characters60
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row광주고물상
2nd row제일자원
3rd row상동고물상
4th row화성고물상
5th row하나철강
ValueCountFrequency (%)
광주고물상 1
 
2.8%
제일자원 1
 
2.8%
동문자원 1
 
2.8%
강산자원 1
 
2.8%
뷰티월드자원 1
 
2.8%
삼익고철 1
 
2.8%
왕림자원 1
 
2.8%
고속자원 1
 
2.8%
다원자원 1
 
2.8%
명성자원 1
 
2.8%
Other values (26) 26
72.2%
2023-12-16T15:47:59.150383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25
 
16.0%
24
 
15.4%
7
 
4.5%
6
 
3.8%
5
 
3.2%
4
 
2.6%
4
 
2.6%
4
 
2.6%
4
 
2.6%
4
 
2.6%
Other values (50) 69
44.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 154
98.7%
Close Punctuation 1
 
0.6%
Open Punctuation 1
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
16.2%
24
 
15.6%
7
 
4.5%
6
 
3.9%
5
 
3.2%
4
 
2.6%
4
 
2.6%
4
 
2.6%
4
 
2.6%
4
 
2.6%
Other values (48) 67
43.5%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 154
98.7%
Common 2
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
16.2%
24
 
15.6%
7
 
4.5%
6
 
3.9%
5
 
3.2%
4
 
2.6%
4
 
2.6%
4
 
2.6%
4
 
2.6%
4
 
2.6%
Other values (48) 67
43.5%
Common
ValueCountFrequency (%)
) 1
50.0%
( 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 154
98.7%
ASCII 2
 
1.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
25
 
16.2%
24
 
15.6%
7
 
4.5%
6
 
3.9%
5
 
3.2%
4
 
2.6%
4
 
2.6%
4
 
2.6%
4
 
2.6%
4
 
2.6%
Other values (48) 67
43.5%
ASCII
ValueCountFrequency (%)
) 1
50.0%
( 1
50.0%
Distinct28
Distinct (%)96.6%
Missing7
Missing (%)19.4%
Memory size420.0 B
2023-12-16T15:48:00.521312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length22
Mean length18.793103
Min length15

Characters and Unicode

Total characters545
Distinct characters80
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)93.1%

Sample

1st row전라북도 정읍시 우령4길 2
2nd row전라북도 정읍시 소성상평로 63
3rd row전라북도 정읍시 고부면 영원로 332(구,덕안철강)
4th row전라북도 정읍시 입석6길 17
5th row전라북도 정읍시 망제동 황토현로 938-5
ValueCountFrequency (%)
전라북도 29
23.2%
정읍시 29
23.2%
서부산업도로 4
 
3.2%
황토현로 2
 
1.6%
153 2
 
1.6%
왕림길 2
 
1.6%
태인면 2
 
1.6%
해평복룡길 2
 
1.6%
소성면 2
 
1.6%
월천길 2
 
1.6%
Other values (47) 49
39.2%
2023-12-16T15:48:02.361739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
96
17.6%
33
 
6.1%
33
 
6.1%
31
 
5.7%
30
 
5.5%
29
 
5.3%
29
 
5.3%
29
 
5.3%
1 20
 
3.7%
16
 
2.9%
Other values (70) 199
36.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 350
64.2%
Space Separator 96
 
17.6%
Decimal Number 88
 
16.1%
Dash Punctuation 8
 
1.5%
Open Punctuation 1
 
0.2%
Other Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
9.4%
33
 
9.4%
31
 
8.9%
30
 
8.6%
29
 
8.3%
29
 
8.3%
29
 
8.3%
16
 
4.6%
11
 
3.1%
7
 
2.0%
Other values (55) 102
29.1%
Decimal Number
ValueCountFrequency (%)
1 20
22.7%
3 12
13.6%
2 11
12.5%
5 11
12.5%
4 9
10.2%
6 8
 
9.1%
8 6
 
6.8%
7 6
 
6.8%
9 4
 
4.5%
0 1
 
1.1%
Space Separator
ValueCountFrequency (%)
96
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 350
64.2%
Common 195
35.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
9.4%
33
 
9.4%
31
 
8.9%
30
 
8.6%
29
 
8.3%
29
 
8.3%
29
 
8.3%
16
 
4.6%
11
 
3.1%
7
 
2.0%
Other values (55) 102
29.1%
Common
ValueCountFrequency (%)
96
49.2%
1 20
 
10.3%
3 12
 
6.2%
2 11
 
5.6%
5 11
 
5.6%
4 9
 
4.6%
- 8
 
4.1%
6 8
 
4.1%
8 6
 
3.1%
7 6
 
3.1%
Other values (5) 8
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 350
64.2%
ASCII 195
35.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
96
49.2%
1 20
 
10.3%
3 12
 
6.2%
2 11
 
5.6%
5 11
 
5.6%
4 9
 
4.6%
- 8
 
4.1%
6 8
 
4.1%
8 6
 
3.1%
7 6
 
3.1%
Other values (5) 8
 
4.1%
Hangul
ValueCountFrequency (%)
33
 
9.4%
33
 
9.4%
31
 
8.9%
30
 
8.6%
29
 
8.3%
29
 
8.3%
29
 
8.3%
16
 
4.6%
11
 
3.1%
7
 
2.0%
Other values (55) 102
29.1%
Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-16T15:48:04.062445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length19.833333
Min length16

Characters and Unicode

Total characters714
Distinct characters67
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row전라북도 정읍시 북면 한교리 399-1번지
2nd row전라북도 정읍시 신태인읍 우령리 262
3rd row전라북도 정읍시 북면 승부리 314-3번지
4th row전라북도 정읍시 소성면 등계리 835-14
5th row전라북도 정읍시 고부면 덕안리 2-12
ValueCountFrequency (%)
전라북도 36
22.8%
정읍시 36
22.8%
상평동 4
 
2.5%
하북동 4
 
2.5%
북면 3
 
1.9%
소성면 3
 
1.9%
등계리 2
 
1.3%
고천리 2
 
1.3%
구룡동 2
 
1.3%
농소동 2
 
1.3%
Other values (59) 64
40.5%
2023-12-16T15:48:05.825594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
122
17.1%
44
 
6.2%
38
 
5.3%
37
 
5.2%
36
 
5.0%
36
 
5.0%
36
 
5.0%
36
 
5.0%
- 26
 
3.6%
22
 
3.1%
Other values (57) 281
39.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 438
61.3%
Decimal Number 128
 
17.9%
Space Separator 122
 
17.1%
Dash Punctuation 26
 
3.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
10.0%
38
 
8.7%
37
 
8.4%
36
 
8.2%
36
 
8.2%
36
 
8.2%
36
 
8.2%
22
 
5.0%
18
 
4.1%
18
 
4.1%
Other values (45) 117
26.7%
Decimal Number
ValueCountFrequency (%)
1 22
17.2%
3 20
15.6%
2 18
14.1%
6 17
13.3%
4 16
12.5%
9 10
7.8%
5 9
7.0%
7 8
 
6.2%
8 4
 
3.1%
0 4
 
3.1%
Space Separator
ValueCountFrequency (%)
122
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 438
61.3%
Common 276
38.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
10.0%
38
 
8.7%
37
 
8.4%
36
 
8.2%
36
 
8.2%
36
 
8.2%
36
 
8.2%
22
 
5.0%
18
 
4.1%
18
 
4.1%
Other values (45) 117
26.7%
Common
ValueCountFrequency (%)
122
44.2%
- 26
 
9.4%
1 22
 
8.0%
3 20
 
7.2%
2 18
 
6.5%
6 17
 
6.2%
4 16
 
5.8%
9 10
 
3.6%
5 9
 
3.3%
7 8
 
2.9%
Other values (2) 8
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 438
61.3%
ASCII 276
38.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
122
44.2%
- 26
 
9.4%
1 22
 
8.0%
3 20
 
7.2%
2 18
 
6.5%
6 17
 
6.2%
4 16
 
5.8%
9 10
 
3.6%
5 9
 
3.3%
7 8
 
2.9%
Other values (2) 8
 
2.9%
Hangul
ValueCountFrequency (%)
44
 
10.0%
38
 
8.7%
37
 
8.4%
36
 
8.2%
36
 
8.2%
36
 
8.2%
36
 
8.2%
22
 
5.0%
18
 
4.1%
18
 
4.1%
Other values (45) 117
26.7%

전화번호
Text

MISSING 

Distinct6
Distinct (%)100.0%
Missing30
Missing (%)83.3%
Memory size420.0 B
2023-12-16T15:48:06.555086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters72
Distinct characters10
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)100.0%

Sample

1st row063-536-6908
2nd row063-535-1055
3rd row063-535-6439
4th row063-538-1178
5th row063-537-7494
ValueCountFrequency (%)
063-536-6908 1
16.7%
063-535-1055 1
16.7%
063-535-6439 1
16.7%
063-538-1178 1
16.7%
063-537-7494 1
16.7%
063-571-5001 1
16.7%
2023-12-16T15:48:07.883647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 12
16.7%
- 12
16.7%
5 11
15.3%
0 10
13.9%
6 9
12.5%
1 5
6.9%
7 4
 
5.6%
9 3
 
4.2%
8 3
 
4.2%
4 3
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 60
83.3%
Dash Punctuation 12
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 12
20.0%
5 11
18.3%
0 10
16.7%
6 9
15.0%
1 5
8.3%
7 4
 
6.7%
9 3
 
5.0%
8 3
 
5.0%
4 3
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 72
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 12
16.7%
- 12
16.7%
5 11
15.3%
0 10
13.9%
6 9
12.5%
1 5
6.9%
7 4
 
5.6%
9 3
 
4.2%
8 3
 
4.2%
4 3
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 72
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 12
16.7%
- 12
16.7%
5 11
15.3%
0 10
13.9%
6 9
12.5%
1 5
6.9%
7 4
 
5.6%
9 3
 
4.2%
8 3
 
4.2%
4 3
 
4.2%

담당부서
Categorical

CONSTANT 

Distinct1
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size420.0 B
정읍시 자원순환과
36 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정읍시 자원순환과
2nd row정읍시 자원순환과
3rd row정읍시 자원순환과
4th row정읍시 자원순환과
5th row정읍시 자원순환과

Common Values

ValueCountFrequency (%)
정읍시 자원순환과 36
100.0%

Length

2023-12-16T15:48:08.523995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T15:48:08.898225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정읍시 36
50.0%
자원순환과 36
50.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-15
36 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-12-15
2nd row2023-12-15
3rd row2023-12-15
4th row2023-12-15
5th row2023-12-15

Common Values

ValueCountFrequency (%)
2023-12-15 36
100.0%

Length

2023-12-16T15:48:09.640398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T15:48:10.315006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-12-15 36
100.0%

Correlations

2023-12-16T15:48:10.804611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상호명소재지도로명주소소재지지번주소전화번호
상호명1.0001.0001.0001.000
소재지도로명주소1.0001.0001.0001.000
소재지지번주소1.0001.0001.0001.000
전화번호1.0001.0001.0001.000

Missing values

2023-12-16T15:47:55.048900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-16T15:47:55.726382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-16T15:47:56.164772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

상호명소재지도로명주소소재지지번주소전화번호담당부서데이터기준일자
0광주고물상<NA>전라북도 정읍시 북면 한교리 399-1번지<NA>정읍시 자원순환과2023-12-15
1제일자원전라북도 정읍시 우령4길 2전라북도 정읍시 신태인읍 우령리 262<NA>정읍시 자원순환과2023-12-15
2상동고물상<NA>전라북도 정읍시 북면 승부리 314-3번지<NA>정읍시 자원순환과2023-12-15
3화성고물상전라북도 정읍시 소성상평로 63전라북도 정읍시 소성면 등계리 835-14<NA>정읍시 자원순환과2023-12-15
4하나철강전라북도 정읍시 고부면 영원로 332(구,덕안철강)전라북도 정읍시 고부면 덕안리 2-12<NA>정읍시 자원순환과2023-12-15
5(주)거룩전라북도 정읍시 입석6길 17전라북도 정읍시 고부면 입석리 496-1번지<NA>정읍시 자원순환과2023-12-15
6금송자원전라북도 정읍시 망제동 황토현로 938-5전라북도 정읍시 망제동 514-6<NA>정읍시 자원순환과2023-12-15
7금강자원전라북도 정읍시 군대길 76전라북도 정읍시 하북동 278-1번지<NA>정읍시 자원순환과2023-12-15
8금속자원<NA>전라북도 정읍시 신월동 804<NA>정읍시 자원순환과2023-12-15
9태영자원전라북도 정읍시 서부산업도로 542전라북도 정읍시 하북동 193-9번지<NA>정읍시 자원순환과2023-12-15
상호명소재지도로명주소소재지지번주소전화번호담당부서데이터기준일자
26동문자원전라북도 정읍시 황토현로 175전라북도 정읍시 이평면 마항리 372-1번지063-538-1178정읍시 자원순환과2023-12-15
27명성자원전라북도 정읍시 소성면 돌선길 4전라북도 정읍시 소성면 보화리 347-4<NA>정읍시 자원순환과2023-12-15
28백억자원전라북도 정읍시 정읍북로 133-6전라북도 정읍시 구룡동 595-1<NA>정읍시 자원순환과2023-12-15
29오선철강전라북도 정읍시 소성면 소성로 394전라북도 정읍시 소성면 등계리 170-4063-537-7494정읍시 자원순환과2023-12-15
30우주비철전라북도 정읍시 해평복룡길 16전라북도 정읍시 농소동 555번지<NA>정읍시 자원순환과2023-12-15
31신태인광주자원전라북도 정읍시 신태인읍 신태인동길 23전라북도 정읍시 신태인읍 신태인동길 23063-571-5001정읍시 자원순환과2023-12-15
32서해자원전라북도 정읍시 감곡면 하평2길 7-5전라북도 정읍시 감곡면 삼평리 442-1<NA>정읍시 자원순환과2023-12-15
33왕림산업전라북도 정읍시 태인면 왕림길 148-62전라북도 정읍시 태인면 고천리 607<NA>정읍시 자원순환과2023-12-15
34광야무역전라북도 정읍시 북면 태곡리 512-38전라북도 정읍시 북면 칠북로 242<NA>정읍시 자원순환과2023-12-15
35오성자원전라북도 정읍시 하북동 155-27전라북도 정읍시 정신로 69-18<NA>정읍시 자원순환과2023-12-15