Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 26 |
Duplicate rows (%) | 0.3% |
Total size in memory | 546.9 KiB |
Average record size in memory | 56.0 B |
Variable types
Text | 2 |
---|---|
Categorical | 2 |
DateTime | 2 |
Dataset
Description | 2022년 12월 한달 동안 e프라이버시 클린서비스 웹페이지에서 발생한 민원에 대한 데이터로, 사업체명,도메인주소, 민원 구분 등의 데이터를 제공합니다. |
---|---|
Author | 개인정보보호위원회 |
URL | https://www.data.go.kr/data/15119766/fileData.do |
Dataset has 26 (0.3%) duplicate rows | Duplicates |
민원 구분-상세 is highly overall correlated with 민원 구분 | High correlation |
민원 구분 is highly overall correlated with 민원 구분-상세 | High correlation |
민원 구분 is highly imbalanced (79.8%) | Imbalance |
민원 구분-상세 is highly imbalanced (85.2%) | Imbalance |
Reproduction
Analysis started | 2023-12-12 17:38:50.046132 |
---|---|
Analysis finished | 2023-12-12 17:38:50.933198 |
Duration | 0.89 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
사업체명
Text
Distinct | 1937 |
---|---|
Distinct (%) | 19.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
null | 198 | 1.8% |
주식회사 | 163 | 1.5% |
kb손해보험 | 155 | 1.4% |
고용정보원 | 153 | 1.4% |
주)이베이코리아 | 119 | 1.1% |
엔에이치엔(주 | 119 | 1.1% |
롯데멤버스(주 | 117 | 1.1% |
위대한상상(요기요 | 113 | 1.0% |
주)현대백화점 | 106 | 1.0% |
주)11번가 | 105 | 1.0% |
Other values (2047) | 9432 |
Most occurring characters
Value | Count | Frequency (%) |
( | 6181 | 8.0% |
) | 6181 | 8.0% |
주 | 5801 | 7.5% |
이 | 2831 | 3.7% |
스 | 2191 | 2.8% |
리 | 1265 | 1.6% |
아 | 1234 | 1.6% |
트 | 862 | 1.1% |
한 | 808 | 1.0% |
802 | 1.0% | |
Other values (714) | 48853 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 59946 | |
Open Punctuation | 6181 | 8.0% |
Close Punctuation | 6181 | 8.0% |
Uppercase Letter | 3036 | 3.9% |
Space Separator | 802 | 1.0% |
Decimal Number | 390 | 0.5% |
Lowercase Letter | 348 | 0.5% |
Other Symbol | 70 | 0.1% |
Other Punctuation | 38 | < 0.1% |
Dash Punctuation | 10 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 5801 | 9.7% |
이 | 2831 | 4.7% |
스 | 2191 | 3.7% |
리 | 1265 | 2.1% |
아 | 1234 | 2.1% |
트 | 862 | 1.4% |
한 | 808 | 1.3% |
사 | 779 | 1.3% |
코 | 760 | 1.3% |
에 | 738 | 1.2% |
Other values (644) | 42677 |
Uppercase Letter
Value | Count | Frequency (%) |
L | 568 | |
N | 301 | |
K | 269 | |
S | 224 | 7.4% |
B | 221 | 7.3% |
U | 209 | 6.9% |
G | 191 | 6.3% |
T | 148 | 4.9% |
C | 127 | 4.2% |
E | 115 | 3.8% |
Other values (15) | 663 |
Lowercase Letter
Value | Count | Frequency (%) |
s | 43 | |
e | 33 | 9.5% |
p | 25 | 7.2% |
t | 25 | 7.2% |
g | 23 | 6.6% |
r | 23 | 6.6% |
a | 22 | 6.3% |
o | 22 | 6.3% |
i | 21 | 6.0% |
c | 21 | 6.0% |
Other values (13) | 90 |
Decimal Number
Value | Count | Frequency (%) |
1 | 224 | |
3 | 42 | 10.8% |
6 | 27 | 6.9% |
2 | 23 | 5.9% |
8 | 20 | 5.1% |
4 | 15 | 3.8% |
9 | 14 | 3.6% |
5 | 13 | 3.3% |
0 | 11 | 2.8% |
7 | 1 | 0.3% |
Other Punctuation
Value | Count | Frequency (%) |
& | 20 | |
. | 6 | 15.8% |
/ | 5 | 13.2% |
, | 3 | 7.9% |
· | 3 | 7.9% |
' | 1 | 2.6% |
Open Punctuation
Value | Count | Frequency (%) |
( | 6181 |
Close Punctuation
Value | Count | Frequency (%) |
) | 6181 |
Space Separator
Value | Count | Frequency (%) |
802 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 70 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 7 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 60016 | |
Common | 13609 | 17.7% |
Latin | 3384 | 4.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 5801 | 9.7% |
이 | 2831 | 4.7% |
스 | 2191 | 3.7% |
리 | 1265 | 2.1% |
아 | 1234 | 2.1% |
트 | 862 | 1.4% |
한 | 808 | 1.3% |
사 | 779 | 1.3% |
코 | 760 | 1.3% |
에 | 738 | 1.2% |
Other values (645) | 42747 |
Latin
Value | Count | Frequency (%) |
L | 568 | |
N | 301 | 8.9% |
K | 269 | 7.9% |
S | 224 | 6.6% |
B | 221 | 6.5% |
U | 209 | 6.2% |
G | 191 | 5.6% |
T | 148 | 4.4% |
C | 127 | 3.8% |
E | 115 | 3.4% |
Other values (38) | 1011 |
Common
Value | Count | Frequency (%) |
( | 6181 | |
) | 6181 | |
802 | 5.9% | |
1 | 224 | 1.6% |
3 | 42 | 0.3% |
6 | 27 | 0.2% |
2 | 23 | 0.2% |
8 | 20 | 0.1% |
& | 20 | 0.1% |
4 | 15 | 0.1% |
Other values (11) | 74 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 59946 | |
ASCII | 16990 | 22.1% |
None | 73 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
( | 6181 | |
) | 6181 | |
802 | 4.7% | |
L | 568 | 3.3% |
N | 301 | 1.8% |
K | 269 | 1.6% |
S | 224 | 1.3% |
1 | 224 | 1.3% |
B | 221 | 1.3% |
U | 209 | 1.2% |
Other values (58) | 1810 | 10.7% |
Hangul
Value | Count | Frequency (%) |
주 | 5801 | 9.7% |
이 | 2831 | 4.7% |
스 | 2191 | 3.7% |
리 | 1265 | 2.1% |
아 | 1234 | 2.1% |
트 | 862 | 1.4% |
한 | 808 | 1.3% |
사 | 779 | 1.3% |
코 | 760 | 1.3% |
에 | 738 | 1.2% |
Other values (644) | 42677 |
None
Value | Count | Frequency (%) |
㈜ | 70 | |
· | 3 | 4.1% |
도메인 주소
Text
Distinct | 2155 |
---|---|
Distinct (%) | 21.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 34 |
---|---|
Median length | 28 |
Mean length | 12.9571 |
Min length | 6 |
Characters and Unicode
Total characters | 129571 |
---|---|
Distinct characters | 90 |
Distinct categories | 8 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 1139 ? |
---|---|
Unique (%) | 11.4% |
Sample
1st row | hansung.ac.kr |
---|---|
2nd row | (NULL) |
3rd row | license.kacpta.or.kr |
4th row | youth.seoul.go.kr |
5th row | cpoint.or.kr |
Value | Count | Frequency (%) |
null | 198 | 2.0% |
kbinsure.co.kr | 155 | 1.6% |
work.go.kr | 144 | 1.4% |
lpoint.com | 115 | 1.1% |
yogiyo.co.kr | 113 | 1.1% |
hangame.com | 108 | 1.1% |
ehyundai.com | 105 | 1.1% |
interpark.com | 99 | 1.0% |
gmarket.co.kr | 98 | 1.0% |
tmoney.co.kr | 94 | 0.9% |
Other values (2145) | 8771 |
Most occurring characters
Value | Count | Frequency (%) |
. | 15858 | |
o | 15405 | |
c | 10784 | 8.3% |
r | 9274 | 7.2% |
m | 7822 | 6.0% |
k | 7602 | 5.9% |
e | 7449 | 5.7% |
a | 7352 | 5.7% |
n | 6025 | 4.6% |
i | 5046 | 3.9% |
Other values (80) | 36954 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 110697 | |
Other Punctuation | 15930 | 12.3% |
Decimal Number | 1368 | 1.1% |
Uppercase Letter | 792 | 0.6% |
Dash Punctuation | 295 | 0.2% |
Open Punctuation | 200 | 0.2% |
Close Punctuation | 200 | 0.2% |
Other Letter | 89 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
자 | 14 | 15.7% |
문 | 13 | 14.6% |
코 | 4 | 4.5% |
링 | 4 | 4.5% |
힐 | 4 | 4.5% |
드 | 4 | 4.5% |
지 | 3 | 3.4% |
부 | 2 | 2.2% |
블 | 2 | 2.2% |
리 | 2 | 2.2% |
Other values (35) | 37 |
Lowercase Letter
Value | Count | Frequency (%) |
o | 15405 | |
c | 10784 | 9.7% |
r | 9274 | 8.4% |
m | 7822 | 7.1% |
k | 7602 | 6.9% |
e | 7449 | 6.7% |
a | 7352 | 6.6% |
n | 6025 | 5.4% |
i | 5046 | 4.6% |
t | 4463 | 4.0% |
Other values (16) | 29475 |
Decimal Number
Value | Count | Frequency (%) |
1 | 437 | |
2 | 250 | |
4 | 165 | 12.1% |
9 | 160 | 11.7% |
0 | 100 | 7.3% |
8 | 59 | 4.3% |
5 | 59 | 4.3% |
6 | 57 | 4.2% |
3 | 43 | 3.1% |
7 | 38 | 2.8% |
Other Punctuation
Value | Count | Frequency (%) |
. | 15858 | |
/ | 40 | 0.3% |
: | 32 | 0.2% |
Uppercase Letter
Value | Count | Frequency (%) |
L | 396 | |
U | 198 | |
N | 198 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 295 |
Open Punctuation
Value | Count | Frequency (%) |
( | 200 |
Close Punctuation
Value | Count | Frequency (%) |
) | 200 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 111489 | |
Common | 17993 | 13.9% |
Hangul | 89 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
자 | 14 | 15.7% |
문 | 13 | 14.6% |
코 | 4 | 4.5% |
링 | 4 | 4.5% |
힐 | 4 | 4.5% |
드 | 4 | 4.5% |
지 | 3 | 3.4% |
부 | 2 | 2.2% |
블 | 2 | 2.2% |
리 | 2 | 2.2% |
Other values (35) | 37 |
Latin
Value | Count | Frequency (%) |
o | 15405 | |
c | 10784 | 9.7% |
r | 9274 | 8.3% |
m | 7822 | 7.0% |
k | 7602 | 6.8% |
e | 7449 | 6.7% |
a | 7352 | 6.6% |
n | 6025 | 5.4% |
i | 5046 | 4.5% |
t | 4463 | 4.0% |
Other values (19) | 30267 |
Common
Value | Count | Frequency (%) |
. | 15858 | |
1 | 437 | 2.4% |
- | 295 | 1.6% |
2 | 250 | 1.4% |
( | 200 | 1.1% |
) | 200 | 1.1% |
4 | 165 | 0.9% |
9 | 160 | 0.9% |
0 | 100 | 0.6% |
8 | 59 | 0.3% |
Other values (6) | 269 | 1.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 129482 | |
Hangul | 89 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 15858 | |
o | 15405 | |
c | 10784 | 8.3% |
r | 9274 | 7.2% |
m | 7822 | 6.0% |
k | 7602 | 5.9% |
e | 7449 | 5.8% |
a | 7352 | 5.7% |
n | 6025 | 4.7% |
i | 5046 | 3.9% |
Other values (35) | 36865 |
Hangul
Value | Count | Frequency (%) |
자 | 14 | 15.7% |
문 | 13 | 14.6% |
코 | 4 | 4.5% |
링 | 4 | 4.5% |
힐 | 4 | 4.5% |
드 | 4 | 4.5% |
지 | 3 | 3.4% |
부 | 2 | 2.2% |
블 | 2 | 2.2% |
리 | 2 | 2.2% |
Other values (35) | 37 |
민원 구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
회원탈퇴 | |
---|---|
개인정보 | 316 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 회원탈퇴 |
---|---|
2nd row | 회원탈퇴 |
3rd row | 회원탈퇴 |
4th row | 회원탈퇴 |
5th row | 회원탈퇴 |
Common Values
Value | Count | Frequency (%) |
회원탈퇴 | 9684 | |
개인정보 | 316 | 3.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
회원탈퇴 | 9684 | |
개인정보 | 316 | 3.2% |
민원 구분-상세
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
탈퇴 | |
---|---|
열람 | 163 |
처리정지 | 153 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.0306 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 탈퇴 |
---|---|
2nd row | 탈퇴 |
3rd row | 탈퇴 |
4th row | 탈퇴 |
5th row | 탈퇴 |
Common Values
Value | Count | Frequency (%) |
탈퇴 | 9684 | |
열람 | 163 | 1.6% |
처리정지 | 153 | 1.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
탈퇴 | 9684 | |
열람 | 163 | 1.6% |
처리정지 | 153 | 1.5% |
신청일자
Date
Distinct | 5306 |
---|---|
Distinct (%) | 53.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2022-12-01 00:06:00 |
---|---|
Maximum | 2022-12-31 23:57:00 |
완료일자
Date
Distinct | 2874 |
---|---|
Distinct (%) | 28.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2022-12-01 01:22:00 |
---|---|
Maximum | 2023-05-04 12:02:00 |
민원 구분 | 민원 구분-상세 | |
---|---|---|
민원 구분 | 1.000 | 1.000 |
민원 구분-상세 | 1.000 | 1.000 |
민원 구분-상세 | 민원 구분 | |
---|---|---|
민원 구분-상세 | 1.000 | 1.000 |
민원 구분 | 1.000 | 1.000 |
민원 구분 | 민원 구분-상세 | |
---|---|---|
민원 구분 | 1.000 | 1.000 |
민원 구분-상세 | 1.000 | 1.000 |
사업체명 | 도메인 주소 | 민원 구분 | 민원 구분-상세 | 신청일자 | 완료일자 | |
---|---|---|---|---|---|---|
36518 | 한성대학교 | hansung.ac.kr | 회원탈퇴 | 탈퇴 | 2022-12-23 14:25 | 2022-12-23 14:26 |
30703 | (NULL) | (NULL) | 회원탈퇴 | 탈퇴 | 2022-12-19 22:39 | 2022-12-28 11:01 |
5639 | 한국세무사회 | license.kacpta.or.kr | 회원탈퇴 | 탈퇴 | 2022-12-04 20:45 | 2022-12-21 19:43 |
44876 | 서울특별시미래청년기획단 | youth.seoul.go.kr | 회원탈퇴 | 탈퇴 | 2022-12-28 17:56 | 2023-01-17 13:40 |
2973 | 한국환경공단 | cpoint.or.kr | 회원탈퇴 | 탈퇴 | 2022-12-02 17:20 | 2023-02-01 18:00 |
11208 | KB손해보험 | kbinsure.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-08 01:06 | 2022-12-13 19:10 |
22361 | 로카모빌리티 | cashbee.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-14 16:57 | 2023-02-14 14:14 |
41086 | 블리자드엔터테인먼트(유) | blizzard.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-26 18:27 | 2023-02-02 15:11 |
38491 | 러브밤 | lovebam.kr | 회원탈퇴 | 탈퇴 | 2022-12-24 22:07 | 2023-04-20 17:10 |
18814 | (주)팬딩 | fanding.kr | 회원탈퇴 | 탈퇴 | 2022-12-12 20:52 | 2023-02-13 18:34 |
사업체명 | 도메인 주소 | 민원 구분 | 민원 구분-상세 | 신청일자 | 완료일자 | |
---|---|---|---|---|---|---|
4885 | 본아이에프(주) | bonif.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-04 10:27 | 2022-12-23 08:43 |
22211 | (NULL) | (NULL) | 회원탈퇴 | 탈퇴 | 2022-12-14 16:06 | 2022-12-27 16:45 |
40977 | 주식회사 위대한상상(요기요) | yogiyo.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-26 17:26 | 2023-01-19 11:44 |
42606 | BS몰 | banana69.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-27 15:56 | 2023-04-20 17:10 |
45826 | (주)한섬 | sign.handsome.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-29 12:39 | 2023-01-13 16:27 |
4921 | 국민권익위원회 | epeople.go.kr | 회원탈퇴 | 탈퇴 | 2022-12-04 10:52 | 2022-12-15 19:16 |
20221 | (주)알라딘커뮤니케이션 | aladin.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-13 15:08 | 2022-12-21 19:30 |
746 | 콘텐츠웨이브(주) | wavve.com | 회원탈퇴 | 탈퇴 | 2022-12-01 11:42 | 2022-12-19 19:44 |
46666 | AKS&D(주)AK인터넷쇼핑몰 | akmall.com | 회원탈퇴 | 탈퇴 | 2022-12-29 20:20 | 2023-01-30 14:34 |
45602 | (주)이랜드이츠 | elandeat.com | 회원탈퇴 | 탈퇴 | 2022-12-29 09:59 | 2023-03-02 22:47 |
Most frequently occurring
사업체명 | 도메인 주소 | 민원 구분 | 민원 구분-상세 | 신청일자 | 완료일자 | # duplicates | |
---|---|---|---|---|---|---|---|
7 | (주)커리어케어 | careercare.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-31 01:35 | 2023-01-13 16:23 | 3 |
8 | (주)티머니 | tmoney.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-13 19:47 | 2022-12-23 19:10 | 3 |
24 | 한국전력공사 | recruit.kepco.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-13 19:47 | 2023-03-20 15:53 | 3 |
0 | (주)11번가 | 11st.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-30 14:45 | 2023-03-08 13:44 | 2 |
1 | (주)교보문고 | mobile.kyobobook.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-29 15:11 | 2023-02-10 15:05 | 2 |
2 | (주)데일리펀딩 | daily-funding.com | 회원탈퇴 | 탈퇴 | 2022-12-10 14:53 | 2022-12-14 19:19 | 2 |
3 | (주)번개장터 | bunjang.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-16 17:04 | 2023-01-16 19:42 | 2 |
4 | (주)에이비씨마트코리아 | abcmart.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-16 11:56 | 2022-12-22 19:08 | 2 |
5 | (주)위메프 | wemakeprice.com | 회원탈퇴 | 탈퇴 | 2022-12-24 23:26 | 2023-01-20 12:13 | 2 |
6 | (주)지니뮤직 | genie.co.kr | 회원탈퇴 | 탈퇴 | 2022-12-09 19:23 | 2022-12-23 19:19 | 2 |