.. image:: https://travis-ci.org/karolyi/py3-validate-email.svg?branch=master
    :target: https://travis-ci.org/karolyi/py3-validate-email
.. image:: https://bmc-cdn.nyc3.digitaloceanspaces.com/BMC-button-images/custom_images/orange_img.png
    :target: https://buymeacoff.ee/karolyi

============================
py3-validate-email
============================

py3-validate-email is a Python package that checks whether an email address is properly formatted, not blacklisted, and really exists.

This module is for Python 3.6 and above!

INSTALLATION
============================

You can install the package with pip::

    python -m pip install py3-validate-email

USAGE
============================

Basic usage::

    from validate_email import validate_email
    is_valid = validate_email(email_address='example@example.com', check_regex=True, check_mx=True, from_address='my@from.addr.ess', helo_host='my.host.name', smtp_timeout=10, dns_timeout=10, use_blacklist=True, debug=False)

:code:`check_regex`: check whether the email address has a valid structure. Defaults to :code:`True`.

:code:`check_mx`: check the MX records and check whether the email address actually exists.

:code:`from_address`: the email address the probe will be sent from.

:code:`helo_host`: the host name to use in the SMTP HELO when checking an email address.

:code:`smtp_timeout`: seconds until SMTP timeout.

:code:`dns_timeout`: seconds until DNS timeout.

:code:`use_blacklist`: use the blacklist of domains downloaded from https://github.com/martenson/disposable-email-domains

:code:`debug`: emit debug/warning messages while checking the email address.

:code:`skip_smtp`: (default :code:`False`) skip the SMTP conversation with the server after the MX checks. Will automatically be set to :code:`True` when :code:`check_mx` is :code:`False`!

The function :code:`validate_email_or_fail()` works exactly like :code:`validate_email`, except that it raises an exception in the case of validation failure instead of returning :code:`False`.
The module will try to negotiate a TLS connection with STARTTLS, and silently fall back to an unencrypted SMTP connection if the server doesn't support it.
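
The fallback described above can be sketched with the standard library's ``smtplib``. This is an illustration under stated assumptions, not the module's actual code: the function name ``negotiate_tls`` is hypothetical, and it expects an already connected ``smtplib.SMTP`` object (or anything with the same ``ehlo``, ``has_extn`` and ``starttls`` methods).

```python
import smtplib


def negotiate_tls(smtp) -> bool:
    """Try to upgrade an open SMTP session to TLS; return True if it worked."""
    smtp.ehlo()
    if not smtp.has_extn('starttls'):
        # The server does not advertise STARTTLS: stay unencrypted.
        return False
    try:
        smtp.starttls()
    except smtplib.SMTPException:
        # The upgrade was refused: silently continue unencrypted.
        return False
    # Capabilities have to be requested again after the TLS handshake.
    smtp.ehlo()
    return True
```

With a real connection this could be called as ``negotiate_tls(smtplib.SMTP(mx_host, 25, timeout=10))``, where ``mx_host`` is a placeholder for the mail server's host name.
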

Auto-updater
============================

The package contains an auto-updater for downloading and updating the built-in blacklist.txt. It runs on each module load (and on installation), but only tries to update the content if the file is older than 5 days and the remote content differs from what is already downloaded.

The update can be triggered manually::

    from validate_email.updater import update_builtin_blacklist

    # Signature: update_builtin_blacklist(force: bool = False,
    #     background: bool = True, callback: Callable = None) -> Optional[Thread]
    update_builtin_blacklist(force=False, background=True, callback=None)

:code:`force`: forces the update even if the downloaded/installed file is fresh enough.

:code:`background`: starts the update in a ``Thread`` so it won't make your code hang while it's updating. If you set this to :code:`True`, the function will return the ``Thread`` used for starting the update so you can ``join()`` it if necessary.

:code:`callback`: an optional ``Callable`` (function/method) to be called when the update is done.
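
The freshness rule mentioned for the auto-updater (only refresh when the file is older than 5 days) can be sketched as follows. How the package measures the file's age is not stated here; this sketch assumes the file's modification time, and the name ``is_stale`` is hypothetical.

```python
import time
from pathlib import Path
from typing import Optional

MAX_AGE_SECONDS = 5 * 24 * 60 * 60  # the 5-day threshold


def is_stale(path: Path, now: Optional[float] = None) -> bool:
    """Return True if the file is missing or was modified over 5 days ago."""
    if not path.exists():
        return True
    current = time.time() if now is None else now
    # Assumption: age is judged by the modification timestamp.
    return current - path.stat().st_mtime > MAX_AGE_SECONDS
```

A forced update, as with :code:`force`, would simply skip this check.
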

FAQ:
========

The module provides false positives:
------------------------------------

Some SMTP servers (Yahoo's servers, for example) only reject nonexistent email addresses after the end of the ``DATA`` command has been sent in the conversation with the server. This module only goes as far as ``RCPT TO`` and reports the address as valid if it doesn't get rejected there, since the ``DATA`` part of the conversation is the email body itself. There's not much one can do about it: you have to accept false positives in the case of yahoo.com and some other providers. I'm not sure if rejecting emails after the ``DATA`` command is valid behavior according to the SMTP RFC, but I wouldn't be surprised if it isn't.
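
To make the reasoning concrete, here is a rough sketch of how the reply to ``RCPT TO`` could be read. The function name and the verdict labels are hypothetical and not taken from the module; the sketch only relies on the general SMTP convention that 2xx replies mean success, 4xx a transient failure, and 5xx a permanent failure.

```python
def classify_rcpt_reply(code: int) -> str:
    """Map the SMTP reply code of RCPT TO to a coarse verdict."""
    if 200 <= code < 300:
        # Not rejected at RCPT TO; the server may still reject after DATA,
        # which is where the false positives come from.
        return 'accepted'
    if 400 <= code < 500:
        # Transient failure: the address can be neither confirmed nor denied.
        return 'ambiguous'
    if 500 <= code < 600:
        # Permanent failure: the server refuses this recipient.
        return 'rejected'
    return 'unknown'
```

A server that answers 250 to every recipient and only bounces later will always land in the ``'accepted'`` branch, no matter whether the mailbox exists.
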

Everything gets rejected:
-------------------------

Check if you have port 25 access from your IP to the accepting server's IP. Even if you do, the server might use RBLs (spamhaus.org lists, for example), and your IP might get rejected because it is listed in one of the lists used by the email server. Your best bet is to use this module on another server that delivers emails, thus eliminating the chance of being blacklisted.

I can't check thousands of emails!
----------------------------------

This module is a tool; every tool can become a weapon if not used properly. In my case, I use this module to check email address validity at registration time, so not thousands at once. Checking thousands at once might make you (your IP) end up in one of the aforementioned blocklists, as providers will detect you as a possible spammer. In short, I would advise against your use case.

My email doesn't check out!
---------------------------

Run this code with the module installed (use your parameters within), and see the output::

    python -c 'import logging, sys; logging.basicConfig(stream=sys.stderr, level=logging.DEBUG); from validate_email import validate_email; print(validate_email("your.email@address.com", check_mx=True, debug=True))'

If you still don't understand why your code doesn't work as expected by looking at the logs, then (and only then) add an issue explaining your problem with a REPRODUCIBLE example, and the output of your test run.