diff --git a/CODE_OF_CONDUCT.md b/CODE_OF_CONDUCT.md
new file mode 100644
index 0000000..eb30787
--- /dev/null
+++ b/CODE_OF_CONDUCT.md
@@ -0,0 +1,5 @@
+## Code of conduct
+
+Be respectful to each other.
+
+That shouldn't be too hard, right? :smiley:
diff --git a/README.MD b/README.MD
index e65dd08..7889b34 100644
--- a/README.MD
+++ b/README.MD
@@ -1,3 +1,6 @@
+#### ⚠Warning: This script is not ready for production use.⚠
+*Not all tables are parseable yet. Please refer to the "Capabilities" section for a list of supported table types.*
+
 # Html2Dict
 
 Simple html tables extractor.
@@ -7,114 +10,143 @@ Simple html tables extractor.
 * Python 3.6+
 * Python module:
   * [lxml](https://lxml.de/)
+  * [requests](http://docs.python-requests.org/en/master/)
   
 ## Installing
 
-1. `pip install html2dict`
+Create and activate a new Python virtual environment, then install this dev branch with:
+  * `pip3 install html2dict`
+
+## Capabilities
+
+List of table types currently supported:
+  * Basic table without headers. 
+  * Basic table with headers.
+  * Complex tables with merged headers.
+
+List of table types **not** currently supported:
+  * Any tables embedded in iframes.
+  * Tables with vertical headers (scope="col").
+  * Tables with a new header row after the first set of data.
+  * Tables with headers merged across multiple levels.
+
+This project is still very new. If the type of table you are parsing is not in either list, please let me know the outcome.
 
 ## Usage
 
-* Start by instantiating the class with an html string. (I used requests in this example but opening an html file would work just fine)
+Start by importing the desired type of extractor (only one is currently available).
 ```Python
-from html2dict import Html2Dict
-import requests
+from html2dict.extractors import BasicTableExtractor
+``` 
+
+Then instantiate an object with one of the three constructors provided:
+```python
+my_extractor = BasicTableExtractor.from_html_string(html_string=<html_string>)
+
+# or 
+
+my_extractor = BasicTableExtractor.from_html_file(html_file=<relative_or_absolute_filepath>)
 
-my_website = requests.get(url="https://www.python.org/downloads/release/python-370/")
-extractor = Html2Dict(html_string=my_website.text)
+# or
+
+my_extractor = BasicTableExtractor.from_url(url=<url>)
 ``` 
 
-* The object starts with an attribute 'tables' containing all the tables in the html provided as raw html elements.
+You can access the extracted tables from the `basic_tables` attribute.
+
+```python
+my_extractor.basic_tables
+```
+
+Finally, the data of a table can be accessed from its `data_rows` or `rows` attribute.
 
 ```python
->>> extractor.tables
-
-...{'table_0': {'data_rows': [<Element tr at 0x1034e1458>,
-...                           <Element tr at 0x1034e14a8>,
-...                           <Element tr at 0x1034e1598>,
-...                           <Element tr at 0x1034e15e8>,
-...                           <Element tr at 0x1034e1638>,
-...                           <Element tr at 0x1034e1688>,
-...                           <Element tr at 0x1034e16d8>,
-...                           <Element tr at 0x1034e1728>,
-...                           <Element tr at 0x1034e1778>,
-...                           <Element tr at 0x1034e17c8>,
-...                           <Element tr at 0x1034e1818>],
-...             'header_rows': [<Element tr at 0x1034e1548>]}}
+my_extractor.basic_tables[<table_name>].rows
 ```
 
- * The only table extractor method implemented so far is 'basic_tables'. It returns a dict of table where each table is a tuple of dict if the base table had headers otherwise it is a simple list.  
- 
- ```python
->>> extractor.basic_tables()
-
-...{'table_0': ({'Description': 'n/a',
-...              'File Size': '22745726',
-...              'GPG': 'SIG',
-...              'MD5 Sum': '41b6595deb4147a1ed517a7d9a580271',
-...              'Operating System': 'Source release',
-...              'Version': 'Gzipped source tarball'},
-...             {'Description': 'n/a',
-...              'File Size': '16922100',
-...              'GPG': 'SIG',
-...              'MD5 Sum': 'eb8c2a6b1447d50813c02714af4681f3',
-...              'Operating System': 'Source release',
-...              'Version': 'XZ compressed source tarball'},
-...             {'Description': 'for Mac OS X 10.6 and later',
-...              'File Size': '34274481',
-...              'GPG': 'SIG',
-...              'MD5 Sum': 'ca3eb84092d0ff6d02e42f63a734338e',
-...              'Operating System': 'Mac OS X',
-...              'Version': 'macOS 64-bit/32-bit installer'},
-...             {'Description': 'for OS X 10.9 and later',
-...              'File Size': '27651276',
-...              'GPG': 'SIG',
-...              'MD5 Sum': 'ae0717a02efea3b0eb34aadc680dc498',
-...              'Operating System': 'Mac OS X',
-...              'Version': 'macOS 64-bit installer'},
-...             {'Description': 'n/a',
-...              'File Size': '8547689',
-...              'GPG': 'SIG',
-...              'MD5 Sum': '46562af86c2049dd0cc7680348180dca',
-...              'Operating System': 'Windows',
-...              'Version': 'Windows help file'},
-...             {'Description': 'for AMD64/EM64T/x64',
-...              'File Size': '6946082',
-...              'GPG': 'SIG',
-...              'MD5 Sum': 'cb8b4f0d979a36258f73ed541def10a5',
-...              'Operating System': 'Windows',
-...              'Version': 'Windows x86-64 embeddable zip file'},
-...             {'Description': 'for AMD64/EM64T/x64',
-...              'File Size': '26262280',
-...              'GPG': 'SIG',
-...              'MD5 Sum': '531c3fc821ce0a4107b6d2c6a129be3e',
-...              'Operating System': 'Windows',
-...              'Version': 'Windows x86-64 executable installer'},
-...             {'Description': 'for AMD64/EM64T/x64',
-...              'File Size': '1327160',
-...              'GPG': 'SIG',
-...              'MD5 Sum': '3cfdaf4c8d3b0475aaec12ba402d04d2',
-...              'Operating System': 'Windows',
-...              'Version': 'Windows x86-64 web-based installer'},
-...             {'Description': 'n/a',
-...              'File Size': '6395982',
-...              'GPG': 'SIG',
-...              'MD5 Sum': 'ed9a1c028c1e99f5323b9c20723d7d6f',
-...              'Operating System': 'Windows',
-...              'Version': 'Windows x86 embeddable zip file'},
-...             {'Description': 'n/a',
-...              'File Size': '25506832',
-...              'GPG': 'SIG',
-...              'MD5 Sum': 'ebb6444c284c1447e902e87381afeff0',
-...              'Operating System': 'Windows',
-...              'Version': 'Windows x86 executable installer'},
-...             {'Description': 'n/a',
-...              'File Size': '1298280',
-...              'GPG': 'SIG',
-...              'MD5 Sum': '779c4085464eb3ee5b1a4fffd0eabca4',
-...              'Operating System': 'Windows',
-...              'Version': 'Windows x86 web-based installer'})}
-
-
-
-
-```
\ No newline at end of file
+## Examples
+
+* For https://www.python.org/downloads/release/python-370/
+
+```python
+from pprint import pprint
+my_extractor = BasicTableExtractor.from_url(url="https://www.python.org/downloads/release/python-370/")
+my_extractor.basic_tables
+# {'table_0': <html2dict.Table object at 0x10700c828>}
+
+pprint(my_extractor.basic_tables['table_0'].rows)
+
+{'data': [{'Description': 'n/a',
+           'File Size': '22745726',
+           'GPG': 'SIG',
+           'MD5 Sum': '41b6595deb4147a1ed517a7d9a580271',
+           'Operating System': 'Source release',
+           'Version': 'Gzipped source tarball'},
+          {'Description': 'n/a',
+           'File Size': '16922100',
+           'GPG': 'SIG',
+           'MD5 Sum': 'eb8c2a6b1447d50813c02714af4681f3',
+           'Operating System': 'Source release',
+           'Version': 'XZ compressed source tarball'},
+          {'Description': 'for Mac OS X 10.6 and later',
+           'File Size': '34274481',
+           'GPG': 'SIG',
+           'MD5 Sum': 'ca3eb84092d0ff6d02e42f63a734338e',
+           'Operating System': 'Mac OS X',
+           'Version': 'macOS 64-bit/32-bit installer'},
+          {'Description': 'for OS X 10.9 and later',
+           'File Size': '27651276',
+           'GPG': 'SIG',
+           'MD5 Sum': 'ae0717a02efea3b0eb34aadc680dc498',
+           'Operating System': 'Mac OS X',
+           'Version': 'macOS 64-bit installer'},
+          {'Description': 'n/a',
+           'File Size': '8547689',
+           'GPG': 'SIG',
+           'MD5 Sum': '46562af86c2049dd0cc7680348180dca',
+           'Operating System': 'Windows',
+           'Version': 'Windows help file'},
+          {'Description': 'for AMD64/EM64T/x64',
+           'File Size': '6946082',
+           'GPG': 'SIG',
+           'MD5 Sum': 'cb8b4f0d979a36258f73ed541def10a5',
+           'Operating System': 'Windows',
+           'Version': 'Windows x86-64 embeddable zip file'},
+          {'Description': 'for AMD64/EM64T/x64',
+           'File Size': '26262280',
+           'GPG': 'SIG',
+           'MD5 Sum': '531c3fc821ce0a4107b6d2c6a129be3e',
+           'Operating System': 'Windows',
+           'Version': 'Windows x86-64 executable installer'},
+          {'Description': 'for AMD64/EM64T/x64',
+           'File Size': '1327160',
+           'GPG': 'SIG',
+           'MD5 Sum': '3cfdaf4c8d3b0475aaec12ba402d04d2',
+           'Operating System': 'Windows',
+           'Version': 'Windows x86-64 web-based installer'},
+          {'Description': 'n/a',
+           'File Size': '6395982',
+           'GPG': 'SIG',
+           'MD5 Sum': 'ed9a1c028c1e99f5323b9c20723d7d6f',
+           'Operating System': 'Windows',
+           'Version': 'Windows x86 embeddable zip file'},
+          {'Description': 'n/a',
+           'File Size': '25506832',
+           'GPG': 'SIG',
+           'MD5 Sum': 'ebb6444c284c1447e902e87381afeff0',
+           'Operating System': 'Windows',
+           'Version': 'Windows x86 executable installer'},
+          {'Description': 'n/a',
+           'File Size': '1298280',
+           'GPG': 'SIG',
+           'MD5 Sum': '779c4085464eb3ee5b1a4fffd0eabca4',
+           'Operating System': 'Windows',
+           'Version': 'Windows x86 web-based installer'}],
+ 'headers': [['Version',
+              'Operating System',
+              'Description',
+              'MD5 Sum',
+              'File Size',
+              'GPG']]}
+
+```
diff --git a/html2dict.py b/html2dict.py
deleted file mode 100755
index e5dd4a1..0000000
--- a/html2dict.py
+++ /dev/null
@@ -1,126 +0,0 @@
-from lxml import html
-
-
-class Html2Dict(object):
-
-    def __init__(self, html_string, url=None):
-
-        self.html_string = html_string
-        self._tree = html.fromstring(self.html_string)
-        self.url = url
-        if not self.url and self._tree.xpath('//link[@rel="canonical"]'):
-            self.url = self._tree.xpath('//link[@rel="canonical"]')[0].get('href')
-        self._table_presents = self._tree.xpath('//table')
-        self.tables = self._extract_tables()
-
-    def _extract_tables(self):
-
-        tables = {}
-
-        for ind_table, table in enumerate(self._table_presents):
-
-            my_header_rows = []
-            my_data_rows = []
-            t_body = table.xpath('*//tr') or table.xpath('tr')
-
-            for row in t_body:
-
-                if Html2Dict.is_header(row):
-                    my_header_rows.append(row)
-                else:
-                    my_data_rows.append(row)
-
-            tables["table_{}".format(ind_table)] = {
-                "header_rows" : my_header_rows,
-                "data_rows": my_data_rows,
-            }
-        return tables
-
-    @staticmethod
-    def is_header(row):
-
-        if not row.xpath('*'):
-            return False
-
-        for elem in row.xpath('*'):
-
-            if not elem.tag == 'th':
-                return False
-
-        return True
-
-    @staticmethod
-    def get_text_content(cell, is_header=False):
-
-        # base case
-        colspan = int(cell.attrib.get('colspan', 1))
-        # is_header = True if cell.tag == 'th' else False
-        if (colspan > 1 or cell.attrib.get('Html2Dict_merged') == "True") and is_header:
-                cell.attrib['Html2Dict_merged'] = "True"
-                cell.attrib['colspan'] = str(colspan - 1)
-                next_cell_below = cell.getparent().getnext()[0]
-                cell.getparent().getnext().remove(next_cell_below)
-                cell_text = " ".join([i for i in cell.itertext() if i not in ('\\n',)]).strip() or "n/a"
-                cell_text = "/".join([
-                    cell_text,
-                    Html2Dict.get_text_content(cell=next_cell_below, is_header=True)
-                ])
-                return cell_text
-        return " ".join([i for i in cell.itertext() if i not in ('\\n',)]).strip() or "n/a"
-
-    @staticmethod
-    def basic_table(table):
-
-        copy_table = table.copy()
-        header_rows = copy_table['header_rows']
-        data_rows = copy_table['data_rows']
-        tmp_headers = []
-        tmp_data_rows = []
-
-        for row in header_rows + data_rows:
-
-            tmp_row = []
-            for cell in row:
-
-                colspan = int(cell.attrib.get('colspan', 1))
-                for _ in range(colspan):
-                    if row in header_rows:
-                        cell_text = Html2Dict.get_text_content(cell=cell, is_header=True)
-                    else:
-                        cell_text = Html2Dict.get_text_content(cell=cell)
-
-                    tmp_row.append(cell_text)
-            if row in header_rows:
-                tmp_headers.append(tmp_row)
-            else:
-                tmp_data_rows.append(tmp_row)
-        if not tmp_headers:
-            tmp_headers = [None]
-        return {'headers': tmp_headers[0], 'data_rows': tmp_data_rows}
-
-    def basic_tables(self):
-
-        my_basic_tables = {}
-        for my_table in self.tables:
-            try:
-                my_table_basic = Html2Dict.basic_table(self.tables[my_table])
-            except Exception as e:
-                error = """
-                    An error occured with {0}:
-                    {1}
-                    *****************
-                    Proceeding with next table
-                """.format(my_table, e)
-                print(error)
-                continue
-            headers = my_table_basic.get('headers')
-            my_basic_tables[my_table] = tuple(dict(zip(headers, row)) if headers else row for row in my_table_basic.get('data_rows'))
-
-        return my_basic_tables
-
-    def rich_tables(self):
-
-        raise NotImplementedError('This feature is coming soon.')
-
-if __name__ == '__main__':
-    pass
diff --git a/html2dict/__init__.py b/html2dict/__init__.py
new file mode 100644
index 0000000..e69de29
diff --git a/html2dict/base_extractor.py b/html2dict/base_extractor.py
new file mode 100644
index 0000000..79b5715
--- /dev/null
+++ b/html2dict/base_extractor.py
@@ -0,0 +1,126 @@
+from lxml import html
+import requests
+from html2dict.resources import *
+
+
+__all__ = [
+    'TableExtractor',
+    'Table',
+    'get_text_content',
+    'is_header'
+]
+
+class TableExtractor(object):
+    """Html to dictionaries extractor class
+
+    This is the skeleton Extractor class.
+
+    Attributes:
+        html_string (str): String representation of an html.
+        url (str): Url of the website you are parsing
+        raw_tables (:obj:`dict` of :obj:`Table`): dict of all the tables
+            present on the page as raw HTML data and headers (<td> & <th>).
+        _tree (:obj:`HtmlElement`): Html tree from the root of the
+            provided html_string.
+        _table_presents (:obj:`list` of :obj:`HtmlElement`): List of tables
+            present in the html_string as html element <table>.
+
+    """
+
+    def __init__(self, html_string: str, url=None):
+        """__init__ method.
+
+        Args:
+            html_string (str): String representation of an html.
+            url (str, optional): Url of the website you are parsing.
+
+        Notes:
+            It is not recommended to instantiate this class directly. Use
+            one of the provided classmethods instead.
+
+        """
+
+        self.html_string = html_string
+        self._tree = html.fromstring(self.html_string)
+        self.url = url
+        if not self.url and self._tree.xpath('//link[@rel="canonical"]'):
+            self.url = self._tree.xpath('//link[@rel="canonical"]')[0].get('href')
+        self._table_presents = self._tree.xpath('//table')
+        self.raw_tables = self._extract_raw_tables()
+
+    def _extract_raw_tables(self):
+        """Hidden method to initialize the self.raw_tables attribute.
+
+        Iterates over the tables in self._table_presents and returns a
+        dict of the extracted tables.
+
+        Returns:
+            dict: All the tables found in the html string as a dictionary
+                of Table object with raw HTML elements for data and
+                headers.
+
+        """
+
+        tables = {}
+
+        for ind_table, table in enumerate(self._table_presents):
+            my_table = Table.from_html_element(
+                table=table,
+                table_name=f"table_{ind_table}",
+                caption_name_overwrite=True,
+            )
+
+            tables[my_table.name] = my_table
+
+        return tables
+
+    @classmethod
+    def from_html_string(cls, html_string, url=None):
+        """Instantiate an object from an html string.
+
+        Args:
+            html_string (str): String representation of an html
+            url (str, optional): Url of the website the string is coming
+                from. Default to None
+
+        Returns:
+            TableExtractor: The newly created TableExtractor
+
+        """
+
+        return cls(html_string=html_string, url=url)
+
+    @classmethod
+    def from_html_file(cls, html_file, url=None):
+        """Instantiate an object from an html file.
+
+        Args:
+            html_file (str): Relative or absolute path to an html file.
+            url (str, optional): Url of the website the file is coming
+                from. Default to None.
+
+        Returns:
+            TableExtractor: The newly created TableExtractor
+
+        """
+
+        with open(html_file, 'r') as infile:
+            html_string = infile.read()
+
+        return cls(html_string=html_string, url=url)
+
+    @classmethod
+    def from_url(cls, url, **kwargs):
+        """Instantiate an object from a url.
+
+        Args:
+            url (str): Url of the website you are parsing.
+
+        Returns:
+            TableExtractor: The newly created TableExtractor
+
+        """
+
+        html_string = requests.get(url=url, **kwargs).text
+        return cls(html_string=html_string, url=url)
+
diff --git a/html2dict/extractors.py b/html2dict/extractors.py
new file mode 100644
index 0000000..5ba5a93
--- /dev/null
+++ b/html2dict/extractors.py
@@ -0,0 +1,132 @@
+from html2dict.base_extractor import *
+
+
+class BasicTableExtractor(TableExtractor):
+    """Basic tables extractor.
+
+    Attributes:
+        html_string (str): String representation of an html.
+        url (str): Url of the website you are parsing
+        raw_tables (:obj:`dict` of :obj:`Table`): dict of all the tables
+            present on the page as raw HTML data and headers (<td> & <th>).
+        basic_tables (:obj:`dict` of :obj:`Table`): dict of all the tables
+            present on the page as plaintext.
+        _tree (:obj:`HtmlElement`): Html tree from the root of the
+            provided html_string.
+        _table_presents (:obj:`list` of :obj:`HtmlElement`): List of tables
+            present in the html_string as html element <table>.
+
+    """
+
+    def __init__(self, html_string, url=None):
+        """__init__ method.
+
+        Args:
+            html_string (str): String representation of an html.
+            url (str, optional): Url of the website you are parsing.
+
+        """
+
+        super(BasicTableExtractor, self).__init__(html_string, url)
+        self.basic_tables = self.extract_basic_tables()
+
+    @staticmethod
+    def basic_table_parser(table: Table):
+        """ Transform a raw table to a slightly more advanced table.
+
+        Take a Table object containing raw HTML elements and extract
+        basic text data from it.
+
+        Args:
+            table (:obj:`Table`): A Table object containing data as HTML
+                elements.
+
+        Returns:
+            Table: A new Table object with its data represented in
+                plaintext.
+
+        """
+        header_rows = table.header_rows
+        data_rows = table.data_rows
+        tmp_header_rows = []
+        tmp_data_rows = []
+
+        for row in header_rows + data_rows:
+
+            tmp_row = []
+            for cell in row:
+
+                colspan = int(cell.attrib.get('colspan', 1))
+                for _ in range(colspan):
+
+                    cell_text = get_text_content(cell=cell)
+                    tmp_row.append(cell_text)
+
+            if row in header_rows:
+                tmp_header_rows.append(tmp_row)
+            else:
+                tmp_data_rows.append(tmp_row)
+
+        if not tmp_header_rows:
+
+            tmp_data_rows = [
+                {f"col_{ind}": item for ind, item in enumerate(row)}
+                for row in tmp_data_rows
+            ]
+            tmp_header_rows = sorted(
+                {header for row in tmp_data_rows for header in row}
+            )
+
+        else:
+            tmp_data_rows = [
+                dict(zip(tmp_header_rows[0], row))
+                for row in tmp_data_rows
+            ]
+
+        return Table(data_rows=tmp_data_rows, header_rows=tmp_header_rows)
+
+    def extract_basic_tables(self):
+        """Basic tables parser.
+
+        Loop over the extracted raw_tables and pass them through the
+        basic_table_parser.
+
+        Returns:
+            dict: All the tables found in the html string as a dictionary
+                of Table object with data and headers as plaintext.
+
+        """
+
+        my_basic_tables = {}
+        for my_table in self.raw_tables:
+            try:
+                basic_table = BasicTableExtractor.basic_table_parser(self.raw_tables[my_table])
+            except Exception as e:
+                error = """
+                    An error occurred with {0}:
+                    {1}
+                    *****************
+                    Proceeding with next table
+                """.format(my_table, e)
+                print(error)
+                continue
+
+            my_basic_tables[my_table] = basic_table
+
+        return my_basic_tables
+
+
+class RichTableExtractor(TableExtractor):
+    """ Rich tables extractor.
+
+    Notes:
+        This class is not implemented yet but I am working on it.
+        The goal of it is to return more than just plaintext data.
+        For example if a cell contains an HTML list <li>, I should
+        retrieve it as a Python list or if a cell has a link, I should
+        retrieve something like [some_text](my_link).
+
+    """
+
+    def __init__(self):
+        raise NotImplementedError("Placeholder class. Feature coming soon..")
diff --git a/html2dict/resources.py b/html2dict/resources.py
new file mode 100644
index 0000000..45b0d11
--- /dev/null
+++ b/html2dict/resources.py
@@ -0,0 +1,155 @@
+class Table(object):
+    """Base table object.
+
+    A Table object holds information about a table, including its name,
+    headers row and data rows.
+
+    Attributes:
+        name (str): Name of the table.
+        header_rows (list): A list of headers. If the table doesn't
+            contain headers, default ones will be generated.
+        data_rows (list): Data rows of your table represented as a list
+            of dictionaries.
+        rows (dict): Headers and data rows together in a dictionary.
+
+    """
+
+    def __init__(self, data_rows: list, header_rows: list, name=None):
+        """__init__ method.
+
+        Args:
+            name (str, optional): Name of the table. Default to None.
+            header_rows (list): A list of headers.
+            data_rows (list): Data rows of your table represented as a
+                list of dictionaries.
+
+        """
+
+        self.name = name
+        self.header_rows = header_rows
+        self.data_rows = data_rows
+        self.rows = {
+            "headers": self.header_rows,
+            "data": self.data_rows,
+        }
+
+    @classmethod
+    def from_html_element(cls, table, table_name=None, caption_name_overwrite=False):
+        """Classmethod to extract a table from a <table> HTML element.
+
+        This classmethod is used by the Extractor class to extract the
+        tables on a webpage.
+
+        Args:
+            table (:obj:`lxml.html.HtmlElement`): A <table> HTML element.
+            table_name (str, optional): A table name. Defaults to None.
+            caption_name_overwrite (bool, optional): If True and a table
+                caption is found, the caption is used as the name even
+                when a table name is provided. Defaults to False.
+
+        Returns:
+            Table: A Table object
+
+        """
+
+        header_rows = []
+        data_rows = []
+
+        # Use the caption as the name when asked to, or when no name was given.
+        if caption_name_overwrite or not table_name:
+            captions = table.xpath('caption')
+            if captions:
+                table_name = get_text_content(captions[0])
+
+        t_body = table.xpath('*//tr') or table.xpath('tr')
+
+        for row in t_body:
+
+            if is_header(row):
+                header_rows.append(row)
+            else:
+                data_rows.append(row)
+
+        return cls(
+            name=table_name,
+            data_rows=data_rows,
+            header_rows=header_rows,
+        )
+
+    def search(self, query, column=None):
+        """Search a value in your data rows.
+
+        Search if a value is present anywhere in your table or in a
+        specific column.
+
+        Args:
+            query : Value to search
+            column (str, optional): Column name. Search only in this
+                column. Default to None.
+
+        Returns:
+            list: Rows containing the searched value.
+
+        """
+
+        if column:
+
+            try:
+                return [row for row in self.data_rows if query == row[column]]
+            except KeyError:
+                raise KeyError(
+                    f"'{column}' is not a valid header. Valid headers are {self.header_rows}"
+                )
+
+        return [row for row in self.data_rows if query in row.values()]
+
+
+def is_header(row):
+    """Check if an html row is a header.
+
+    Args:
+        row (HtmlElement): An html row <tr>.
+
+    Returns:
+        bool: True if the row is only made of 'header' cells (<th>).
+
+    """
+
+    return all([elem.tag == 'th' for elem in row.xpath('*')] or [False])
+
+
+def get_text_content(cell):
+    """Get the text content of an html cell
+
+    Extract the text content of a cell in a html table. If the cell is part of a
+    merged header, join its text with a "/" with the text of the cell below it.
+
+    Args:
+        cell (HtmlElement): Html cell <td> or <th>
+
+    Returns:
+        str: Text content at the root of an html cell.
+
+    """
+
+    colspan = int(cell.attrib.get('colspan', 1))
+
+    cell_is_header = cell.tag == 'th'
+    cell_text = " ".join(
+        [i for i in cell.itertext() if i not in ('\\n',)]).strip() or "n/a"
+
+    if (colspan > 1 or cell.attrib.get('Html2Dict_merged') == "True") and cell_is_header:
+
+        cell.attrib['Html2Dict_merged'] = "True"
+        cell.attrib['colspan'] = str(colspan - 1)
+        next_cell_below = cell.getparent().getnext()[0]
+        cell.getparent().getnext().remove(next_cell_below)
+
+        cell_text = "/".join([
+            cell_text,
+            get_text_content(cell=next_cell_below)
+        ])
+
+        return cell_text
+
+    return cell_text
diff --git a/requirements.txt b/requirements.txt
index 86c871e..7cdc7ea 100755
--- a/requirements.txt
+++ b/requirements.txt
@@ -1 +1,2 @@
-lxml
\ No newline at end of file
+lxml
+requests
\ No newline at end of file
diff --git a/setup.py b/setup.py
index 87903e2..495633f 100644
--- a/setup.py
+++ b/setup.py
@@ -6,10 +6,11 @@
 EMAIL = 'benjamin.souty@gmail.com'
 AUTHOR = 'B-Souty'
 REQUIRES_PYTHON = '>=3.6.0'
-VERSION = '0.1.1'
+VERSION = '0.2'
 
 REQUIRED = [
     'lxml',
+    'requests'
 ]
 
 try:
@@ -29,7 +30,7 @@
     author_email=EMAIL,
     python_requires=REQUIRES_PYTHON,
     url=URL,
-    py_modules=['html2dict'],
+    packages=['html2dict'],
     install_requires=REQUIRED,
     license='MIT',
     classifiers=[
diff --git a/tests/__init__.py b/tests/__init__.py
new file mode 100644
index 0000000..e69de29
diff --git a/tests/simple_server.py b/tests/simple_server.py
new file mode 100755
index 0000000..49dd543
--- /dev/null
+++ b/tests/simple_server.py
@@ -0,0 +1,40 @@
+#! /usr/bin/env python3
+
+import sys
+from http.server import BaseHTTPRequestHandler, HTTPServer
+
+
+TEST_HTML_FILE = sys.argv[1]
+TEST_HTML_STRING = open(TEST_HTML_FILE, 'r').read()
+
+
+class HTTPServer_RequestHandler(BaseHTTPRequestHandler):
+
+    # GET
+    def do_GET(self):
+        # Send response status code
+        self.send_response(200)
+
+        # Send headers
+        self.send_header('Content-type', 'text/html')
+        self.end_headers()
+
+        # Send message back to client
+        message = TEST_HTML_STRING
+        # Write content as utf-8 data
+        self.wfile.write(bytes(message, "utf8"))
+        return
+
+
+def start_server():
+
+    print('starting server...')
+    server_address = ('127.0.0.1', 8081)
+    httpd = HTTPServer(server_address, HTTPServer_RequestHandler)
+    print('running server...')
+    httpd.serve_forever()
+
+
+
+if __name__ == "__main__":
+    start_server()
diff --git a/tests/test_data.json b/tests/test_data.json
new file mode 100644
index 0000000..c4711ab
--- /dev/null
+++ b/tests/test_data.json
@@ -0,0 +1 @@
+[[{"col_0": "a", "col_1": "b", "col_2": "c"}, {"col_0": "1", "col_1": "2", "col_2": "3"}, {"col_0": "x", "col_1": "y", "col_2": "z"}], [{"col_0": "Fruit", "col_1": "Color", "col_2": "Taste"}, {"col_0": "Strawberry", "col_1": "Red", "col_2": "Good"}, {"col_0": "Pear", "col_1": "Green", "col_2": "Bad"}], [{"Fruit": "Strawberry", "Color": "Red"}, {"Fruit": "Pear", "Color": "Green"}], [{"Fruit/Name": "Strawberry", "Fruit/Color": "Red"}, {"Fruit/Name": "Pear", "Fruit/Color": "Green"}], [{"Fruit/Name": "Strawberry", "Fruit/Color": "Red", "Vegetable/Name": "Brocoli", "Vegetable/Color": "Green", "Nut": "Cashew"}, {"Fruit/Name": "Pear", "Fruit/Color": "Green", "Vegetable/Name": "Radish", "Vegetable/Color": "Red", "Nut": "Peanut"}], [{"col_0": "a", "col_1": "b", "col_2": "c"}, {"col_0": "1", "col_1": "2", "col_2": "3"}]]
\ No newline at end of file
diff --git a/tests/test_html2dict.py b/tests/test_html2dict.py
new file mode 100644
index 0000000..b8f9fd3
--- /dev/null
+++ b/tests/test_html2dict.py
@@ -0,0 +1,39 @@
+from html2dict.extractors import BasicTableExtractor
+import subprocess
+import json
+import os
+
+
+TEST_DATA_FOLDER = "tests"
+TEST_HTML_FILE = os.path.join(TEST_DATA_FOLDER, "test_tables.html")
+TEST_HTML_STRING = open(TEST_HTML_FILE, 'r').read()
+
+SIMPLE_SERVER = os.path.join(TEST_DATA_FOLDER, 'simple_server.py')
+subprocess.Popen([SIMPLE_SERVER, TEST_HTML_FILE])
+
+VALIDATION_FILE = os.path.join(TEST_DATA_FOLDER, 'test_data.json')
+VALIDATION_DATA = json.load(open(VALIDATION_FILE, 'r'))
+
+
+def test_basic_table_from_string():
+
+    test_html = BasicTableExtractor.from_html_string(TEST_HTML_STRING)
+    data_rows = [test_html.basic_tables[table].data_rows for table in test_html.basic_tables]
+
+    assert data_rows == VALIDATION_DATA
+
+
+def test_basic_table_from_file():
+
+    test_html = BasicTableExtractor.from_html_file(TEST_HTML_FILE)
+    data_rows = [test_html.basic_tables[table].data_rows for table in test_html.basic_tables]
+
+    assert data_rows == VALIDATION_DATA
+
+
+def test_basic_table_from_url():
+
+    test_html = BasicTableExtractor.from_url(url="http://127.0.0.1:8081")
+    data_rows = [test_html.basic_tables[table].data_rows for table in test_html.basic_tables]
+
+    assert data_rows == VALIDATION_DATA
diff --git a/tests/test_tables.html b/tests/test_tables.html
new file mode 100644
index 0000000..94eb41b
--- /dev/null
+++ b/tests/test_tables.html
@@ -0,0 +1,161 @@
+<!DOCTYPE html>
+<html lang="en">
+<head>
+    <meta charset="UTF-8">
+    <title>Title</title>
+</head>
+<body>
+
+<!-- Simplest table, no caption -->
+<table border="1">
+  <tr>
+    <td>a</td>
+    <td>b</td>
+    <td>c</td>
+  </tr>
+  <tr>
+    <td>1</td>
+    <td>2</td>
+    <td>3</td>
+  </tr>
+  <tr>
+    <td>x</td>
+    <td>y</td>
+    <td>z</td>
+  </tr>
+</table>
+
+<table border="1">
+    <caption>Basic table, NO headers</caption>
+  <tr>
+    <td>Fruit</td>
+    <td>Color</td>
+    <td>Taste</td>
+  </tr>
+  <tr>
+    <td>Strawberry</td>
+    <td>Red</td>
+    <td>Good</td>
+  </tr>
+  <tr>
+    <td>Pear</td>
+    <td>Green</td>
+    <td>Bad</td>
+  </tr>
+</table>
+
+
+<table border="1">
+    <caption>Basic table, With headers</caption>
+  <tr>
+    <th>Fruit</th>
+    <th>Color</th>
+  </tr>
+  <tr>
+    <td>Strawberry</td>
+    <td>Red</td>
+  </tr>
+  <tr>
+    <td>Pear</td>
+    <td>Green</td>
+  </tr>
+</table>
+
+
+
+<table border="1">
+  <caption>Basic table, with merged headers</caption>
+  <tr>
+    <th colspan="2">Fruit</th>
+  </tr>
+  <tr>
+    <th>Name</th>
+    <th>Color</th>
+  </tr>
+  <tr>
+    <td>Strawberry</td>
+    <td>Red</td>
+  </tr>
+  <tr>
+    <td>Pear</td>
+    <td>Green</td>
+  </tr>
+</table>
+
+<table border="1">
+  <caption>Complex table with multiple merged headers</caption>
+  <tr>
+    <th colspan="2">Fruit</th>
+    <th colspan="2">Vegetable</th>
+    <th rowspan="2">Nut</th>
+  </tr>
+  <tr>
+    <th>Name</th>
+    <th>Color</th>
+    <th>Name</th>
+    <th>Color</th>
+  </tr>
+  <tr>
+    <td>Strawberry</td>
+    <td>Red</td>
+    <td>Brocoli</td>
+    <td>Green</td>
+    <td>Cashew</td>
+  </tr>
+  <tr>
+    <td>Pear</td>
+    <td>Green</td>
+    <td>Radish</td>
+    <td>Red</td>
+    <td>Peanut</td>
+  </tr>
+</table>
+
+<table border="1">
+    <caption>Complex table, with multiple merged headers on multiple levels.</caption>
+  <tr>
+    <th colspan="4">Food</th>
+    <th rowspan="3">Simple unmerged header</th>
+  </tr>
+  <tr>
+    <th colspan="2">Fruit</th>
+    <th colspan="2">Vegetable</th>
+  </tr>
+  <tr>
+    <th>Name</th>
+    <th>Color</th>
+    <th>Name</th>
+    <th>Color</th>
+  </tr>
+  <tr>
+    <td>Strawberry</td>
+    <td>Red</td>
+    <td>Brocoli</td>
+    <td>Green</td>
+    <td>'row #1'</td>
+  </tr>
+  <tr>
+    <td>Pear</td>
+    <td>Green</td>
+    <td>Radish</td>
+    <td>Red</td>
+    <td>'row #2'</td>
+  </tr>
+</table>
+
+<table border="1">
+  <tr>
+    <td>a</td>
+    <td>b</td>
+    <td>c</td>
+  </tr>
+  <tr>
+    <td>1</td>
+    <td>2</td>
+    <td>3</td>
+  </tr>
+</table>
+
+
+</body>
+</html>
\ No newline at end of file