utils
some utility functions
df_to_md
df_to_md (df, title=None)
Converts pd.Dataframe to markdown
html_to_df
html_to_df (html_str:str)
Convert HTML to dataframe.
md_to_df
md_to_df (md_str:str)
Convert Markdown to dataframe.
segment
segment (text:str, unit:str='paragraph', maxchars:int=2048)
Segments text into a list of paragraphs or sentences depending on value of unit
(one of {'paragraph', 'sentence'}
. The maxchars
parameter is the maximum size of any unit of text.
split_list
split_list (input_list, chunk_size)
get_datadir
get_datadir ()
download
download (url, filename, verify=False)