Skip to content

Instantly share code, notes, and snippets.

View abhineetmittal's full-sized avatar

Abhineet Mittal abhineetmittal

View GitHub Profile
<?php
// The function to count words in Unicode strings
function count_unicode_words( $unicode_string ){
// First remove all the punctuation marks & digits
$unicode_string = preg_replace('/[[:punct:][:digit:]]/', '', $unicode_string);
// Now replace all the whitespaces (tabs, new lines, multiple spaces) by single space
$unicode_string = preg_replace('/[[:space:]]/', ' ', $unicode_string);
// The words are now separated by single spaces and can be splitted to an array
// I have included \n\r\t here as well, but only space will also suffice
$words_array = preg_split( "/[\n\r\t ]+/", $unicode_string, 0, PREG_SPLIT_NO_EMPTY );
@abhineetmittal
abhineetmittal / str_word_count.php
Last active January 23, 2017 02:53
str_word_count from PHP
<?php
$string = 'This is a sample string.';
$word_count = str_word_count ( string $string );
echo $word_count;
?>