Abstract
Information and communication technologies enable people to access to various documentations and information. Huge documents and information in the Internet or storage disks have made search time more important. Especially as the volume size and the number of documents on the Internet increase, string search times and costs increase have become big burden to search service. But, most of string searching algorithms have not consider lexical structures nor vowels’ occurrence frequency. Formal documents (articles, news, novels, etc.) have important characteristic that is ‘well-formed written’ English. And words of formal documents have ‘limit number of words and alphabets’ that are listed in a dictionary. The ‘limit number of words and alphabets’ has predictable occurrence probability in real world’s documentations.
We try to use the alphabet occurrence probability as first search condition. We analyze all the words in the dictionaries (dictionary of free dictionary project, scrabblehelper – Revision 20, Winedit dictionary) and calculate each alphabet occurrence probability of repeated vowels, repeated consonants, not-repeated vowels and not-repeated consonants. In this paper, we define and propose the search rules and string searching algorithm, based on occurrence frequency and patterns of vowels and consonants. We use only the occurrence patterns and repeated positions of vowel and consonant in a text. Therefore, in the real world, proposed string searching algorithm (OFRP algorithm) is based on occurrence frequency and repetition pattern of vowels and consonants and is usefully and effectively applied to string search service and web search engine.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Boyer, R.S., Moore, J.S.: A fast string searching algorithm. Commun. ACM 20, 762–772 (1977)
Knuth, D., Moris, J.H., Pratt, V.: Fast string searching in strings. SIAM J. Comput. 6(2), 323–350 (1977)
Baeza-Yates, R., Gonnet, G.H.: A new approach to text searching. Commun. ACM 35, 74–82 (1992)
Baase, S., Van Gelder, A.: Computer Algorithms: Introduction to Design and Analysis. Addis-Edisson-Wesley Pub., Reading (1999)
Sedgewick, R.: Algorithms in C++. Addison-Wesley Pub., Reading (1998)
Chung, K.S., Yu, H.-C., Jin, S.H.: An efficient string searching algorithm based on vowel occurrence pattern. In: Park, J.J., Yang, L.T., Lee, C. (eds.) FutureTech 2011, Part II. CCIS, vol. 185, pp. 379–386. Springer, Heidelberg (2011)
Acknowledgements
This work was supported by 2014 Korea National Open University Research Fund.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Chung, K.S., Kim, S.Y., Yu, H.C. (2015). An Efficient String Searching Algorithm Based on Occurrence Frequency and Pattern of Vowels and Consonants in a Pattern. In: He, X., et al. Intelligence Science and Big Data Engineering. Big Data and Machine Learning Techniques. IScIDE 2015. Lecture Notes in Computer Science(), vol 9243. Springer, Cham. https://doi.org/10.1007/978-3-319-23862-3_27
Download citation
DOI: https://doi.org/10.1007/978-3-319-23862-3_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23861-6
Online ISBN: 978-3-319-23862-3
eBook Packages: Computer ScienceComputer Science (R0)