International Journal of Image, Graphics and Signal Processing (IJIGSP)

ISSN: 2074-9074 (Print)

ISSN: 2074-9082 (Online)

DOI: https://doi.org/10.5815/ijigsp

Website: https://www.mecs-press.org/ijigsp

Published By: MECS Press

Frequency: 6 issues per year

Number(s) Available: 131


IJIGSP is committed to bridging the theory and practice of images, graphics, and signal processing. From innovative ideas to specific algorithms and full system implementations, IJIGSP publishes original, peer-reviewed, high-quality articles in the areas of images, graphics, and signal processing. IJIGSP is a well-indexed scholarly journal and an indispensable reference for people working at the cutting edge of image, graphics, and signal processing applications.

 

IJIGSP has been abstracted or indexed by several world-class databases: Scopus, Google Scholar, Microsoft Academic Search, CrossRef, Baidu Wenku, IndexCopernicus, IET Inspec, EBSCO, JournalSeek, ULRICH's Periodicals Directory, WorldCat, Scirus, Academic Journals Database, Stanford University Libraries, Cornell University Library, UniSA Library, CNKI Scholar, ProQuest, J-Gate, ZDB, BASE, OhioLINK, iThenticate, Open Access Articles, Open Science Directory, National Science Library of Chinese Academy of Sciences, The HKU Scholars Hub, etc.

Latest Issue

IJIGSP Vol. 16, No. 3, Jun. 2024

REGULAR PAPERS

Machine Learning Based Decision Support System for Coronary Artery Disease Diagnosis

By Sukru Alkan Muhammed Kursad Ucar

DOI: https://doi.org/10.5815/ijigsp.2024.03.01, Pub. Date: 8 Jun. 2024

Coronary artery disease (CAD) causes millions of deaths worldwide every year. As with any disease, the earliest possible diagnosis is important for heart diseases, which cause such a large number of deaths. Diagnosis has become more successful thanks to recent studies in medicine and rapid improvements in computer science. In this study, the goal is to employ machine learning methods to facilitate rapid disease diagnosis without the need to observe negative outcomes. The dataset utilized in this study was obtained from an IEEE DataPort data repository and consists of two classes. First, new features were produced from the features in the dataset. Then, datasets consisting of multiple features were created using feature selection algorithms. Three models, specifically Support Vector Machines (SVM), the k-Nearest Neighbor algorithm (kNN), and Decision Tree ensembles (EDT), were trained using custom datasets. A hybrid model was also created using these models, and the performances of all models were compared. The best performance was obtained from SVM; its performance criteria, in the order accuracy, sensitivity, specificity, F-measure, Kappa, and AUC, are 97.82%, 0.97, 0.99, 0.98, 0.96, and 0.98. In summary, the evaluation of the constructed models demonstrates that the recommended models could aid in the swift prediction of coronary artery disease in everyday life.
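As an illustration of how criteria such as accuracy, sensitivity, specificity, F-measure, and Kappa relate to each other, the following minimal sketch derives them from the four confusion-matrix counts of a two-class problem. The function name and the example counts are hypothetical and are not taken from the paper.

```python
def binary_metrics(tp, fp, tn, fn):
    """Compute common two-class metrics from confusion-matrix counts."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    sensitivity = tp / (tp + fn)   # recall on the positive class
    specificity = tn / (tn + fp)   # recall on the negative class
    precision = tp / (tp + fp)
    f_measure = 2 * precision * sensitivity / (precision + sensitivity)
    # Cohen's kappa: observed agreement corrected for chance agreement.
    p_chance = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / total ** 2
    kappa = (accuracy - p_chance) / (1 - p_chance)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f_measure": f_measure,
        "kappa": kappa,
    }


# Made-up counts, used only to exercise the formulas.
metrics = binary_metrics(tp=45, fp=5, tn=40, fn=10)
```

AUC is omitted here because it requires ranked classifier scores rather than a single confusion matrix.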

Enhanced Deep Learning Algorithm for Object Detection in the Agriculture Field

By Priya Singh Rajalakshmi Krishnamurthi

DOI: https://doi.org/10.5815/ijigsp.2024.03.02, Pub. Date: 8 Jun. 2024

Agriculture is one of the most prominent industries, securing food supply and employment throughout the globe thanks to large land availability and suitable atmospheric conditions. Nowadays, however, the security of the available resources is a major concern because of damage caused by objects inside the agricultural field. There are many traditional algorithms for object detection, but they are not very effective in real-time environments. Hence, a deep learning-based object detection model is developed by enhancing YOLOv3. First, k-means clustering was used to identify clusters, and then the convolutional neural network layers were modified. Additionally, the batch and subdivision values of the original YOLOv3 model were optimized under the Darknet-53 framework. The architecture was also configured to detect eleven classes of objects, ensuring that the model could identify a broad range of objects. The experimental results demonstrate that the Delta model achieved an increase in accuracy from 75.19% to 95.86%. In addition, the model outperformed other models in terms of precision (97%), recall (96%), F1 score (96%), IoU (80.81%), and mAP (95.86%). Based on these findings, it can be concluded that the Delta model offers superior detection capabilities and lower computational complexity compared to conventional methods used in the agriculture field.
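The IoU figure quoted above measures the overlap between a predicted and a ground-truth bounding box. A minimal sketch of that computation follows, assuming axis-aligned boxes given as (x1, y1, x2, y2) corner coordinates; the box format and function name are assumptions made for illustration, not details from the paper.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap rectangle, clamped to zero when disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


# Two 2x2 boxes that overlap in a 1x1 square: IoU = 1 / (4 + 4 - 1).
overlap = iou((0, 0, 2, 2), (1, 1, 3, 3))
```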

Infrared and Visible Image Fusion (IVF) Using Latent Low-Rank Representation and Deep Feature Extraction Network

By Teku Sandhya Kumari Gundala Sujatha Boddeda Sravya Hari Jyothula

DOI: https://doi.org/10.5815/ijigsp.2024.03.03, Pub. Date: 8 Jun. 2024

The combination of visible and infrared images from different sensors can provide a more detailed and informative image. Visible images capture environmental details and texture, while infrared sensors can detect thermal radiation and create grayscale images that have high contrast. These images are useful for distinguishing between target and background in challenging conditions, such as at night or in inclement weather. When these two types of images are fused, they create high-contrast images with rich texture and target details. In this paper, an effective image fusion technique has been developed that uses the Latent Low-Rank Representation (LatLRR) method, which decomposes the source images into latent low-rank and salient parts to capture common and unique information, respectively. The proposed network design incorporates the dense network and VGG-19 architectures for deep feature extraction of the latent low-rank and salient parts, which minimizes distortion while maintaining crucial texture and details in the output. Weighted average fusion strategies are used to combine these latent low-rank and salient parts, and the resulting fused features are used for feature reconstruction to generate a fused low-rank part and a fused salient part. These parts are integrated to yield a fused image output. The proposed approach outperforms existing state-of-the-art methods on both visual characteristics and objective evaluation metrics.

Face Mask Recognition by the Viola-Jones Method Using Fuzzy Logic

By Serhiy Balovsyak Oleksandr Derevyanchuk Vasyl Kovalchuk Hanna Kravchenko Maryna Kozhokar

DOI: https://doi.org/10.5815/ijigsp.2024.03.04, Pub. Date: 8 Jun. 2024

This work presents a software implementation of a face mask recognition system using the Viola-Jones method and fuzzy logic. The initial images are read from digital video cameras or from graphic files.
Detection of face, eye and mouth positions in images is performed using appropriate Haar cascades. The confidence of detecting a face and its features is determined based on the set parameters of Haar cascades.
Face recognition in the image is performed based on the results of face and eye detection by means of fuzzy logic using the Mamdani knowledge base. Fuzzy sets are described by triangular membership functions. Face mask recognition is performed based on the results of face recognition and mouth detection by means of fuzzy logic using the Mamdani knowledge base. Jointly considering the results of different Haar cascades for face, eye, and mouth detection made it possible to increase the accuracy of face and face mask recognition.
The software implementation of the system was made in Python using the OpenCV and Scikit-Fuzzy libraries and the Google Colab cloud platform. The developed recognition system will allow monitoring the presence of people without masks in vehicles, in the premises of educational institutions, shopping centers, etc. In educational institutions, a face mask recognition system can be useful for determining the number of people in the premises and for analyzing their behavior.
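To make the fuzzy-logic step more concrete, the sketch below shows a triangular membership function and the firing strength of a single Mamdani-style rule in plain Python. The fuzzy set boundaries and the rule itself ("face confidence is high AND mouth confidence is low") are hypothetical placeholders; the paper's actual knowledge base and its Scikit-Fuzzy implementation are not reproduced here.

```python
def triangular(x, a, b, c):
    """Triangular membership: 0 at a and c, rising to 1 at the peak b."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def high(confidence):
    """Hypothetical fuzzy set 'high' for a detection confidence in [0, 1]."""
    return triangular(confidence, 0.5, 1.0, 1.5)


def low(confidence):
    """Hypothetical fuzzy set 'low' for a detection confidence in [0, 1]."""
    return triangular(confidence, -0.5, 0.0, 0.5)


def mask_rule_strength(face_conf, mouth_conf):
    """Firing strength of an assumed rule:
    IF face is high AND mouth is low THEN mask is present.
    Mamdani inference uses min for the fuzzy AND."""
    return min(high(face_conf), low(mouth_conf))


strength = mask_rule_strength(face_conf=0.9, mouth_conf=0.1)
```

A full Mamdani system would aggregate several such rules and defuzzify the result; only the rule-evaluation step is sketched.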

Human Abnormal Activity Recognition from Video Using Motion Tracking

By Manoj Kumar Anoop Kumar Patel Mantosh Biswas Sandeep Singh Sengar

DOI: https://doi.org/10.5815/ijigsp.2024.03.05, Pub. Date: 8 Jun. 2024

The detection of violent behavior in the public environment using video content has become increasingly important in recent years due to the rise of violent incidents and the ease of sharing and disseminating video content through social media platforms. Efficient and effective techniques for detecting violent behavior in video content can assist authorities with identifying potential hazards, preventing crimes, and promoting public safety. Violence detection can also help to mitigate the psychological damage caused by viewing violent content, particularly in vulnerable populations such as infants and victims of violence. We propose an algorithm that calculates new descriptors using the magnitude and orientation of optical flow (MOOF) in the video. Descriptors are extracted from MOOF based on four binary histograms, each obtained by applying various weighted thresholds. These descriptors are used to train a Support Vector Machine (SVM) that classifies the video as violent or nonviolent. The proposed algorithm has been tested on the publicly available Hockey Fight and Violent Flow datasets. The results demonstrate that the proposed descriptors outperform the state-of-the-art algorithms, with accuracies of 91.5% and 78.5% on the Hockey Fight and Violent Flow datasets, respectively.
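The abstract does not spell out how the MOOF descriptors are built, so the following is only a rough sketch of the underlying idea: convert each optical-flow vector (u, v) into a magnitude and an orientation, accumulate a magnitude-weighted orientation histogram, and binarize it with a threshold. The bin count, the threshold, and all function names are assumptions.

```python
import math


def flow_magnitude_orientation(u, v):
    """Return (magnitude, orientation in degrees within [0, 360)) of a flow vector."""
    magnitude = math.hypot(u, v)
    orientation = math.degrees(math.atan2(v, u)) % 360.0
    return magnitude, orientation


def orientation_histogram(flow_vectors, n_bins=4):
    """Accumulate flow magnitudes into equally wide orientation bins."""
    hist = [0.0] * n_bins
    bin_width = 360.0 / n_bins
    for u, v in flow_vectors:
        magnitude, orientation = flow_magnitude_orientation(u, v)
        hist[int(orientation // bin_width) % n_bins] += magnitude
    return hist


def binarize(hist, threshold):
    """Turn a histogram into a binary one: 1 where the bin exceeds the threshold."""
    return [1 if value > threshold else 0 for value in hist]


# Four diagonal flow vectors, one per quadrant.
hist = orientation_histogram([(1, 1), (-1, 1), (-1, -1), (1, -1)])
```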

Segmentation of Mammogram Images Using Optimized Kernel Fuzzy AGCWD Based Level Set Method

By Azmeera Srinivas V.V.K.D.V.Prasad B. Leela Kumari

DOI: https://doi.org/10.5815/ijigsp.2024.03.06, Pub. Date: 8 Jun. 2024

Image enhancement technology is widely used to improve images and help radiologists make more accurate cancer diagnoses. This research work presents an integrated approach for contrast enhancement followed by segmentation of breast cancer in mammogram images. The proposed method makes effective use of three algorithms: the Differential Evolution (DE) algorithm, Kernel-Based Fuzzy C-Means (KFCM) clustering, and the Cuckoo Search Optimization (CSO) algorithm. The integrated approach introduced here is called the Optimized Kernel Fuzzy Adaptive Gamma Correction with Weighting Distribution (OKF-AGCWD) based Level Set Method. Performance metrics such as F1 score, sensitivity, specificity, and accuracy are used to assess the quality of the different methods. The simulation was performed on 16 distinct images from the RIDER mammography database. The experimental results of OKF-AGCWD were compared with existing level set approaches, namely Image and Vision Computing (IVC) 2010, IVC 2013, and Expert Systems with Applications (ESA) 2021, and the proposed method performs better than these. The proposed OKF-AGCWD with the contextual level set method (CLSM) minimizes the boundary leakage problem in segmented mammogram images and improves segmentation accuracy.

Block-based Local Binary Patterns for Distant Iris Recognition Using Various Distance Metrics

By Arnab Mukherjee Md. Zahidul Islam Raju Roy Lasker Ershad Ali

DOI: https://doi.org/10.5815/ijigsp.2024.03.07, Pub. Date: 8 Jun. 2024

Iris recognition has become a promising biometric for human identification and authentication. Feature extraction from near-infrared (NIR) iris images captured in less-constrained environments makes it rather challenging to identify an individual accurately. This paper extends a texture descriptor to represent the local spatial patterns. The iris texture is first divided into several blocks, from which the shape and appearance of intrinsic iris patterns are extracted with block-based Local Binary Patterns (LBPb). The concepts of uniform and rotation-invariant patterns are employed to reduce the length of the feature space. Additionally, the simplicity of the image descriptor allows for very fast feature extraction. Recognition is performed using a supervised machine learning classifier with various distance metrics in the extracted feature space as a dissimilarity measure. The proposed approach effectively deals with lighting variations, blurred focus, misaligned images, and elastic deformation of iris textures. Extensive experiments are conducted on the largest publicly accessible CASIA-v4 distance image database. Some statistical measures are computed as performance indicators for the validation of classification outcomes. The area under the Receiver Operating Characteristic (ROC) curve is used to compare the diagnostic ability of the classifier for the LBP and its extensions. The experimental results suggest that the LBPb is more effective than the rotation-invariant and uniform rotation-invariant variants of local binary patterns for distant iris recognition. The Bray-Curtis distance metric provides the highest accuracy compared to other distance metrics and competitive methods.
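The two building blocks named above, a per-block LBP histogram and the Bray-Curtis distance, can be sketched in a few lines. This is the basic 8-neighbour LBP without the uniform or rotation-invariant reductions, and the neighbour ordering and function names are assumptions; it is not the authors' LBPb implementation.

```python
# Offsets (row, col) of the 8 neighbours, clockwise from the top-left pixel.
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_code(image, r, c):
    """Basic 8-bit LBP code: bit i is set when neighbour i >= centre pixel."""
    center = image[r][c]
    code = 0
    for bit, (dr, dc) in enumerate(OFFSETS):
        if image[r + dr][c + dc] >= center:
            code |= 1 << bit
    return code


def lbp_histogram(block):
    """256-bin histogram of LBP codes over the interior pixels of one block."""
    hist = [0] * 256
    for r in range(1, len(block) - 1):
        for c in range(1, len(block[0]) - 1):
            hist[lbp_code(block, r, c)] += 1
    return hist


def bray_curtis(p, q):
    """Bray-Curtis dissimilarity: sum|p - q| / sum|p + q| (0 when both are empty)."""
    denom = sum(abs(a + b) for a, b in zip(p, q))
    return sum(abs(a - b) for a, b in zip(p, q)) / denom if denom else 0.0


flat_block = [[5, 5, 5], [5, 5, 5], [5, 5, 5]]
peak_block = [[0, 0, 0], [0, 9, 0], [0, 0, 0]]
distance = bray_curtis(lbp_histogram(flat_block), lbp_histogram(peak_block))
```

In a block-based scheme the per-block histograms would be concatenated into one feature vector before the distance is computed.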

Enhancing Data Processing Methods to Improve UAV Positioning Accuracy

By Igor Zhukov Bogdan Dolintse Sergii Balakin

DOI: https://doi.org/10.5815/ijigsp.2024.03.08, Pub. Date: 8 Jun. 2024

UAVs play a crucial role in various applications, but their effective operation relies on precise and reliable positioning systems. Traditional positioning systems face challenges in delivering the required accuracy due to factors such as signal degradation, environmental interference, and sensor limitations. This study proposes the LeGNSS positioning subsystem, which integrates low Earth orbit (LEO) satellite network data with GPS and MEMS-based inertial systems, to enhance UAV positioning accuracy and reliability. The LeGNSS system presented in this research employs sophisticated algorithms for optimal processing and filtering of data from various sources. Simulation results demonstrate a 9.02% improvement in positioning estimation accuracy compared to classic GPS/INS integration and a 26.4% improvement compared to the onboard GPS receiver. The integration of inertial and satellite positioning, corrective mechanisms, and optimized filtration has resulted in improved precision of trajectory computations, attenuation of positioning signal anomalies, and a significant decrease in INS inaccuracies. The proposed LeGNSS positioning system presents a solution for precise and reliable UAV positioning in a wide range of applications. By leveraging the unique advantages of LEO satellite networks and advanced data fusion techniques, this system pushes the boundaries of UAV positioning capabilities. The novel integration of multiple data sources and the use of adaptive error correction algorithms set a new standard for accuracy and robustness, paving the way for unprecedented capabilities in fields such as aerial surveying, precision agriculture, infrastructure monitoring, and emergency response. Analysing the impact of complex environmental factors on LeGNSS operation can provide insights into expanding the list of satellite systems or sensors to improve positioning accuracy, particularly in high-latitude regions.
The findings of this study contribute to improving the accuracy, reliability, and resilience of UAV positioning systems, with applications in scientific polar research, geomatics data gathering, and other domains. The LeGNSS system has the potential to become a key feature for the next generation of autonomous aerial vehicles, unlocking efficiency, safety, and innovation across industries.

Text Region Extraction: A Morphological Based Image Analysis Using Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2015.02.06, Pub. Date: 8 Jan. 2015

Image analysis belongs to the areas of computer vision and pattern recognition. These areas are also part of digital image processing, where researchers pay great attention to retrieving content information from various types of images with complex, low-contrast, or multi-spectral backgrounds. This content may take any form, such as texture data, shapes, and objects. Extracting text regions as content from an image is a class of problems in digital image processing applications that aims to provide necessary information widely used in many fields, such as medical imaging, pattern recognition, robotics, and artificial intelligent transport systems. Extracting text data has become a challenging task. Since text extraction is very useful for identifying and analyzing the whole information in an image, in this paper we propose a unified framework that combines morphological operations and Genetic Algorithms for extracting and analyzing text regions that may be embedded in an image with a variety of text properties: font, size, skew angle, distortion by slant and tilt, shape of the object the text is on, etc. We have evaluated the proposed method on gray-level image sets, made qualitative and quantitative comparisons with other existing methods, and concluded that the proposed method is better than the others.

Evolutionary Image Enhancement Using Multi-Objective Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2014.01.09, Pub. Date: 8 Nov. 2013

Image processing is the art of examining, identifying, and judging the significance of images. Image enhancement refers to accentuation, or sharpening, of image features such as edgels, boundaries, or contrast to make the processed image more useful for analysis. Image enhancement procedures use computers to provide good and improved images for study by human interpreters. In this paper we propose a novel method that uses the Genetic Algorithm with multi-objective criteria to find a more enhanced version of images. The proposed method has been verified with benchmark images in image enhancement. A simple Genetic Algorithm may not explore enough to find a more enhanced image. In the proposed method, three objectives are taken into consideration: intensity, entropy, and number of edgels. The proposed algorithm achieves automatic image enhancement by incorporating these objectives. We review some of the existing image enhancement techniques and compare the results of our algorithm with other Genetic Algorithm-based techniques. We expect that further improvements can be achieved by incorporating linear relationships between some other techniques.

Image Denoising based on Enhanced Wavelet Global Thresholding Using Intelligent Signal Processing Algorithm

By Joseph Isabona Agbotiname Lucky Imoize Stephen Ojo

DOI: https://doi.org/10.5815/ijigsp.2023.05.01, Pub. Date: 8 Oct. 2023

Denoising is a vital aspect of image preprocessing, often used to eliminate noise in an image and restore its proper characteristics and clarity. Noise often degrades the quality of valuable images, making them meaningless for practical applications. Several methods have been deployed to address this problem, but the quality of the recovered images still requires enhancement for efficient applications in practice. In this paper, a wavelet-based universal thresholding technique is proposed that can optimally denoise highly degraded noisy images with both uniform and non-uniform variations in illumination and contrast. The proposed method, herein referred to as modified wavelet-based universal thresholding (MWUT), was employed to denoise five noisy images and compared with three state-of-the-art denoising techniques. To appraise the quality of the images obtained, eight performance indicators were employed: Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Structural Content (SC), Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index Measure (SSIM), Signal-to-Reconstruction-Error Ratio (SRER), Natural Image Quality Evaluator (NIQE), and Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE). The first six indicators (RMSE, MAE, SC, PSNR, SSIM, and SRER) are reference indicators, while the remaining two (NIQE and BRISQUE) are referenceless. For superior performance of the proposed wavelet threshold algorithm, SC, PSNR, SSIM, and SRER must be higher, while lower values of NIQE, BRISQUE, RMSE, and MAE are preferred. Higher values of PSNR, SSIM, and SRER in the final results show the superior performance of the proposed MWUT denoising technique over the existing techniques. Lower NIQE, BRISQUE, RMSE, and MAE values also indicate better image quality with the proposed technique than with the existing schemes. The modified wavelet-based universal thresholding technique would find practical applications in digital image processing and enhancement.
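For orientation, the sketch below shows the classic universal threshold that such methods start from: the noise level is estimated from the median absolute value of the detail coefficients, the threshold is sigma * sqrt(2 ln N), and coefficients are shrunk by soft thresholding. This is the standard textbook form, not the modified MWUT rule, whose details the abstract does not give; the wavelet transform itself is assumed to have been applied already.

```python
import math
import statistics


def estimate_sigma(detail_coeffs):
    """Robust noise estimate: median absolute coefficient divided by 0.6745."""
    return statistics.median([abs(c) for c in detail_coeffs]) / 0.6745


def universal_threshold(sigma, n):
    """Classic universal threshold sigma * sqrt(2 ln n) for n coefficients."""
    return sigma * math.sqrt(2.0 * math.log(n))


def soft_threshold(coeffs, t):
    """Shrink each coefficient toward zero by t; values within [-t, t] become 0."""
    return [math.copysign(max(abs(c) - t, 0.0), c) for c in coeffs]


# Made-up detail coefficients, used only to exercise the functions.
coeffs = [1.0, -1.0, 2.0, -2.0, 0.0]
sigma = estimate_sigma(coeffs)
threshold = universal_threshold(sigma, len(coeffs))
shrunk = soft_threshold([3.0, -3.0, 0.5], 1.0)
```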

A Review of Self-supervised Learning Methods in the Field of Medical Image Analysis

By Jiashu Xu

DOI: https://doi.org/10.5815/ijigsp.2021.04.03, Pub. Date: 8 Aug. 2021

In the field of medical image analysis, supervised deep learning strategies have achieved significant development, but these methods rely on large labeled datasets. Self-supervised learning (SSL) provides a new strategy for pre-training a neural network with unlabeled data. It is a new unsupervised learning paradigm that has achieved significant breakthroughs in recent years. As a result, more and more researchers are trying to use SSL methods for medical image analysis to meet the challenge of assembling large medical datasets. To our knowledge, there is still a shortage of reviews of self-supervised learning methods in the field of medical image analysis; this article aims to fill that gap and comprehensively review the application of self-supervised learning in the medical field. It provides a detailed, up-to-date overview of self-supervised learning in the medical field and promotes the development of unsupervised learning in medical imaging. The methods are divided into three categories: context-based, generation-based, and contrast-based. For each category, we present the pros and cons and evaluate performance in downstream tasks. Finally, we conclude with the limitations of the current methods and discuss future directions.

Enhancement of Mammographic Images Based on Wavelet Denoise and Morphological Contrast Enhancement

By Toan Le Van Liet Van Dang

DOI: https://doi.org/10.5815/ijigsp.2023.06.03, Pub. Date: 8 Dec. 2023

Breast cancer can be detected by mammograms, but not all of them are of high enough quality to be diagnosed by physicians or radiologists. Therefore, denoising and contrast enhancement in the image are issues that need to be addressed. There are numerous techniques to reduce noise and enhance contrast, the most popular of which incorporate spatial filters and histogram equalization. However, these techniques occasionally result in image blurring, particularly around the edges. The purpose of this article is to propose a technique that uses wavelet denoising in conjunction with top-hat and bottom-hat morphological transforms in the wavelet domain to reduce noise and enhance image quality without distorting the image. Five wavelet functions are used to test the proposed method: Haar, Daubechies (db3), Coiflet (coif3), Symlet (sym3), and Biorthogonal (bior1.3); each wavelet function employs levels 1 through 4 with four types of wavelet shrinkage: Bayes, Visu, SURE, and Normal. Three flat structuring elements in the shapes of a disk, a square, and a diamond with sizes 2, 5, 10, 15, 20, and 30 are utilized for the top-hat and bottom-hat morphological transforms. To determine optimal parameters, the proposed method is applied to the mdb001 mammogram (mini MIAS database) contaminated with Gaussian noise with standard deviation σ = 20. Based on the quality assessment quantities, the Symlet wavelet (sym3) at level 3, with Visu shrinkage and a diamond structuring element of size 5, produced the best results (MSE = 50.020, PSNR = 31.140, SSIM = 0.407, and SC = 1.008). The results demonstrate the efficacy of the proposed method.
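A common way to use the two morphological transforms for contrast enhancement is to add the top-hat (bright details) and subtract the bottom-hat (dark details). The sketch below shows this on a 1D signal with a flat structuring element, which is a deliberate simplification: the paper works on 2D images in the wavelet domain, and the combination rule and function names here are assumptions.

```python
def erode(signal, radius):
    """Grayscale erosion with a flat window of the given radius (min filter)."""
    n = len(signal)
    return [min(signal[max(0, i - radius):min(n, i + radius + 1)]) for i in range(n)]


def dilate(signal, radius):
    """Grayscale dilation with a flat window of the given radius (max filter)."""
    n = len(signal)
    return [max(signal[max(0, i - radius):min(n, i + radius + 1)]) for i in range(n)]


def top_hat(signal, radius):
    """Signal minus its opening: keeps bright details narrower than the window."""
    opened = dilate(erode(signal, radius), radius)
    return [s - o for s, o in zip(signal, opened)]


def bottom_hat(signal, radius):
    """Closing minus the signal: keeps dark details narrower than the window."""
    closed = erode(dilate(signal, radius), radius)
    return [c - s for s, c in zip(signal, closed)]


def enhance(signal, radius):
    """Assumed contrast rule: signal + top-hat - bottom-hat (no clipping applied)."""
    th = top_hat(signal, radius)
    bh = bottom_hat(signal, radius)
    return [s + t - b for s, t, b in zip(signal, th, bh)]


bright_detail = enhance([0, 0, 5, 0, 0], radius=1)
dark_detail = enhance([5, 5, 0, 5, 5], radius=1)
```

The result can leave the valid intensity range, so a real pipeline would clip or rescale it afterwards.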

Radio Receiver with Internal Compression of Input Signals Using a Dispersive Delay Line with Bandpass Filters

By Roman Pantyeyev Felix Yanovsky Andriy Mykolushko Volodymyr Shutko

DOI: https://doi.org/10.5815/ijigsp.2023.06.01, Pub. Date: 8 Dec. 2023

This article proposes a receiving device in which arbitrary input signals are subject to pre-detector processing for the subsequent implementation of the idea of compressing broadband modulated pulses with a matched filter to increase the signal-to-noise ratio and improve resolution. For this purpose, a model of a dispersive delay line is developed based on series-connected high-frequency time delay lines with taps in the form of bandpass filters, and analysis of this model is performed as a part of the radio receiving device with chirp signal compression. The article presents the mathematical description of the processes of formation and compression of chirp signals based on their matched filtering using the developed model and proposes the block diagram of a radio receiving device using the principle of compression of received signals. The proposed model can be implemented in devices for receiving unknown signals, in particular in passive radar. It also can be used for studying signal compression processes based on linear frequency modulation in traditional radar systems.

Edibility Detection of Mushroom Using Ensemble Methods

By Nusrat Jahan Pinky S.M. Mohidul Islam Rafia Sharmin Alice

DOI: https://doi.org/10.5815/ijigsp.2019.04.05, Pub. Date: 8 Apr. 2019

Mushrooms are a familiar and delicious food that is cholesterol-free and rich in vitamins and minerals. Although nearly 45,000 species of mushrooms are known throughout the world, most of them are poisonous and a few are lethally poisonous. Identifying whether a mushroom is edible or poisonous with the naked eye is quite difficult. There is also no easy rule for edibility identification using machine learning methods that works for all types of data. Our aim is to find a robust method for identifying mushroom edibility with better performance than existing works. In this paper, three ensemble methods are used to detect the edibility of mushrooms: bagging, boosting, and random forest. Using the most significant features, five feature sets are constructed to build five base models for each ensemble method. Accuracy is measured for ensemble methods built from five fixed feature set-based models and from five randomly selected feature set-based models, on two types of test sets. The results show that methods built from fixed feature set-based models perform better than those built from randomly selected feature set-based models. The highest accuracy is obtained by the proposed model-based random forest on both test sets.

Fast Encryption Scheme for Secure Transmission of e-Healthcare Images

By Devisha Tiwari Bhaskar Mondal Anil Singh

DOI: https://doi.org/10.5815/ijigsp.2023.05.07, Pub. Date: 8 Oct. 2023

E-healthcare systems (EHSD), medical communications, and digital imaging (DICOM) have gained popularity over the past decade as they have become the top contenders for interoperability and adoption as a global standard for transmitting and communicating medical data. Security is a growing issue as EHSD and DICOM have become usable on any-to-any devices. The goal of this research is to create a privacy-preserving encryption technique for rapid EHSD communication with minimal storage. A new 2D logistic-sine chaotic map (2DLSCM) is used to design the proposed encryption method, which has been developed specifically for peer-to-peer communications via unique keys. With its initial values fed by the 3D Lorenz map, the 2DLSCM provides a unique keyspace of 2^544 in each round of peer-to-peer paired transmission. A permutation-diffusion design is used in the encryption process, and the 2DLSCM together with the 3D Lorenz system is used to generate unique initial values for the keys. Without interfering with real-time medical transmission, the approach can quickly encrypt any EHSD image and DICOM object. To assess the method, five distinct EHSD images of different kinds, sizes, and quality are selected. The findings indicate strong protection, speed, and scalability when compared to existing similar methods in the literature.

A Review on Image Reconstruction through MRI k-Space Data

By Tanuj Kumar Jhamb Vinith Rejathalal V.K. Govindan

DOI: https://doi.org/10.5815/ijigsp.2015.07.06, Pub. Date: 8 Jun. 2015

Image reconstruction is the process of generating an image of an object from the signals captured by the scanning machine. Medical imaging is an interdisciplinary field combining physics, biology, mathematics, and computational sciences. This paper provides a complete overview of the image reconstruction process in MRI (Magnetic Resonance Imaging), one of the commonly used medical imaging techniques, and reviews the computational aspects of medical image reconstruction. The data collected by an MRI scanner for image reconstruction is called k-space data. Various algorithms exist for reconstructing an image from k-space data, such as the Homodyne algorithm, the Zero Filling method, Dictionary Learning, and the Projections onto Convex Sets method. The characteristics of k-space data and the MRI data collection technique are reviewed in detail. The algorithms used for image reconstruction are discussed in detail along with their pros and cons. Modern magnetic resonance imaging techniques such as functional MRI and diffusion MRI are also introduced. The concepts of classical techniques such as Expectation Maximization, Sensitivity Encoding, and the Level Set Method, and of recent techniques such as Alternating Minimization, Signal Modeling, and the Sphere Shaped Support Vector Machine, are also reviewed. It is observed that most of these techniques enhance the gradient encoding and reduce the scanning time. Classical algorithms produce an undesirable blurring effect when the degree of phase variation is high in partial k-space. Modern reconstruction algorithms such as Dictionary Learning work well even with high phase variation because they are iterative procedures.
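As a toy illustration of the zero-filling idea, the sketch below treats a 1D signal as the "image", computes its k-space samples with a naive discrete Fourier transform, replaces the unacquired high-frequency samples with zeros, and reconstructs with the inverse transform. Real MRI data is 2D or 3D and complex-valued, and partial k-space acquisition patterns differ from this symmetric low-pass mask; the mask and function names are assumptions.

```python
import cmath


def dft(x):
    """Naive discrete Fourier transform (O(n^2)), adequate for a toy example."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Naive inverse discrete Fourier transform."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def zero_fill(kspace, keep):
    """Keep samples whose frequency index is within `keep` of DC; zero the rest."""
    n = len(kspace)
    return [kspace[k] if min(k, n - k) <= keep else 0j for k in range(n)]


signal = [1, 2, 3, 4, 3, 2, 1, 0]
kspace = dft(signal)
full_recon = idft(kspace)                 # all samples kept: exact up to rounding
low_recon = idft(zero_fill(kspace, 1))    # only DC and the lowest frequency pair
```

The low-frequency reconstruction is a smoothed version of the signal, which mirrors the blurring that the review attributes to classical methods.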

An Efficient Brain Tumor Detection Algorithm Using Watershed & Thresholding Based Segmentation

By Anam Mustaqeem Engr Ali Javed Tehseen Fatima

DOI: https://doi.org/10.5815/ijigsp.2012.10.05, Pub. Date: 28 Sep. 2012

During the past few years, brain tumor segmentation in magnetic resonance imaging (MRI) has become an emergent research area in the field of medical imaging systems. Brain tumor detection helps in finding the exact size and location of a tumor. An efficient algorithm is proposed in this paper for tumor detection based on segmentation and morphological operators. First, the quality of the scanned image is enhanced, and then morphological operators are applied to detect the tumor in the scanned image.

[...] Read more.
Evolutionary Image Enhancement Using Multi-Objective Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2014.01.09, Pub. Date: 8 Nov. 2013

Image Processing is the art of examining, identifying and judging the significances of the Images. Image enhancement refers to attenuation, or sharpening, of image features such as edgels, boundaries, or contrast to make the processed image more useful for analysis. Image enhancement procedures utilize the computers to provide good and improved images for study by the human interpreters. In this paper we proposed a novel method that uses the Genetic Algorithm with Multi-objective criteria to find more enhance version of images. The proposed method has been verified with benchmark images in Image Enhancement. The simple Genetic Algorithm may not explore much enough to find out more enhanced image. In the proposed method three objectives are taken in to consideration. They are intensity, entropy and number of edgels. Proposed algorithm achieved automatic image enhancement criteria by incorporating the objectives (intensity, entropy, edges). We review some of the existing Image Enhancement technique. We also compared the results of our algorithms with another Genetic Algorithm based techniques. We expect that further improvements can be achieved by incorporating linear relationship between some other techniques.

Image Denoising based on Enhanced Wavelet Global Thresholding Using Intelligent Signal Processing Algorithm

By Joseph Isabona Agbotiname Lucky Imoize Stephen Ojo

DOI: https://doi.org/10.5815/ijigsp.2023.05.01, Pub. Date: 8 Oct. 2023

Denoising is a vital aspect of image preprocessing, often explored to eliminate noise in an image to restore its proper characteristic formation and clarity. Unfortunately, noise often degrades the quality of valuable images, making them meaningless for practical applications. Several methods have been deployed to address this problem, but the quality of the recovered images still requires enhancement for efficient applications in practice. In this paper, a wavelet-based universal thresholding technique that possesses the capacity to optimally denoise highly degraded noisy images with both uniform and non-uniform variations in illumination and contrast is proposed. The proposed method, herein referred to as the modified wavelet-based universal thresholding (MWUT), was employed to denoise five noisy images and compared with three state-of-the-art denoising techniques. To appraise the quality of the images obtained, eight performance indicators were employed: the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Structural Content (SC), Peak Signal to Noise Ratio (PSNR), Structural Similarity Index Method (SSIM), Signal-to-Reconstruction-Error Ratio (SRER), Naturalness Image Quality Evaluator (NIQE), and Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE). The first six indicators (RMSE, MAE, SC, PSNR, SSIM, and SRER) are reference indicators, while the remaining two (NIQE and BRISQUE) are referenceless. For the superior performance of the proposed wavelet threshold algorithm, the SC, PSNR, SSIM, and SRER must be higher, while lower values of NIQE, BRISQUE, RMSE, and MAE are preferred. Higher values of PSNR, SSIM, and SRER in the final results show the superior performance of the proposed MWUT denoising technique over the existing techniques. Lower NIQE, BRISQUE, RMSE, and MAE values also indicate better image quality using the proposed technique over the existing schemes. The modified wavelet-based universal thresholding technique would find practical applications in digital image processing and enhancement.
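To make three of the reference indicators concrete, the following sketch computes RMSE, MAE, and PSNR using their standard textbook definitions. Treating images as flat sequences of pixel values and using a peak value of 255 (8-bit images) are assumptions of this example; the paper's own implementation is not shown in the abstract.

```python
import math
from typing import Sequence


def rmse(reference: Sequence[float], estimate: Sequence[float]) -> float:
    """Root mean square error between two equally sized pixel sequences."""
    squared = [(r - e) ** 2 for r, e in zip(reference, estimate)]
    return math.sqrt(sum(squared) / len(squared))


def mae(reference: Sequence[float], estimate: Sequence[float]) -> float:
    """Mean absolute error between two equally sized pixel sequences."""
    return sum(abs(r - e) for r, e in zip(reference, estimate)) / len(reference)


def psnr(reference: Sequence[float], estimate: Sequence[float],
         peak: float = 255.0) -> float:
    """Peak signal-to-noise ratio in dB; infinite when the images are identical.

    `peak` is the maximum possible pixel value (255 assumes 8-bit images).
    """
    mse = rmse(reference, estimate) ** 2
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(peak ** 2 / mse)
```

Lower RMSE and MAE and higher PSNR indicate a denoised image closer to the reference, which matches the direction of preference stated in the abstract.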

Text Region Extraction: A Morphological Based Image Analysis Using Genetic Algorithm

By Dhirendra Pal Singh Ashish Khare

DOI: https://doi.org/10.5815/ijigsp.2015.02.06, Pub. Date: 8 Jan. 2015

Image analysis belongs to the area of computer vision and pattern recognition. These areas are also a part of digital image processing, where researchers pay great attention to retrieving content information from various types of images having complex, low-contrast, or multi-spectral backgrounds. This content may be found in any form, such as texture data, shapes, and objects. Text region extraction from an image is a class of problems in digital image processing applications that aims to provide necessary information widely used in many fields, such as medical imaging, pattern recognition, robotics, and artificial intelligent transport systems. Extracting text data has become a challenging task. Since text extraction is very useful for identifying and analyzing the whole information about an image, in this paper we propose a unified framework combining morphological operations and Genetic Algorithms for extracting and analyzing text regions that may be embedded in an image with a variety of text properties: font, size, skew angle, distortion by slant and tilt, shape of the object the text is on, etc. We have evaluated our proposed method on gray-level image sets, made qualitative and quantitative comparisons with other existing methods, and concluded that the proposed method is better than the others.

A Review of Self-supervised Learning Methods in the Field of Medical Image Analysis

By Jiashu Xu

DOI: https://doi.org/10.5815/ijigsp.2021.04.03, Pub. Date: 8 Aug. 2021

In the field of medical image analysis, supervised deep learning strategies have achieved significant development, but these methods rely on large labeled datasets. Self-supervised learning (SSL) provides a new strategy to pre-train a neural network with unlabeled data. This is a new unsupervised learning paradigm that has achieved significant breakthroughs in recent years. As a result, more and more researchers are trying to use SSL methods for medical image analysis to meet the challenge of assembling large medical datasets. To our knowledge, there is still a shortage of reviews of self-supervised learning methods in the field of medical image analysis. This article aims to fill this gap by comprehensively reviewing the application of self-supervised learning in the medical field. It provides the latest and most detailed overview of self-supervised learning in the medical field and promotes the development of unsupervised learning in the field of medical imaging. The methods are divided into three categories: context-based, generation-based, and contrast-based. We then show the pros and cons of each category and evaluate their performance in downstream tasks. Finally, we conclude with the limitations of the current methods and discuss future directions.

Breast Cancer Classification from Ultrasound Images using VGG16 Model based Transfer Learning

By A. B. M. Aowlad Hossain Jannatul Kamrun Nisha Fatematuj Johora

DOI: https://doi.org/10.5815/ijigsp.2023.01.02, Pub. Date: 8 Feb. 2023

Ultrasound-based breast screening has recently been gaining attention, especially for dense breasts. Technological advancement, cancer awareness, and cost, safety, and availability benefits have led to a rapid rise of the breast ultrasound market. The irregular shape, intensity variation, and additional blood vessels of malignant cancer are distinguishable in ultrasound images from the benign phase. However, classification of breast cancer using ultrasound images is a difficult process owing to speckle noise and the complex textures of the breast. In this paper, a breast cancer classification method is presented using a VGG16 model based transfer learning approach. We have used a median filter to despeckle the images. The convolutional layers of the pretrained VGG16 model, along with the max-pooling layers, have been used as the feature extractor, and a proposed two-layer fully connected deep neural network has been designed as the classifier. The Adam optimizer is used with a learning rate of 0.001, and binary cross-entropy is chosen as the loss function for model optimization. Dropout in the hidden layers is used to avoid overfitting. Breast ultrasound images from two databases (897 images in total) have been combined to train, validate, and test the performance and generalization strength of the classifier. Experimental results showed a training accuracy of 98.2% and a testing accuracy of 91% on blind testing data, with reduced computational complexity. The gradient class activation mapping (Grad-CAM) technique has been used to visualize and check the localization of the targeted regions at the final convolutional layer, and the results were found to be noteworthy. The outcomes of this work might be useful for clinical applications of breast cancer diagnosis.
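The despeckling step mentioned in this abstract can be pictured with a small median filter. The sketch below is a plain-Python 3x3 version; the window size and the border handling (the window is simply clipped at the image edge) are assumptions for this example, since the abstract does not state them.

```python
from statistics import median_low
from typing import List

Image = List[List[int]]  # grayscale image as rows of pixel intensities


def median_filter_3x3(image: Image) -> Image:
    """Replace each pixel with the median of its (clipped) 3x3 neighbourhood."""
    h, w = len(image), len(image[0])
    out = [row[:] for row in image]
    for y in range(h):
        for x in range(w):
            window = [
                image[yy][xx]
                for yy in range(max(0, y - 1), min(h, y + 2))
                for xx in range(max(0, x - 1), min(w, x + 2))
            ]
            # median_low always returns an actual pixel value, even when the
            # clipped window holds an even number of pixels.
            out[y][x] = median_low(window)
    return out
```

Because the median ignores extreme values, a single very bright or very dark pixel inside an otherwise uniform region is replaced by the surrounding intensity, which is why median filtering is a common choice against impulsive and speckle-like noise.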

Edibility Detection of Mushroom Using Ensemble Methods

By Nusrat Jahan Pinky S.M. Mohidul Islam Rafia Sharmin Alice

DOI: https://doi.org/10.5815/ijigsp.2019.04.05, Pub. Date: 8 Apr. 2019

Mushrooms are a familiar and delicious food that is cholesterol free as well as rich in vitamins and minerals. Though nearly 45,000 species of mushrooms are known throughout the world, most of them are poisonous and a few are lethally poisonous. Identifying edible or poisonous mushrooms with the naked eye is quite difficult. There is also no easy rule for edibility identification using machine learning methods that works for all types of data. Our aim is to find a robust method for identifying mushroom edibility with better performance than existing works. In this paper, three ensemble methods are used to detect the edibility of mushrooms: bagging, boosting, and random forest. Using the most significant features, five feature sets are made for building five base models of each ensemble method. The accuracy of the ensemble methods is measured using both five fixed feature set-based models and five randomly selected feature set-based models, for two types of test sets. The results show that better performance is obtained for methods made of fixed feature set-based models than for randomly selected feature set-based models. The highest accuracy is obtained for the proposed model-based random forest for both test sets.
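The idea of combining several base models, each built on its own feature set, can be sketched with simple majority voting. Everything in this example is hypothetical: the feature names, the toy rules, and the use of three base models instead of the paper's five are illustrative assumptions only, and the rules carry no real guidance about mushroom safety.

```python
from collections import Counter
from typing import Callable, Dict, Sequence

Sample = Dict[str, str]           # feature name -> categorical value
Model = Callable[[Sample], str]   # returns "edible" or "poisonous"


def majority_vote(models: Sequence[Model], sample: Sample) -> str:
    """Return the label predicted by most base models."""
    votes = Counter(model(sample) for model in models)
    return votes.most_common(1)[0][0]


# Hypothetical base models, each looking at a different fixed feature.
def odor_model(sample: Sample) -> str:
    return "poisonous" if sample["odor"] == "foul" else "edible"


def gill_model(sample: Sample) -> str:
    return "poisonous" if sample["gill_color"] == "green" else "edible"


def spore_model(sample: Sample) -> str:
    return "poisonous" if sample["spore_print"] == "green" else "edible"


BASE_MODELS = [odor_model, gill_model, spore_model]
```

With an odd number of base models and two classes, the vote can never tie, which is one practical reason ensembles of this kind often use an odd count.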

Retinal Image Segmentation for Diabetic Retinopathy Detection using U-Net Architecture

By Swapnil V. Deshmukh Apash Roy Pratik Agrawal

DOI: https://doi.org/10.5815/ijigsp.2023.01.07, Pub. Date: 8 Feb. 2023

Diabetic retinopathy is one of the most serious eye diseases and can lead to permanent blindness if not diagnosed early. Its main cause is diabetes. Not every diabetic will develop diabetic retinopathy, but the risk of developing it is undeniable, which makes early diagnosis essential. Segmentation is one approach that is useful for detecting the blood vessels in a retinal image. This paper proposes three models based on a deep learning approach for recognizing blood vessels in retinal images using region-based segmentation techniques. The proposed approach consists of four steps: preprocessing, augmentation, model training, and performance measurement. The augmented retinal images are fed to the three models for training, which finally produce the segmented image. The three proposed models are applied to the publicly available DRIVE, STARE, and HRF datasets. It is observed that more thin blood vessels are segmented in the retinal images of the HRF dataset using model 3. The performance of the three proposed models is compared with other state-of-the-art blood vessel segmentation methods on the DRIVE, STARE, and HRF datasets.

A Review on Image Reconstruction through MRI k-Space Data

By Tanuj Kumar Jhamb Vinith Rejathalal V.K. Govindan

DOI: https://doi.org/10.5815/ijigsp.2015.07.06, Pub. Date: 8 Jun. 2015

Image reconstruction is the process of generating an image of an object from the signals captured by the scanning machine. Medical imaging is an interdisciplinary field combining physics, biology, mathematics, and computational sciences. This paper provides a complete overview of the image reconstruction process in MRI (Magnetic Resonance Imaging). It reviews the computational aspect of medical image reconstruction. MRI is one of the commonly used medical imaging techniques. The data collected by the MRI scanner for image reconstruction is called the k-space data. For reconstructing an image from k-space data, there are various algorithms such as the Homodyne algorithm, the Zero Filling method, Dictionary Learning, and the Projection onto Convex Sets method. The characteristics of k-space data and the MRI data collection technique are reviewed in detail. The algorithms used for image reconstruction are discussed in detail along with their pros and cons. Various modern magnetic resonance imaging techniques, such as functional MRI and diffusion MRI, are also introduced. The concepts of classical techniques like Expectation Maximization, Sensitivity Encoding, and the Level Set Method, and of recent techniques such as Alternating Minimization, Signal Modeling, and the Sphere Shaped Support Vector Machine, are also reviewed. It is observed that most of these techniques enhance the gradient encoding and reduce the scanning time. Classical algorithms produce an undesirable blurring effect when the degree of phase variation is high in partial k-space. Modern reconstruction algorithms such as Dictionary Learning work well even with high phase variation, as these are iterative procedures.

Real-Time Video based Human Suspicious Activity Recognition with Transfer Learning for Deep Learning

By Indhumathi .J Balasubramanian .M Balasaigayathri .B

DOI: https://doi.org/10.5815/ijigsp.2023.01.05, Pub. Date: 8 Feb. 2023

Nowadays, the primary concern of any society is providing safety to the individual. It is very hard to recognize human behaviour and identify whether it is suspicious or normal. Deep learning approaches have paved the way for the development of various machine learning and artificial intelligence applications. The proposed system detects real-time human activity using a convolutional neural network. The objective of the study is to develop a real-time application for activity recognition, with and without transfer learning methods. The proposed system considers criminal, suspicious, and normal categories of activities. Videos of different suspicious behaviours are collected from different people (men and women). The proposed system is used to detect suspicious activities of a person. The novel 2D-CNN, pre-trained VGG16, and ResNet50 are trained on video frames of human activities such as normal and suspicious behaviour. Similarly, VGG16 and ResNet50 with transfer learning are trained using human suspicious activity datasets. The results show that the novel 2D-CNN, VGG16, and ResNet50 without transfer learning achieve accuracies of 98.96%, 97.84%, and 99.03%, respectively. On Kaggle and real-time video, the proposed system employing the 2D-CNN outperforms the pre-trained VGG16 model. The trained model is used to classify the activity in real-time captured video. The accuracy of 99.18% obtained with ResNet50 with transfer learning is higher than the 98.36% obtained with VGG16 with transfer learning.

Deep Learning Based Autonomous Real-Time Traffic Sign Recognition System for Advanced Driver Assistance

By Sithmini Gunasekara Dilshan Gunarathna Maheshi B. Dissanayake Supavadee Aramith Wazir Muhammad

DOI: https://doi.org/10.5815/ijigsp.2022.06.06, Pub. Date: 8 Dec. 2022

Deep learning (DL) architectures are becoming increasingly popular in modern traffic systems and self-driven vehicles owing to their high efficiency and accuracy. Emerging technological advancements and the availability of large databases have made a favorable impact on such improvements. In this study, we present a traffic sign recognition system based on novel DL architectures, trained and tested on a locally collected traffic sign database. Our approach includes two stages: traffic sign identification from a live video feed, and classification of each sign. The sign identification model was implemented with the YOLO architecture, and the classification model was implemented with the Xception architecture. The input video feed for these models was collected using dashboard camera recordings. The classification model has also been trained with the German Traffic Sign Recognition Benchmark dataset for comparison. The final classification accuracy for the local dataset was 96.05%, while the standard dataset gave an accuracy of 92.11%. The final model is a combination of the detection and classification algorithms, and it is able to successfully detect and classify traffic signs from an input video feed at an average rate of 4.5 fps.
