IEEE ICASSP 2024

IEEE ICASSP 2024 - IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The IEEE ICASSP 2024 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit the website.

Enhanced Axle-Based Vehicle Classification Using Angle-Based Micro-Doppler Signature

Read more about Enhanced Axle-Based Vehicle Classification Using Angle-Based Micro-Doppler Signature
Log in to post comments

This study introduces an angle-based micro-Doppler analysis using Frequency Modulated Continuous Wave (FMCW) radar tailored for axle-based vehicle classification. The novel approach exploits the signal angle of arrival to separate incoming signals and noise from distinct targets. This is done by analysing the phase difference of a dual antenna radar system based on the time-frequency representation of the radar beat signal. Vehicles driving side by side can now be discriminated. Multipath signals and clutter are more easily identified and filtered out.

Poster_ICASSP_A0_PDF.pdf

Poster_ICASSP_A0_PDF.pdf (33)

Categories:: Other

6 Views

Vision Transformer MST++: Efficient Hyperspectral Skin Reconstruction

Read more about Vision Transformer MST++: Efficient Hyperspectral Skin Reconstruction
Log in to post comments

Channel reconstruction transforms a subsampled mutispectral image into hyperspectral, offering hyperspectral imaging benefits without a dedicated camera. MST++ is a
state of the art channel reconstruction technique, but it faces memory limitations for high spatial resolution images. In this context, we introduce VITMST++, a novel architecture in-
corporating Vision Transformer embedding and compression, multi-resolution image context and a channel-weighted loss. Developed for the ICASSP 2024 Hyperspectral Skin Chal-

ICASSP_VITMST++_final.pdf

ICASSP_VITMST++_final.pdf (22)

Categories:: Other applications of machine learning (MLR-APPL)

13 Views

KEEP_KNOWLEDGE_IN_PERCEPTION_slides

Read more about KEEP_KNOWLEDGE_IN_PERCEPTION_slides
Log in to post comments

This is the ppt of our paper: KEEP KNOWLEDGE IN PERCEPTION: ZERO-SHOT IMAGE AESTHETIC ASSESSMENT, in ICASSP 2024.

KEEP KNOWLEDGE IN PERCEPTION.pptx

KEEP KNOWLEDGE IN PERCEPTION.pptx (27)

Categories:: Quality Assessment

8 Views

CO-OCCURRENCE GRAPH-ENHANCED HIERARCHICAL PREDICTION OF ICD CODES

Read more about CO-OCCURRENCE GRAPH-ENHANCED HIERARCHICAL PREDICTION OF ICD CODES
Log in to post comments

Recent healthcare applications of natural language processing involve multi-label classification of health records using the International Classification of Diseases (ICD). While prior research highlights intricate text models and explores external knowledge like hierarchical ICD ontology, fewer studies integrate code relationships from whole datasets to enhance ICD coding accuracy. This study presents a modular approach, sequentially combining graph-based integration of ICD code co-occurrence with a hard-coded hierarchical enriched text representation drawn from the ICD ontology.

Poster_for_ICASSP_2024__CO_OCCURRENCE_GRAPH_ENHANCED_HIERARCHICAL_PREDICTION_OF_ICD_CODES_final.pdf

Poster_for_ICASSP_2024__CO_OCCURRENCE_GRAPH_ENHANCED_HIERARCHICAL_PREDICTION_OF_ICD_CODES_final.pdf (33)

Categories:: Other

11 Views

Enabling Device Control Planning Capabilities of Small Language Model

Read more about Enabling Device Control Planning Capabilities of Small Language Model
Log in to post comments

Smart home device control is a difficult task if the instruction is abstract and the planner needs to adjust dynamic home configurations. With the increasing capability of Large Language Model (LLM), they have become the customary model for zero-shot planning tasks similar to smart home device control. Although cloud supported large language models can seamlessly do device control tasks, on-device small language models show limited capabilities. In this work, we show how we can leverage large language models to enable small language models for device control task.

icassp_sudipta.pptx

icassp_sudipta.pptx (25)

Categories:: Other

9 Views

ELECTROENCEPHALOGRAM SENSOR DATA COMPRESSION USING AN ASYMMETRICAL SPARSE AUTOENCODER WITH A DISCRETE COSINE TRANSFORM LAYER

Electroencephalogram (EEG) data compression is necessary for wireless recording applications to reduce the amount of data that needs to be transmitted. In this paper, an asymmetrical sparse autoencoder with a discrete cosine transform (DCT) layer is proposed to compress EEG signals. The encoder module of the autoencoder has a combination of a fully connected linear layer and the DCT layer to reduce redundant data using hard-thresholding nonlinearity.

Presentation_EEG_Compression.pptx

EEG data compression, autoencoder, DCT layer (22)

Categories:: Bio Imaging and Signal Processing

12 Views

dklement_dvbx_slides

Read more about dklement_dvbx_slides
Log in to post comments

Bayesian HMM clustering of x-vector sequences (VBx) has become a widely adopted diarization baseline model in publications and challenges. It uses an HMM to model speaker turns, a generatively trained probabilistic linear discriminant analysis (PLDA) for speaker distribution modeling, and Bayesian inference to estimate the assignment of x-vectors to speakers. This paper presents a new framework for updating the VBx parameters using discriminative training, which directly optimizes a predefined loss.

DVBx-slides_fin.pdf

DVBx-slides_fin.pdf (26)

Categories:: Other

11 Views

Localizing Acoustic Energy in Sound Field Synthesis by Weighted Exterior Radiation Suppression

A method for synthesizing the desired sound field while suppressing the exterior radiation power with directional weighting is proposed. The exterior radiation from the loudspeakers in sound field synthesis systems can be problematic in practical situations. Although several methods to suppress the exterior radiation have been proposed, suppression in all outward directions is generally difficult, especially when the number of loudspeakers is not sufficiently large.

icassp2024_koyama.pdf

icassp2024_koyama.pdf (25)

Categories:: Spatial and Multichannel Audio

20 Views

Quantum Federated Learning with Quantum Networks PPT

Read more about Quantum Federated Learning with Quantum Networks PPT
Log in to post comments

A major concern of deep learning models is the large amount of data that is required to build and train them, much of which is reliant on sensitive and personally identifiable information that is vulnerable to access by third parties. Ideas of using the quantum internet to address this issue have been previously proposed, which would enable fast and completely secure online communications. Previous work has yielded a hybrid quantum-classical transfer learning scheme for classical data and communication with a hub-spoke topology.

Quantum Federated Learning with Quantum Networks ICASSP 2024.pptx

Quantum Federated Learning with Quantum Networks ICASSP 2024.pptx (23)

Categories:: Other

10 Views

Linear Complexity Gibbs Sampling for Generalized Labeled Multi-Bernoulli Filtering

Read more about Linear Complexity Gibbs Sampling for Generalized Labeled Multi-Bernoulli Filtering
Log in to post comments

Generalized Labeled Multi-Bernoulli (GLMB) densities arise in a host of multi-object system applications analogous to Gaussians in single-object filtering. However, computing the GLMB filtering density requires solving NP-hard problems. To alleviate this computational bottleneck, we develop a linear complexity Gibbs sampling framework for GLMB density computation.

Poster - Linear Complexity Gibbs Sampling for GLMB Filtering.pdf

Poster - Linear Complexity Gibbs Sampling for GLMB Filtering.pdf (20)

Categories:: Statistical Signal Processing

17 Views

IEEE ICASSP 2024

Pages