OpenSolaris
Collectives
Discussions
Documentation
Download
Source Browser
Free CD
Log-in
|
en
Project awards
:
Undergraduate Student Research Grant Program
>
Grant Proposals
>
Proposal: Image Storage
Top Menu
Show
:
Comments
Attachments
History
Information
Print
:
Print
Print preview
Export as PDF
Export as RTF
Export as HTML
Export as XAR
Wiki code for
Proposal: Image Storage
Hide Line numbers
1: == Image Storage and Retrieval in OpenSolaris 2: 3: ==== Antonin Brjetchka - lead student researcher Sanjiv K. Bhatia - faculty advisor Drew Garrett, Dante Avery, Prasad Raghavendra - student researchers 4: 5: === University of Missouri – Saint Louis, St. Louis, MO, U.S.A. 6: 7: [[Project web site>>http://solaris-lab.cs.umsl.edu/trac/wiki/]] 8: 9: [[Final Report>>attach:Project awards.files@ImageStorageFinalReport.pdf]] [PDF] 10: 11: [[Second Progress Report>>attach:Project awards.files@ImageStorageProgressReport2.pdf]] [PDF] 12: 13: [[First Progress Report>>attach:Project awards.files@ImageStorageProgressReport1.pdf]] [PDF] 14: 15: ==== Statement of Purpose 16: 17: In this proposal, we plan to develop a utility that can be used to store and organize image files in OpenSolaris. Our aim is to expand the utilization of OpenSolaris by providing adequate means to store large number of image files, and their fast and accurate retrieval. Our faculty advisor, Dr. Sanjiv Bhatia, and one of his graduate students, Debangshu Goswami, have developed a content-based image retrieval (CBIR) system named Robust Image Search Engine, or RISE. RISE has been developed on Sun’s hardware using Oracle and Java Advanced Imaging (JAI) tools. RISE is effective in retrieving images that are similar to a given image. However, it is limited in the sense that it does not have any facility to provide for image annotations, and hence, cannot support text-based image retrieval (TBIR). We will adapt the implementation of RISE using open source tools like ImageMagick and MySQL to provide for both CBIR and TBIR capabilities. 18: 19: ==== Background 20: 21: With the increasing popularity and mobility of digital imaging devices, such as cameras, scanners, home entertaining systems, and cell phones, there is an equally increasing demand for building, storing, and querying a large image database. This task becomes increasingly complex as millions of internet users generate image files at an exponential rate. The currently dominant image retrieval techniques use human language to describe the contents of an image, also known as text-based image retrieval (TBIR). The drawbacks of this technique are poor accuracy and the retrieval of possibly unrelated images. These drawbacks are due to the varying perception of a single image by different individuals, and the existence of different ways to describe a single image in human language. While humans can easily discern objects in an image, they are unable to describe its full semantic content in an unambiguous language that can be tokenized for automatic indexing and retrieval [2]. This problem ensues from the fact that some image content that is important to one person may not be relevant to another. Furthermore, some content of the image may even be left out, which further increases the margin of error. Finally, this method is severely limited as it is time intensive, involves a human element to annotate images, and hence, is impractical for large databases [4]. The attempts to solve this problem led to the development of a technique, described as content-based image retrieval (CBIR). 22: 23: One of the first CBIR systems resulted from the QBIC (Query by Image Content) project at IBM [3]. QBIC pioneered the idea of using a probability distribution function to describe color in the images. Later work in CBIR concentrated on creating indices to describe the images that improved the efficiency and effectiveness of the retrieval process. 24: 25: There are two query methods in CBIR: query-by-example and query-by-memory [1]. Query-by-example allows the user to select an image to be used for the query. The system then retrieves images that have a similar probability distribution function for color as the query image. In query-by-memory the user may select one or more image features, such as texture, color, and shape, from their memory to be used in the query. RISE was developed as a CBIR system that provides an image repository with an interface, which allows querying and maintaining an image database. However, RISE completely ignored the TBIR aspect of retrieval, which may be of tremendous use in a real application for query and retrieval. RISE has a nice web-based interface that can be extended to provide it the capability of both CBIR and TBIR systems. Having both methods of image retrieval will further increase the usability and scalability of the system. 26: 27: The TBIR system of RISE will allow the user to specify image descriptive tags as images are inserted into the database. These tags may later be used to query the database, providing functionality similar to popular websites, such as Google and flickr. The user will be able to select the method of image retrieval, options will be provided for TBIR, CBIR, or a combination of both methods. These options will enable to user to narrow or widen the range of results. The use of TBIR will yield a winder range, less accurateboth techniques in a single query will increase result accuracy even more. By using TBIR and CBIR together images that have similar color attributes but contain undesirable tags, or contain matching tags but different color attributes, will be omitted thus producing more desirable results. 28: 29: RISE uses the distribution of colors in different regions of the image in order to compute its signature at multiple resolutions. The signature of an image in the database is computed by systematically dividing the image into a set of small blocks of pixels and then computing the average color of each block [4]. This is based on the Discrete Cosine Transform (DCT) that forms the basis for popular JPEG image file format [4]. In RISE, this process can be applied to any image format, which allows for great flexibility of the system. RISE divides an image in a quadtree structure [7] and computes the average color in each pixel block within the subdivided image. RISE stores the computed information into a quad tree structure [7] to form the signature of an image. RISE extracts the RGB data of an image and converts it into L*a*b* color space [6], this provides a uniform way of comparing images of different file formats. 30: 31: RISE uses the L*a*b* color space for its perceptual linearity. The L*a*b* color model was developed by CIE (Commission Internationale d’Eclairage) to describe the colors as perceived by the human eye [4]. This model represents colors in a three dimensional space. The vertical axis corresponds to the luminance channel L*, the contrast between green and red is represented by the axis a*, and the contrast between yellow and blue is represented by the axis b*. This model was designed to be device independent and to be used as a reference. It provides linear response to human visual perception [4]. The conversion from RGB to L*a*b* color space is based on the Rec. 709 standard [5]. The conversion is based on CIE XYZ, a set of three linear light components that embed the spectral properties of human color perception. 32: 33: In RISE, the size of an image is converted to 512 x 512 pixels, and the resized image is divided into 8x8 blocks. We decided on using the 8x8 pixel block in accordance with the requirement in JPEG compression where each 8x8 block is filtered through Discrete Cosine Transform to separate the frequencies. The DC value of the block, in location (0,0), gives us the average pixel value in the block. The average color values of four 8x8 blocks, comprising a 16x16 block, represent leaf nodes in the quad tree, and each parent node contains an average of the values of its child nodes, thus the root node contains an average value for the whole tree. This setup allows two images to be compared at different levels and a degree of similarity may be determined. Using a commercial relational database system (RDBMS) to store and query signatures of images improves the efficiency of the system [4]. The signatures of images are stored in a table in the RDBMS. The columns of the table include the name of the image, the average L*a*b* values at each level of the quad tree structure. We plan to extend RISE by adding additional attributes in the RDBMS to store tokens to be used by the TBIR system. Using Structured Query Language (SQL) allows the system to specify the level of comparison of two images. In addition an RDBMS provides an index for faster access to data. An index contains an entry for each value that appears in the indexed column(s) of the relation and provides direct, fast access to rows [4]. 34: 35: ==== Approach 36: 37: RISE will be implemented into OpenSolaris in C, for its speed and efficiency. In addition, MySQL and ImageMagick will be used in this project. 38: 39: MySQL is open source and a widely available RDBMS. Its speed, reliability, and ease of use make it the perfect RDBMS to use for this project. MySQL will be used to store and maintain image signatures, and to provide a fast and efficient search and retrieval of the images. This project will take advantage of the ability of MySQL to be embedded into applications; this feature will eliminate the complexity of creating a separate database in order to use RISE. Embedded MySQL eliminates the administration of a stand-alone database server, and provides a fully featured database at the cost of a very small footprint. Furthermore, embedding MySQL into the application also eliminates the overhead of establishing a connection and the client-server correspondence. 40: 41: ImageMagick will be used for image-specific operations; this will additionally improve system efficiency. ImageMagick will allow us the capability to read multiple image formats; this capability came in RISE from the use of JAI tools. MagickWand is a C application program interface (API) of ImageMagick, which will be used to convert images and calculate their average values. Image processing will be very fast and efficient, since MagickWand is already optimized to perform these operations. This will also allow us to concentrate on developing the algorithm, rather than manipulating the images. MagickWand provides a wide variety of functions to retrieve information about an image, these functions will allow us to fine-tune the signature calculation process and increase the overall accuracy and performance of the system. 42: 43: ==== End Result 44: 45: At the end of this project we expect to have an application, which will provide the ability to store, search, and retrieve images through a web based graphical user interface ( GUI ). In addition, this implementation of RISE will provide both TBIR and CBIR methods of image retrieval. This will allow the user to describe content to be searched for or specify an example image. 46: 47: ==== References: 48: 49: 1. E. L. van den Broek, P. M. Kisters, and L. G. Vuurpijl. Design guidelines for a contentbased image retrieval color-selection interface. In Proceedings of the Conference on Dutch Directions in HCI, Amsterdam, Holland, 2004. 50: 1. S. Climer and S. K. Bhatia. Image database indexing using JPEG coefficients. Pattern Recognition, 35(11):2479–2488, November 2002. 51: 1. M. Flickner, et al. Query by Image Content: The QBIC System. IEEE Computer, 28(9): 23-32, September 1995. 52: 1. D. Goswami, S.K. Bhatia, and A. Samal. RISE: A Robust Image Search Engine. Pattern Recognition Theory and Applications. E. A. Zoeller (ed.). Nova Science Publishers, 2007. 53: 1. ITU-R Recommendation BT.709. Basic parameter values for the hdtv standard for the studio and for international programme exchange. Technical Report BT.709 [formerly CCIR Rec. 709], International Telecommunications Union, 1211 Geneva 20, Switzerland, 1993. 54: 1. C. Poynton. A guided tour of color space. In New Foundations for Video Technology: Proceedings of the SMPTE Advanced Television and Electronic Imaging Conference, San Francisco, CA. February 1995. Pages 167-180. 55: 1. H. Samet. The quadtree and related hierarchical data structures. ACM Computing Surveys, 16(2):187–260, June 1984.
Search
Collectives
Community Group
Academic and Research
Accessibility
Advocacy
Appliances
Approachability
Architecture Process and Tools
BrandZ
Chinese Users
Community Advisory Board
Databases
Desktop
Device Drivers
Distribution
Documentation
DTrace
Emerging Platforms
Fault Management
Games on OpenSolaris
HA Clusters
HPC Developer
Installation and Packaging
Internationalization and Localization
Laptop
Logical Domains
Modular Debugger (MDB)
Networking
NFS
Observability
OpenSolaris Governing Board (OGB)
OpenSolaris Printing
OS/Net (ON)
Performance
Power Management
PowerPC
Security
Service Management Facility (smf(5))
Software Porters
Solaris Volume Manager
Storage
Systems Administration Community Group
Testing
Tools Home
Unix File Systems (UFS)
Website Community
X Window System
Xen
ZFS
Zones
Project
ADSL Modem Enhancement
ARC Process Definition
ARM Platform Port
Automatic Data Migration
BIND Update
Bluetooth Stack & Drivers
Brocade FC HBA - Initiator
Brocade FC HBA - Target
Brussels - unified network link configuration
Caiman, Solaris Install Revisited
Celeste
Český portál
Chime Visualization Tool for DTrace
CIFS client for Solaris
CIFS Server
Clearview: Network Interface Coherence
Cluster Agent: Informix Dynamic Server
Cluster Agent: OpenSolaris Container
Cluster Agent: OpenSolaris xVM
Cluster Agent: Oracle E-Business Suite
Cluster agent: PostgreSQL
Cluster Agent: Samba
Cluster Agent: Tomcat
CMT
Coarse Data Flow Parallelism
Colorado: Open HA Cluster on OpenSolaris
Command Assistant
Common Array Manager
Companion - /opt/sfw: Free and Open Source software
COMSTAR: Common Multiprotocol SCSI Target
Content
Contest
CPU Observability
Credentials Process Groups
Crossbow: Network Virtualization and Resource Control
Crypto KMS Agent Toolkit
Cryptographic Framework
Data Migration Manager
Data Tethers
Deutsches Portal
Device Detection Tool
Device Driver Utility
Device Manager
Device Mapper
Direct Rendering Infrastructure & 3D drivers
DTrace Guide
Duckwater: Simplified name services management
Easy Tools
Emancipation
Emulex Fibre Channel Device Driver
Emulex Advanced Ethernet Device Driver
Enable/Enhance Solaris support for Intel Platform
Enhance the support of USB webcams
Enhanced SMF Profiles
Enhancements for AMD-based Platforms
Erlang DTrace Integration
Ethernet bridge module for Solaris
Evaluate Conary
Events Registry
Ext3 file system support
F/OSS Package Base
Facilitation
Fibre Channel over Ethernet
Fine Grained Access Policy (FGAP)
Fingerprint Authentication
Flexible Mandatory Access Control
Forensic Tools
Fully Open X Project
Fuse on Solaris
gcore
Generic Machine Check Architecture Improvements
Google SOC
HA-JBoss
HA-MySQL
Hadoop Live CD
Hitachi
HoneyComb Fixed Content Storage
HPC Stack
Image Packaging System
Improved Performance MIB
Indiana
Innovation Awards
Input Method
Intel Graphics
Internet Key Exchange, version 2
Interrupt Resource Management
IP Datapath Refactoring
IP over Infiniband
IPsec Tunnel Reform
iSCSI Extensions for Remote DMA (iSER)
iSNS Server
JeOS - Just enough Operating System
JKstat - a java binding for libkstat
Journaled File System (JFS)
K Desktop Environment
Kerberos
Kernel Sockets
Kernel SSL Enhancements
Key Management Framework
Korn Shell 93 integration/migration project
Labeled IPsec
LatencyTOP
Layer 2 Filtering
LDoms Manager
Lending
libMicro - portable microbenchmarks
Link Layer Discovery
Live Media: Technologies for distributions running from CD and other media
Locale Data
lofi compression and cryptography support
lx64 brand
Media Management System
Mega_sas
Mexico
MilaX minimal Live Distribution
MIPS Platform Port
Mozilla DTrace
MRSL.NONsharedDevice
Multi-lingual Glossary
Multi-pathing software (MPxIO)
Multiple disk sector size support
Multiple DOI
Muskoka: An open repository for OpenSolaris technical content
Navigator
Nemo: A Framework for High-Performance Networking
Network Auto-Magic
Network Data Management Protocol
Network MIBs
Network Storage
Network Time Protocol (NTP)
Nevada Globalization
New Design of 4over6 Mechanism Based on OpenSolaris
NFS RDMA transport update and performance analysis
NFS Server in non-Global Zones
NFS version 4.1 pNFS
NFSv4 namespace extensions
Nightingale: Port Songbird to OpenSolaris
NPort ID Virtualization (NPIV)
NUMA
Object Storage Device (OSD) support for Solaris
OHACGE Script Based Plug-in
ON/Nevada (ONNV) Project
Open Development Infrastructure
Open HA Cluster Utilities
Open Sound System
OpenGrok
OpenPegasus CIM Server
OpenRTI
OpenSolaris Busybox
OpenSolaris Desktop
OpenSolaris Hispano
OpenSolaris Security Audit
OpenSolaris support for the QEMU processor emulator: host and guest
PEF: Packet Event Framework
Performance Wrappers
Pkgfactory
Polski Portal
Portail Francophone
Portal Brasil
Portals
Power Management Usability Interfaces
Presto: Automatic Printing Configuration
Printable Many Page Solaris Manuals
Promise SuperTrak RAID HBA Driver
QLogic Converged Network Adapter GLDv3 NIC Driver
Quagga Routing Protocol Suite Integration
RAID Configuration Utility
RBridge (IETF TRILL) support
RDMA Offload Framework
Reno: Login Process Enhancements for Interop
Resource Management
s10brand
SAM/QFS
SCM Migration Project
SCSI RDMA Protocol
SDcard Drivers
Sensor Abstraction Layer
Session Initiation Protocol
SFW
Shell: bourne shell, korn shell, C shell, etc.
Sierra: Intel WiFi Chipsets Support
Simple Panels
SM-HBA Based SAS HBA Management
SMF Documentation
Solaris iSCSI Target
Solaris PowerPC Port
SourceJuicer
Sparks: name service switch/nscd enhancements
Squashfs
Star integration/migration project
Starfish
Starter Kit
Storage Power Management
Sun Security Toolkit
Sun StorageTek Availability Suite
Support for OpenFabrics User Verbs / API on OpenSolaris OS
Support gcc4/GCCfss in Solaris
Suspend/Resume
SVR4 Packaging
Systemz
Tamarack: Removable Media Enhancements in Solaris
Tesla: OpenSolaris Enhanced Power Management
Test Development
Tickless Kernel Architecture
TIPC
Trademarks
Trusted networking interface policy database for Trusted Extensions
Trusted Platform Module support
Use Case
Validated Execution Project
Virtual Console
Virtual Network Machines
Visual Panels
Visualization for HPC
Volo
VRRP: Virtual Router Redundancy Protocol Implementation
VSCAN service
Web Stack
Website
Winchester: Schema mapping and ID mapping for AD Interoperability
Wireless USB Support
Wireless Wide Area Network
X Consolidation
x86 Generic FMA Topology Enumerator
Xen Gate
Xfce: A lightweight desktop environment
ZFS Boot and Install
ZFS on disk encryption support
Zone Manager
Zone Statistics
Русский портал
البوابة العربية
भारतीय पोर्टल
中国门户
日本ポータル
한국 포탈
User Group
Adelaide
Argentina
Arizona
Atlanta
Baltimore-Washington
Bangalore
Bangkok
Bangladesh
Beijing
Bélem
Berlin
Bhimavaram
Bloomington
Campus Ambassadors
Capital Region
Cardiff
Charlotte
Chengdu
Chennai
Chihuahua
Chile
Cleveland
Colombia
Columbus
Connecticut
Cracow
Czech
Dallas/Ft. Worth
Danish
Delaware
Edinburgh
Egypt
Finland
Florida
Front Range
FuZhou
Great Lakes
Greece
Hangzhou
Hawaii
HeFei
Houston
Hyderabad
Indonesia
Irish
Israel
Italian
Jinan
Kabul
Kansas City
Latvia
London
Madurai
Manchester
Mato Grosso
Melbourne
Minas Gerais
Minnesota
Montreal
Moscow
Mumbai
Munich
NEA
Netherlands
New England
New York City
New Zealand
NIT Hamirpur
Noroeste
Oklahoma City
Osnabrück
Peru
Philadelphia
Piaski
Pittsburgh
Porto Alegre
Puget Sound
Pune
Queensland
Research Triangle Park
Romania
Russia
San Antonio
San Diego
San Francisco
São Paulo
Scottish
Serbia
Shanghai
Shenzhen
Silicon Valley
Singapore
Slovak
South African
Southern Connecticut
St. Louis
Sweden
Switzerland
Sydney
Szczecin
Taiwan
Tecum
Thames Valley
Tokyo
Toronto
Trondheim
Tulsa
Turkey
Ukraine
University of Melbourne
Vale do Paraíba
Vancouver
Venezuela
Welsh - Cymru
Wisconsin
Xi'an
Subsites
Code Reviews
Code Repositories
Package Search
Bugster
Bugzilla
Test Machines
Planet
Mailing Lists
Elections & Polls
ARC Case Logs
Source Juicer
Package Factory
User Authentication
Project awards Pages
OpenSolaris Community Innovation Awards Program
Contest Entries
Contest FAQ
Official Rules
Contest Judging
Undergraduate Student Research Grant Program
Grant Proposals
Grant FAQ
Call for Proposals
Files