多媒体技术课后题.ppt_三一办公31ppt.com

资源描述

《多媒体技术课后题.ppt》由会员分享，可在线阅读，更多相关《多媒体技术课后题.ppt（102页珍藏版）》请在三一办公上搜索。

1、Ch3 Graphics and Image Data Representations,1.Briefly explain why we need to be able to have less than 24-bit color and why this makes for a problem.Generally,what do we need to do to adaptively transform 24-bit color values to 8-bit ones?Answer:May not be able to handle such large file sizes or not

2、 have 24-bit displays.The colors will be somewhat wrong,however.,We need to cluster color pixels so as to best use the bits available to be as accurate as possible for the colors in an image.In more detail:variance minimization quantizationvmquant.m Minimum variance quantization allocates more of th

3、e available colormap entries to colors that appear frequently in the input image and allocates fewer entries to colors that appear infrequently.Therefore if there are for example many reds,as in a red apple,there will be more resolution in the red part of the color cube.An excellent implementation o

4、f this idea is Wus Color Quantizer(see Graphics Gems vol.II,pp.126-133).,2.Suppose we decide to quantize an 8-bit grayscale image down to just 2 bits of accuracy.What is the simplest way to do so?What ranges of byte values in the original image are mapped to what quantized values?Answer:,0 to 6364 t

5、o 127128 to 191192 to 255Then reconstruction values should be taken as the middle of these ranges;i.e.,3296160224,3.Suppose we have a 5-bit grayscale image.What size of ordered dither matrix do we need to display the image on a 1-bit printer?Answer:25=32 levels=n2+1 with n=6;therefore need D(6),4.Su

6、ppose we have available 24 bits per pixel for a color image.However,we notice that humans are more sensitive to R and G than to B in fact,1.5 times more sensitive to R or G than to B.How could we best make use of the bits available?Answer:ratio is 3:3:2,so use bits 9:9:6 for R:G:B.,5.At your job,you

7、 have decided to impress the boss by using up more disk space for the companys grayscale images.Instead of using 8 bits per pixel,youd like to use 48 bits per pixel in RGB.How could you store the original grayscale images so that in the new format they would appear the same as they used to,visually?

8、,Answer:48 bits RGB means 16 bits per channel:so re-store the old ints,which were 2 8,as new ints 2 16.But then the new values have to be created by multiplying the old values by 2 8,so that e.g.a mid-gray is still a mid-gray.As well,have to duplicate the old gray into all three of R,G,B.,6.For the

9、color LUT problem,try out the median-cut algorithm on a sample image.Explain briefly why it is that this algorithm,carried out on an image of red apples,puts more color gradation in the resulting 24-bit color image where it is needed,among the reds.,7.Write down an algorithm(pseudocode)for calculati

10、ng a color histogram for RGB data.Answer:int hist256256256;image is an appropriate struct with int fields red,green,bluefor i=0.(MAX_Y-1)for j=0.(MAX_X-1)R=imagexy.red;G=imagexy.green;B=imagexy.blue;histRGB+;,Ch4 Color in Image and Video,Exercise 3,1.Consider the following set of color-related terms

11、:(a)wavelength(b)color level(c)brightness(d)whitenessHow would you match each of the following(more vaguely stated)characteristics to each of the above terms?,(a)luminance)brightness(b)hue)wavelength(c)saturation)whiteness(d)chrominance)color level,2.What color is outdoor light?For example,around wh

12、at wavelength would you guess the peak power is for a red sunset?For blue sky light?Answer:450 nm,650 nm.,3.(a)Suppose images are not gamma corrected by a camcorder.Generally,how would they appear on a screen?Answer:Too dark at the low-intensity end.,(b)What happens if we artificially increase the o

13、utput gamma for stored image pixels?(We can do this in Photoshop.)What is the effect on the image?Answer:Increase the number of bright pixels we increase the number of pixels that map to the upper half of the output range.This creates a lighter image.and incidentally,we also decrease highlight contr

14、ast and increase contrast in the shadows.,Ch5 Fundamental Concepts in Video,1.NTSC video has 525 lines per frame and 63.6 sec per line,with 20 lines per field of vertical retrace and 10.9 sec horizontal retrace.(a)Where does the 63.6 sec come from?Answer:,(b)Which takes more time,horizontal retrace

15、or vertical retrace?How much more time?Answer:,2.Which do you think has less detectable flicker,PAL in Europe or NTSC is North America?Justify your conclusion.Answer:PAL could be better since more lines,but is worse because of fewer frames/sec.,3.Sometimes the signals for television are combined int

16、o fewer than all the parts required for TV transmission.(a)Altogether,how many and what are the signals used for studio broadcast TV?Answer:5R,G,B,audio,sync;can say“blanking”instead,too.,(b)How many and what signals are used in S-Video?What does S-Video stand for?Answer:Luminance+chrominance=2+audi

17、o+sync=4Separated video(c)How many signals are actually broadcast for standard analog TV reception?What kind of video is that called?Answer:1Composite,4.One sometimes hears that the old Betamax format for videotape,which competed with VHS and lost,was actually a better format.How would such a statem

18、ent be justified?Answer:Betamax has more samples per line:500,as opposed to 240.,5.We dont see flicker on a workstation screen when displaying video at NTSC frame rate.Why do you think this might be?Answer:NTSC video is displayed at 30 frames per sec,so flicker is possibly present.Nonetheless,when v

19、ideo is displayed on a workstation screen the video buffer is read and then rendered on the screen at a much higher rate,typically the refresh rate 60 to 90 Hz so no flicker is perceived.,(And in fact most display systems have double buffers,completely removing flicker:since main memory is much fast

20、er than video memory,keep a copy of the screen in main memory and then when we this buffer update is complete,the whole buffer is copied to the video buffer.),6.Digital video uses chroma subsampling.What is the purpose of this?Why is it feasible?Answer:Human vision has less acuity in color vision th

21、an it has in black and whiteone can distinguish close black lines more easily than colored lines,which soon are perceived just a mass without texture as the lines move close to each other.Therefore,it is acceptable perceptually to remove a good deal of color information.In analog,this is accomplishe

22、d in broadcast TV by simply assigning a smaller frequency bandwidth to color than to black and white information.In digital,we“decimate”the color signal by subsampling(typically,averaging nearby pixels).The purpose is to have less information to transmit or store.,7.What are the most salient differe

23、nces between ordinary TV and HDTV?Answer:More pixels,and aspect ratio of 16/9 rather than 4/3.What was the main impetus for the development of HDTV?Immersion“being there”.Good for interactive systems and applications such as virtual reality.,8.What is the advantage of interlaced video?What are some

24、of its problems?Answer:Positive:Reduce flicker.Negative:Introduces serrated edges to moving objects and flickers along horizontal edges.,9.One solution that removes the problems of interlaced video is to de-interlace it.Why can we not just overlay the two fields to obtain a de-interlaced image?Sugge

25、st some simple de-interlacing algorithms that retain information from both fields.Answer:The second field is captured at a later time than the first,creating a temporal shift between the odd and even lines of the image.,The methods used to overcome this are basically two:non-motion compensated and m

26、otion compensated de-interlacing algorithms.The simplest non-motion compensated algorithm is called“Weave”;it performs linear interpolation between the fields to fill in a full,“progressive”,frame.A defect with this method is that moving edges show up with significant serrated lines near them.,A bet

27、ter algorithm is called“Bob”:in this algorithm,one field is discarded and a a full frame is interpolated from a single field.This method generates no motion artifacts(but of course detail is reduced in the resulting progressive image).,In a vertical-temporal(VT)de-interlacer,vertical detail is reduc

28、ed for higher temporal frequencies.Other,non-linear,techniques are also used.Motion compensated de-interlacing performs inter-field motion compensation and then combinesfields so as to maximize the vertical resolution of the image.,Ch6 Basics of Digital Audio,Exercise 1,1.My old Soundblaster card is

29、 an 8bit card.(a)What is it 8 bits of?(b)What is the best SQNR(Signal to Quantization Noise Ratio)it can achieve?,Answer:,2.If a set of ear protectors reduces the noise level by 30 dB,how much do they reduce the intensity(the power)?Answer:A reduction in intensity of 1000.,3.A loss of audio output a

30、t both ends of the audible frequency range is inevitable,due to the frequency response function of an audio amplifier and the medium(e.g.,tape).(a)If the output was 1 volt for frequencies at midrange,what is the output voltage after a loss of 3 dB at 18 kHz?(b)To compensate for the loss,a listener c

31、an adjust the gain(and hence the output)on an equalizer at different frequencies.If the loss remains 3 dB and a gain through the equalizer is 6 dB at 18 kHz,what is the output voltage now?Hint:Assume log102=0.3.,4.Suppose the sampling frequency is 1.5 times the true frequency.What is the alias frequ

32、ency?Answer:0.5 times the True Frequency.,5.In a crowded room,we can still pick out and understand a nearby speakers voice,notwithstanding the fact that general noise levels may be high.This is known as the cocktail-party effect.The way it operates is that our hearing can localize a sound source by

33、taking advantage of the difference in phase between the two signals entering our left and right ears(binaural auditory perception).In mono,we could not hear our neighbors conversation well if the noise level were at all high.State how you think a karaoke machine works.,Hint:The mix for commercial mu

34、sic recordings is such that the“pan”parameter is different going to the left and right channels for each instrument.That is,for an instrument,either the left or right channel is emphasized.How would the singers track timing have to be recorded to make it easy to subtract the sound of the singer(whic

35、h is typically done)?,Answer:For the singer,left and right is always mixed with the exact same pan.This information can be used to subtract out the sound of the singer.To do so,replace the left channel by the difference between the left and the right,and boost the maximum amplitude;and similarly for

36、 the right channel.,6.The dynamic range of a signal V is the ratio of the maximum to the minimum absolute value,expressed in decibels.The dynamic range expected in a signal is to some extent an expression of the signal quality.It also dictates the number of bits per sample needed to reduce the quant

37、ization noise to an acceptable level.For example,we may want to reduce the noise to at least an order of magnitude below Vmin.Suppose the dynamic range for a signal is 60 dB.Can we use 10 bits for this signal?Can we use 16 bits?,7.Suppose the dynamic range of speech in telephony implies a ratio Vmax

38、/Vmin of about 256.Using uniform quantization,how many bits should we use to encode speech to make the quantization noise at least an order of magnitude less than the smallest detectable telephonic sound?Answer:Vmin=Vmax/256.The quantization noise is Vmax=2exp(n),if we use n bits.Therefore to get qu

39、antization noise about a factor of 16 below the minimum sound,we need 12 bits.,8.Perceptual nonuniformity is a general term for describing the nonlinearity of human perception.That is,when a certain parameter of an audio signal varies,humans do not necessarily perceive the difference in proportion t

40、o the amount of change.(a)Briefly describe at least two types of perceptual nonuniformities in human auditory perception.(b)Which one of them does A-law(or-law)attempt to approximate?Why could it improve quantization?,Answer:(a):(1)Logarithmic response to magnitude,(2)different sensitivity to differ

41、ent frequencies,(b):A-law(or-law)approximates the non-linear response to magnitude.It makes better use of the limited number of bits available for each quantized data.,9.Suppose a signal contains tones at 1,10,and 21 kHz and is sampled at the rate 12 kHz(and then processed with an antialiasing filte

42、r limiting output to 6 kHz).What tones are included in the output?Hint:Most of the output consists of aliasing.Answer:1 kHz,12-10=2 kHz,and 2*12-21=3 kHz tones are present.,10.(a)Can a single MIDI message produce more than one note sounding?Answer:No.(b)Is it possible for more than one note to sound

43、 at once on a particular instrument?If so,how is it done in MIDI?Answer:Yes use two NoteOn messages for one channel before the NoteOff message is sent.,(c)Is the Program Change MIDI message a Channel Message?What does this message accomplish?Based on the Program Change message,how many different ins

44、truments are there in General MIDI?Why?Answer:Yes.Replaces patch for a channel.128,since has one data byte,which must be in 0.127.,(d)In general,what are the two main kinds of MIDI messages?In terms of data,what is the main difference between the two types of messages?Within those two categories,lis

45、t the different subtypes.Answer:Channel Messages and System Messages.Channel voice messages,Channelmodemessages,System real-time messages,System common messages,System exclusive messages.Channel messages have a status byte with leading most-significant-bit set,and 4 bits of channel information;Syste

46、m messages have the 4 MSBs set.,11.(a)Give an example(in English,not hex)of a MIDI voice message.Answer:NoteOn(b)Describe the parts of the“assembler”statement for the message.Answer:opcode=Note on;data=note,or key,number;data=“velocity”=loudness.,(c)What does a Program Change message do?Suppose Prog

47、ram change is hex“&HC1.”What does the instruction“&HC103”do?Answer:Changes the patch to#4 on channel 2.,12.In PCM,what is the delay,assuming 8 kHz sampling?Generally,delay is the time penalty associated with any algorithm due to sampling,processing,and analysis.Answer:Since there is no processing as

48、sociated with PCM,the delay is simply the time interval between two samples,and at 8 kHz,this is 0.125 msec.,Ch7 Lossless Compression Algorithms,Exercises 4,1.Suppose eight characters have a distribution A:(1),B:(1),C:(1),D:(2),E:(3),F:(5),G:(5),H:(10).Draw a Huffman tree for this distribution.(Beca

49、use the algorithm may group subtrees with equal probability in a different order,your answer is not strictly unique.),2.(a)What is the entropy()of the image below,where numbers(0,20,50,99)denote the gray-level intensities?,(b)Show step by step how to construct the Huffman tree to encode the above fo

50、ur intensity values in this image.Show the resulting code for each intensity value.(c)What is the average number of bits needed for each pixel,using your Huffman code?How does it compare to?,3.Arithmetic Coding and Huffman Coding are two popular lossless compression methods.(a)What are the advantage

展开阅读全文