umpcportal home

Tag Archive | "cpu"

Ultra Mobile Video Editing Part 4 – Lenovo S205 / AMD Brazos Video Editing Test


You may recall that I lost a number of days work last week. Test results, screen caps and a whole bunch of text went out the window when I did a factory re-install on the device I was both testing and writing the article on. That will teach me!

The article was Part 4 of the Ultra Mobile Video Editing series and was a detailed look at two Brazos-optimised video editing applications on the Lenovo S205 AMD E-350 device. The results were, in general, quite impressive with both Cyberlink Power Director 9 and Corel VideStudio X4 showing use of the platforms features. In some cases, hardware video decoding and effects rendering was significantly speeded up through use of AMD Brazos features. In some cases, the results weren’t so good. It all depended on the type of output file.

After I lost my work on the reivew I wrote an overview of AMD APP enhancements amd i’ve just updated that with input from AMD and Cyberlink. The key thing to know with the C and E-Series APUs is this AMDs video encoding acceleration subsystem (which used to be called Avivo) which is used by many of the AMD enhanced video editors and converters, is not implemented on the E-350 (or its drivers.) It’s likely to be because it’s actually slower than the CPU but the end result is that there are limits to the enhancements that can be made.You can’t expect General Purpose GPU (GP-GPU) enhancements.

cyberlink pipeline

The diagram above shows the Cyberlink PowerDirector Brazos acceleration process. Note that the encode stage is 100% CPU bound. Actually this Cyberlink diagram is slightly wrong because there are some encode enhancements implemented in specific effects code that has been written to use Open CL/APP by Cyberlink.

videostudiopro

pd screen cap
Corel Video Studio X4 (above) and Cyberlink PowerDirector 9 (bottom) video editing panes.

vs accel optionspd-hw-accel
Click to enlarge. These are the hardware acceleration features in Video Studio (left) and PowerDirector.

Ultra Mobile Video Editing Test Results.

The parameters I’ve set for the project are shown below and you can read about why these parameters have been set here.

  • PC and software to cost less than 600 Euros
  • PC to be less than 1.5KG with 12 inch screen or less.
  • Total camera + PC solution to weigh less than 2KG and cost less than 1000 Euro
  • Source video should be 720p
  • Video sent to YouTube should be 480p minimum
  • Editing solution must include watermarking, overlays, crossfades, and multiple audio tracks.

Testing results. (Summarised from paper notes taken during testing.)

Source file: H.264 720p 8mbps 25fps. Output file H.264 720p 5mbps.  All possible acceleration turned on. System power settings set to ‘always on’ (full power.)

1 Corel VideoStudio X4 managed to do this test in 3.7x real time which, for such a heavy processing job, is impressive. PowerDirector 9 took significantly longer.

2 In this test, the video decoding accelerations and memory transfer accelerations implemented tend to have little effect as the processing is very CPU intensive. Hardware video decoding and memory optimisations start to have a more significant effect where output files are smaller and use a less complex codec. 720p H.264 input and MPEG-2 DVD file output is a good example. Resutls were the same with and without accelerations enabled. Using the AMD System Monitor (V0.91) very little GPU activity was seen.

3 Using Corel VideoStudio, the E-350 CPU was 1.5x faster than the C-50 and 2X faster than an N450 CPU on this specific video encoding test.

4 For this project (480p minimum output size requirement) there wasn’t an output profile on either software suite that supported the required 852×480 output resolution. 852×480 is a favorable resolution for enabling HQ/480p  experience on YouTube.

5 By adding DivX Pro to the Lenovo S205 I was able to create the required output in an AVI container. I was not successful in getting MP3 audio into the container but I’m confident this is possible. Divx Pro is a 19 Euro license. Divx Pro is a similar implementation of MPEG4 to H.264 (MPEG 4 Part 10) The rendering speed was approx 2.2X real time. This was the best result I achieved in all the tests I did and one that proves the AMD E-350 is capable of producing fast results for my specific video editing and upload requirements. Based on this test it appears that Divx Pro is more tunable (for both resolutions and encoding speed) than the H.264 codecs used in these editing suites.

6 I was able to output a 720×576 file with 16:9 aspect ratio (correct when re=sized to 852×480) using Power Director 9. The rendering speed was an impressive 1.8x real time. This option as sub-optimal as some resolution is lost when the rendered file is squashed into a 720-wide frame.

Other notes:

  • The YouTube processes on both Cyberlink PowerDirector 9 and Corel Videostudio X4 don’t support an HQ profile. On Cyberlink, the profiles outpur WMV files which took much longer then H.264 files to output. (Aprox 5.5x real time)
  • VideoStudio took a very long time to start from fresh boot. Over 60 seconds. PowerDirector 9 is about 25 seconds to start up in the same scenario.
  • Both video editing suites were fluid in their editing processes.
  • Power Director has some effects that are implemented in OpenCL for a significant acceleration advantage.

NotebookItalia also did tests (translation) with Corel VideoStudio and got similar results with Corel VideoStudio . We worked together on cross-checking our articles before posting.

Summary

Editing 720p content with these two software packages is quite acceptable for small, in-field projects. Rendering profiles need to be chosen carefully though to get the best out of the system. For my requirements a 720p H.264 file input and a 480p file output for optimal YouTube uploading I found that a combination of Corel VideoStudio and Divx Pro worked best. Hardware and memory acceleration works in this process and with an 8mbps H.264 720p file input and 480p 30fps file output with ‘fastest’ Divx encoding settings and a 2Mbps bitrate. Rendering rate was 2-2.5x real time depending on audio codex setup. (Using high-power ‘always on’ battery settings on the system.) The images below show most of the configuration settings used.

imageimage

imageimageimage

With most of the process being CPU-load though, one wonders what a dual-core N570 CPU would achieve with the same test. I will be looking to get that result added to this report as soon as possible.

Let us all know about your mobile video editing experiences in the comments/discussion below.

Stay tuned for the next part of this series where I will be testing a standard laptop CPU and chipset, possibly an Intel Sandy Bridge system.

AMD C-50 (Ontario) – CPU Performance Round-Up


By now we’re fully aware that the Fusion platform comprising AMD Ontario/Zacate CPU and Radeon 6250 can turn in some impressive 3D performance. With HD video decoding on board too it’s a double-punch to Intel platforms with the Atom CPU but with the high-end E-series requiring power that most mobile computers can’t deliver it’s only the C-series (C-30 single-core and C-50 dual core) that we’re concerned with here and in day-to-day usage mobile computing usage, where the CPU is all-important, we need to find out how it’s going to compare with dual-core Atom CPUs

I have a personal interest in getting as much CPU power as possible in my netbook but all I want to do here is highlight some reports that are coming in via the C-50-based Toshiba NB550D. Overall, it looks like the high-end Atom N550 is still the best performing mobile CPU. Here are the test results we’ve seen so far.

Passmark, a company that collects 3rd-party reports via its own software now has a few reports in from the C-50. The first benchmark was received just a few days ago so be aware that there are only 2 data points so far. The C-50 is clocking in with an average CPU mark of 480. It’s a better score than the older N470 (score:355) but the Atom N550 is averaging a score of 559 16% more.

Eprice have had reports of the NB550d before but on the 25th Jan a new report was posted that included PCMark05 scores. Unfortunately the device used is an engineering test sample with a single-core C-30 APU inside so bear that in mind. The report does links to a Cinebench mark of 1271 for the C-50 CPU, however. For the Atom N550 CPU i’ve found cinebench scores of 1504 and 1461 and 1444 – An average 15% more.

Netbooknews are currently testing an NB550D and has delivered a full suite of test results and a video.

NB550DCM

It’s an interesting Crystal Mark score. The total is about the same as I’ve seen on two Atom N550 based devices but the CPU score is down, especial for the ALU tests. Below is a result I took from an Acer D255. (Atom N550.) I saw similar results on a Samsung N350.

D255 ALU result

Overall then, were seeing the C-50 CPU score lower than the Atom N550 .

It’s not the complete picture of course (Graphics and video decoding on the C-50 APU is in another performance bracket altogether!) but for those wanting to number-crunch on a netbook, these results should help you decide.

With my requirements firmly in the ‘office’ space with a view to some low-end video editing, I’d choose the dual-core Atom-based devices. What about you?

Multi-core superphones could Benefit From Honeycomb.


I don’t know if you’ve seen the pdfs yet but the whitepapers published by Nvidia last week are worth spending an hour going through if you’re interested in ultra mobile and low power computing.

The two pdfs focus on the benefits of high performance graphics and multiple cores in mobile computing. While I’m yet to be convinced that I need 1080p decoding and gaming graphics on my mobile computer, I do see that improved user interfaces and acceleration of some elements of the web page and web application process is beneficial. After reading the reports I’ve also come away with positive thoughts about multicore computing as a way to save battery life. The theory is simple – high clockrates need higher voltages and more power in exponentially rising amounts and so running two cores at a lower clock to complete the same task can result in power savings.

In podcast 63 at Meetmobility, Al Sutton of Funky Android, an Android consulting company, highlighted why he thought Honeycomb would appear on phones. His theory is based on the fact that Honeycomb is the first version of Android to be built with multicore platforms in mind and the supephones will therefore benefit. The Dalvik environment that applications run in is multicore-aware and will attempt to use multiple cores to speed up (and lower the power cost) of jobs that run in it. That feature alone could help every application running on Android without any programming changes in the application. With smartphones heading in the multicore direction, Honeycomb brings advantages and unless there’s a new multicore aware version in the 2.x branch, Honeycomb could be the way to go for multicore smartphones.

So why don’t silicon experts Intel use multiple cores in their Moorestown platform? The platform runs up to 1.8Ghz I understand so wouldn’t it be better to run 2 cores at, say, 1Ghz? Cost of silicon, size and complexity are probably in the equation and there’s probably a marketing advantage in using a higher clockrate but you would think that if this theory of more cores x lower clock=less power is true, Intel would be doing it too considering how badly they want to get into smartphones. Perhaps it is because much of the software out there isn’t truly multi-threading enabled and the advantages are limited. Where a program runs on multiple cores at a lower clockrate but only utilises one it means that the operation takes longer to run and the system can’t get into an idle state as quickly and the power used is way higher. Just leaving a wifi and screen on for a small extra time will negate any potential advantage.

It’s complex stuff but my feeling right now is that multiple cores are going to bring advantages. We’ll see, in time, if the Honeycomb-for-superphones theory is correct and we’ll see if Intel goes that route for Moorestown and Medfield too.

Is Tunnel Creek the new Consumer Handheld CPU from Intel? No. (Better Things Are Coming)


A few days ago I did some research and analysis on the new ‘Queensbay’ platform from Intel It’s a highly integrated 2-chip system comprising Tunnel Creek processing platform and I/O chip that takes the platform used on many current MIDs and UMPCs and optimizes it in a similar way that Pinetrail did for netbooks. Despite some improvements in size and graphics power and a hint that it might be capable of some cool consumer and media devices, it’s not the Apple A4 / Tegra 2 competitor that I’m guessing will get referenced in articles today. That job lies with Moorestown and I’ll tell you more about that later today on Carrypad. Update: Intel Moorestown article now available.

Tunnel Creek was formerly announced today. See Intel Press Release.
More detailed information has already been made available. See below.

Tunnel Creek integrates a lot of activity on one die and offers a doubling of GPU power, an increase in memory bus speed, an open PCIe bus Southbridge architecture that allows for third party chipsets, a SATA storage interface and a promise of a lower bill of materials. It is possible to build some nice little handhelds out of it and one could imagine interesting tablets with Nvidia Ion on board but later today, we’ll hear more about something that’s been build ground-up for handheld tablets, mids, smartphones, active remote controls and of course, the ‘iPad killers.’

I’ll write about Moorestown later today but for the time being, here are some schematics for Tunnel Creek; the first showing the differences between Menlow (I’m using a PC based on Menlow right now) and Tunnel Creek and the last slide showing Tunnel Creek vs Menlow in a mediaphone scenario.

quuensbay-1

tunnecreek-mediaphone

tunnelcreek-graphics

There are a couple of things there that I didn’t mention in my previous article. Number 1 hardware accelerated video ENCODE. 2) Audio DSP functions. This leads me to believe that Tunnel Creek is in fact a version of Lincroft, the processing unit used in the Moorestown platform. We’ll talk more about that later.

For the Intel IDF presentation on Tunnel Creek (from which the above slides were taken) see the IDF 2010 Beijing Content Catalogue and search for ‘Tunnel Creek’ (Unfortunately I can’t link direct as the catalogue generates one-time URLs.)

For a primer on Moorestown, see the links below. More detailed information on Moorestown architecture is expected from IDF later today.

Moorestown Digging a little deeper.

Intel’s Moorestown Platform. From Smartphone Through Smartbook and Beyond.

ARM Products for MIDs in 2010.


The starting point for a mobile internet device design is to look at the processing platforms available in the market. I’ve spent the last few days researching and reviewing the ARM-based CPUs and platforms and put together a full report that you can find over at UMPCPortal.

If you’re tracking news about MID and ‘smart devices’ from CES next week, keep this as a reference.

ARM Products and Platforms Primer and Resource List for Mobile Internet Devices in 2010. | UMPCPortal – Ultra Mobile Personal Computing.

Moorestown and Pinetrail graphics core thoughts. GMA500 likely.


graphic One of the things I expected to find out at the Intel Developer Forum this year were details about Pinetrail and Moorestown graphics. GMA950 on the current netbook platform clearly needs a boost in the video codec department and if Moblin 2 is only going to support Moorestown and Pinetrail along with the current generation of netbooks, it makes sense that they have the same graphics core right? It fits perfectly with Intel’s ‘continuum’ of devices on the Atom platform. Unfortunately the information on the graphics was limited to notes about OpenGL 2.0 ES and a mention of a PowerVR core in a ‘Sodaville’ Atom-based media processor presentation (image right.)

During a session at IDF this week though, I had fairly concrete confirmation from people in-the-know that the graphics on the Moorestown platform would be a GMA500 (PowerVR SGX core) as you’ll find in the Menlow (Poulsbo chipset as seen on the Asus T91, Viliv X70 and other mobile and long battery-life-focused solutions.) I was a little surprised that it’s the GMA500 but have no reason to disbelieve the info I was given.

Dovetailing nicely with that information though is continuing speculation that Pineview, the CPU+GPU on the Pinetrail netbook platform is also going to use the GMA500. The original info comes from a June article by HKEPC but LinuxDevices seem fairly confident that it is in fact a GMA500 core in Pinetrail.

Its looking like we’re going to have a very closely-matched range of platforms come mid-2010 then.

  • Pinetrail Netbook platform. Atom 1.66Ghz + GMA500 GPU
  • Menlow MID platform. Atom ranging from 800Mhz-2.0Ghz + GMA 500 GPU (with PowerVR SGX core)
  • Moorestown Atom CPU (clockrate unknown) + GMA 500 GPU (With PowerVR SGX core)
  • Sodaville Atom  CPU + PowerVR SGX core.

Note: In each case the graphics may be clocked at different speeds ranging from 133 400Mhz. On current devices we see a 133Mhz graphics clock.

The exciting thing about this is that everything is aligned well to keep it simple for developers and there’s just one operating system that will sit on top of all these to provide optimised kernel, drivers, SDK and app-store. Moblin. From smartphones to netbooks through set-top boxes, PNDs, PMPs, Web tablets and more. One platform for developers that covers, in the 2010-2012 timeframe, an addressable market of over 400 million units, in just the mobile internet device and smartphone segment. Add a few hundred million on top of that for netbooks!

Image2

Intel’s ‘Continuum’ is starting to come together.

Improve your UMPC performance….for free.


Loox U with desktop screen

I’m sitting here in front of a huge 1920×1080 screen with a 1280×800 screen as an extended display. Windows 7 is running and I’ve got 10 Firefox tabs open, Windows Media player, Tweetdeck, Windows Live Photo Gallery, Windows Live Mesh, Windows Paint and Windows Live Writer running. (Love those Windows Live apps!) It’s all running smoothly on the tiny Fujitsu Loox U (U820/U2010) in 1GB RAM.

The last time I was able to do this was with the 1.8Ghz version of  the OQO 2+  but it wasn’t as smooth as this. Why?

1 OS

2 Fast SSD

3 NoScript

Point 1. Windows 7 is better than Vista. No argument.  The second point is also well known. A fast SSD helps with program and file access. It also helps with swap files when, in situations like this, you’ve used up all your memory. I’ll talk more about the (awesome) Runcore Pro IV that i’ve got installed, in another post. (Hint: 80MB+ max read speed)

The last point is something I’ve talked about before but can now highlight in a very very simple way.

With 10 tabs running on Firefox, the chances are that you’re using a few heavy ajax or flash-based sites. It should be no secret that web browsing is one of the most CPU intensive tasks you can do on a device and even if you’ve got windows minimised, it’s still using the CPU in the background.

Enter Noscript.

I’ve used Noscript in the past to optimise my browsing experience and there are other, more scientific tests that highlight the advantages but today, because of the dual screen setup I have here, the effect is extremely pronounced.These two CPU graphs taken over about 2 minutes of browsing, show the difference.

Before. Browsing websites. Hitting CPU limits. You can see the typical heartbeat of a flash animation.

beforenoscript

After. Browsing websites. CPU not hitting limits. Average utilization is much much less and that heartbeat has gone. A few more processes running in this test too.

afternoscript

Side-by-side view:

 beforenoscript (2) afternoscript (2)

The difference is huge, very noticeable and within 2 minutes of installing NoScript, the fan turned off. It’s firing up every now and again but it’s not pegged on like it was before. As I type this I have 12 tabs open, the Firefox process is averaging 4.5%. I’ve done tests like this in the past and seen the CPU averaging 15-20%.

Bloggers and advertisers will hate you for it but if you’re using a UMPC, it’s one of the best CPU/Battery life/heat/noise savers there is out there. And it’s free.

Pause when minimised.

There’s something else that can be learned from this. When using web-based applications, there is no such thing as a device in standby. ARM and Intel would do well to encourage desktop browser developers to enable an optional ‘pause when minimised’ feature (there’s a reason that the iPhone doesn’t multi-task) . It will have a huge effect on the mobile web experience. If it saves as much as I’ve just seen it would be more significant for the mobile web than a couple of years of technology development. I vote for Opera 11 to have this feature. Combined with ‘Turbo’ it would make Opera the best browser for mobile computers.

MIDs, UMPCs, Smartbooks. Where Are We on the Timeline?


We’ve been talking about the area of mobile internet-connected devices between the smartphone and notebook for a long time now. Many companies have tried with many many different designs but in reality, none of them have been a success. OQO, Flipstart, Wibrain and Raon Digital are proof that it’s a market where many are going to fail before a winner comes along but what is it going to take to make that winner and how close are we to it? Where are we on the Mobile Internet Timeline?

There are three components to a successful mobile internet product. Hardware (including technology and design), software (including UI, apps and community) and the magic pixie dust that is the combination of pricing, branding, distribution and marketing needed to bring the device to the people in just the right way. Unfortunately I don’t know enough about marketing to be able to comment (apart from the fact that I think that certain marketing teams are easily capable of making it happen today) but my experiences with ultra mobile devices and customers over the last 3 years gives me some idea about what’s needed for the hardware and software components.

One of the keywords that I keep coming back to in this segment is the word ‘microblogging.’ In my mind, microblogging is the word that connects to all the elements that go to make up a good mobile internet device. I’ve listed those below.

  • Portability and form factor
  • Always-on, connected and fast in use
  • Content, image and video creation, optimisation and playback
  • Fully web capable
  • Position awareness
  • Local storage
  • Capable of multiple communication methods
  • Flexible, reliable and up-to-date software set
  • Pervasive and low-cost mobile internet connectivity.

I talk a bit more here about my own ideal microblogging device and the markets it addresses but let’s focus on a more generic level here.

Of all the items listed above, the first two are the biggest issues. These are the ones that are limited by current technology and have prevented engineers from creating what could be the ideal device so lets take a closer look at these.

Portability

mid-d-a It goes without saying; a mobile internet device needs to be portable but how portable is portable? Through my conversations with thousands of mobile computing and smartphone fans I’ve learned that it can be vastly different to many people although it largely centers around ‘the bag.’ If you’ve got a bag with you all the time (as I do, I hate carrying things in my pockets) then portability goes all the way up to 7 inch screens and nearly 2lb in weight. For those that are looking for a jacket pocket solution, forget anything over 4.8 inch and 1lb. One important thing to note though is that the smaller the device is, the easier it is to hold in one or two hands but that it gets harder to display information. Pixel density can only go so high before websites become unreadable and need to be zoomed. 4-5 inch, 250gm, 15mm thick and an 800×480 screen is going to be the target in my opinion for the next few years. How far away from that are we? We’re there already. Devices have already been designed with both X86 and ARM cores that achieve this size. [Example]

Form Factor Style. Keyboard. Screen.

There’s no real answer here expect to say that most styles and form-factors in most materials are possible today. A keyboard creates a thickness problem and a folding screen option would be nice for the industrial designers but I’m confident that given current industrial design knowledge, materials, skills and production technologies, almost anything is possible today. Fasion changes but given economies of scale, anything should be possible.

n810hsdpaallday Always-on and connected

Being always connected means not having to charge a device for a whole day of about 15 hours. This has been possible with mobile phones for along time but we’re talking about a different level of ‘always connected’ with a mobile internet device. This is not a device that will sit idle. Background software will be checking emails, waiting for instant messages, polling social networks, processing GPS signals, updating locations, checking for software updates, playing music, checking accelerometers and ambient light levels, scheduling alarms and the device will probably be in-use, with the screen backlight on, every 15 minutes during the day. Idle devices are a thing of the past and that’s where the technology challenge comes in. Running these scenarios on a phone architecture results in a dead battery within hours. I know because i’ve tested it!

Always-on Internet applications increases the average power requirement of a smartphone by a minimum of 1000% and up to 3000%

gtdbattThe answer here is not to use the lowest power CPU but to use a very closely coupled hardware and software layer that can schedule events at the right times and make sure the device sleeps for as long as possible. In general, low power devices take longer to perform general purpose processing tasks so the power advantages are outweighed by the need to keep screens and radios active. In general, a device that can perform tasks quicker in processing ‘windows’ and then resume to a standby state is probably going to have the advantage. [See this article for more thoughts and view the image on the left.] How far away from that are we? Quite far. We’re in a very early stage of mobile internet software development and unless a golden bullet comes along in the form of new battery technology, the problem will need to be worked on for some time before true all-day mobile internet ‘computing’ is possible.

Fast in use.

Getting a task completely quickly and efficiently means better user satisfaction and more productivity. The days of waiting 20 seconds for a web page to load on a smartphone are over and we need to be looking towards the sub 10-second page load. Ultimately, 5 seconds for every page on the internet. Not only that but applications need to start instantaneously and user interfaces need to react in a physical way in order for the device to become more a part of the user. How far away from that are we? Quite far. The new generations of smartphone are improving very well and reaching the 15 second average page load level along with having great almost instantaneous interfaces but its still the exception. UMPCs and netbooks are into the sub 10-second level for web page processing but still struggle to meet the battery life requirements when doing so. In general we need about 4x efficiency improvement from the X86 based devices and about 4 x processing power improvements in the ARM-based devices. Both sides are moving very quickly towards it but we’re a couple of years away from what I would call a thrilling experience.

Pervasive 3G networks

As for the rest of the list, we’re pretty much there now. Location, storage, video and software is available. It’s just a matter of sticking it al together in the right way and focusing on the limitations above.

The dream of that ultimate mobile internet communications device is spread wide across potential consumer base and the industry itself but we’ve still got a long way to go. 2009 is an early point on the timeline where although devices are possible, customers are burdened with issues. Be it battery life or slow web processing, heavy form factors or tiny screens. The good news is that everyone working in the industry is already working on those problems and we’re now only a short few years away from seeing devices that satisfy everyone.

Follow Chippy on  TwitterFollow Chippy on  YouTube

Popular mobile computers on UMPCPortal

Acer C740
11.6" Intel Celeron 3205U
Acer Aspire Switch 10
10.1" Intel Atom Z3745
HP Elitebook 820 G2
12.5" Intel Core i5 5300U
Acer Aspire E11 ES1
11.6" Intel Celeron N2840
Acer C720 Chromebook
11.6" Intel Celeron 2955U
ASUS Zenbook UX305
13.3" Intel Core M 5Y10a
Dell Latitude E7440
14" Intel Core i5-4200U
Lenovo Thinkpad X220
12.5" Intel Core i5
Acer Chromebook 11 CB3-131
11.6" Intel Celeron N2807
Lenovo Ideapad Flex 10
10.1" Intel Celeron N2806

Find ultra mobile PCs, Ultrabooks, Netbooks and UMPCs quickly using the following links: