Ok so the reason karaoke lags is because every single line has a blurred border. Aside from the expense of the blurring itself, creating outline borders like that is very expensive and can't be multithreaded as far as I've been able to tell. The reason why xy-VSFilter might be running better for you is because he's much more aggressive about caching outlines than I am. For me, the additional caching systems I'm adding into threaded-VSFilter is meant to complement subpicture prebuffering, whereas xy needs more extensive and aggressive caching to compensate for the loss of subpicture prebuffering. The biggest advantage to enabling subpicture prebuffering is that it decouples the subtitle rendering from the video decoding (normally, VSFilter is run by the video decoder thread), so that even if VSFilter starts choking on subtitles, playback won't grind to a halt. The reason for that is subtitle rendering gets kicked off into a child thread, separate from the video decoder's thread, when subpicture prebuffering is enabled. The other advantage to subpicture prebuffering is that it allows threaded-VSFilter to render ahead and fill the buffer during periods where it has nothing to do.
Unfortunately, if you're averaging 87% CPU load just decoding the video, threaded-VSFilter might actually cause CPU contention, especially with subpicture prebuffering on. On my machine with a Phenom X4 9950, I don't get any lag, but I can see madVR's render queue dip every time a new line shows on screen, so I can imagine how it would be pretty bad for you. Seems xy-VSFilter might be better just because it won't cause CPU contention with the video decoder.
On the plus side, the script showed some additional functions that would benefit from the new SSE2 alpha blend I wrote a while back so those have been added in. Now the UTW logo should be less crippling to draw (gah, I can't believe that's softsubbed >.<). If you wouldn't mind testing the latest build
with subpicture prebuffering enabled (any buffer value greater than 0), I'd appreciate it.EDIT:
So I tested UTW's Fate Zero episode 5 with the build I linked you on my laptop with the following configuration:
Core Duo T2500 2Ghz
2GB of RAM
ATI Mobility Radeon X1600 256MB
Windows 7 32-bit
LAV v0.37 set to output P010
Subpicture prebuffering set to 10
I didn't get any lag with the OP karaoke at all. In fact, using the VSFilter that came with CCCP didn't lag either. Our systems are pretty similar, although I have an older, but higher clocked CPU, so if you're still lagging, I'm not sure what's going on. EDIT2:
Just tested both CCCP VSFilter and threaded-VSFilter without subpicture prebuffering enabled and both of them were able to playback OP karaoke without any lag so I'm not sure what's going on with your system.