Uploaded image for project: 'Opencast'
  1. MH-13546

Textanalysis shouldn't create tons of copies of the source video

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed and reviewed
    • Affects versions: 6.0
    • Fix versions: 6.5
    • Components: Backend Software
    • Labels:
      None
    • Severity:
      Incorrectly Functioning Without Workaround
    • Steps to reproduce:
      Hide
      Background:
      The textanalysis workflow operation extract an image from the source video for each video segment. Then the tesseract is called to extract text from this image files. And so on…

      Issue:
      Opencast create for each video segment an image extraction job. Each job call workspace.get(URI..., unique: true). This create a copy of the source video. If you run an textanalysis for 100 segments, you will end up copying the video file 100 times.

      Solution:
      You can pass an array of timestamps to the image extraction job. This will create only one job for extracting all image files at once.
      Show
      Background: The textanalysis workflow operation extract an image from the source video for each video segment. Then the tesseract is called to extract text from this image files. And so on… Issue: Opencast create for each video segment an image extraction job. Each job call workspace.get(URI..., unique: true). This create a copy of the source video. If you run an textanalysis for 100 segments, you will end up copying the video file 100 times. Solution: You can pass an array of timestamps to the image extraction job. This will create only one job for extracting all image files at once.

      TestRail: Results

        Attachments

          Activity

            People

            • Assignee:
              waldemarsmirnow Waldemar Smirnow
              Reporter:
              waldemarsmirnow Waldemar Smirnow
            • Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                TestRail: Cases