
Reindex logger#26610

Closed
mohityadav766 wants to merge 13 commits into main from reindex-logger

Conversation

mohityadav766 (Member) commented Mar 19, 2026

Describe your changes:


I worked on ... because ...

Type of change:

  • Bug fix
  • Improvement
  • New feature
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Documentation

Checklist:

  • I have read the CONTRIBUTING document.
  • My PR title is Fixes <issue-number>: <short explanation>
  • I have commented on my code, particularly in hard-to-understand areas.
  • For JSON Schema changes: I updated the migration scripts or explained why it is not needed.

Summary by Gitar

  • App run logging infrastructure:
    • Added AppRunLogAppender (Logback appender) to capture logs during app execution via MDC or thread name matching
    • Implemented RunLogBuffer with scheduled flushing and stream listener support for real-time log delivery
    • Added dual storage backends: LocalAppRunLogStorage (filesystem) and S3AppRunLogStorage with buffered uploads
  • REST API endpoints:
    • GET /v1/apps/name/{name}/runs/{runTimestamp}/logs — fetch text logs with server filtering
    • GET /v1/apps/name/{name}/runs/{runTimestamp}/logs/download — download as .log file
    • GET /v1/apps/name/{name}/runs/{runTimestamp}/logs/stream — Server-Sent Events stream for live/archived logs
    • GET /v1/apps/name/{name}/runs/{runTimestamp}/logs/servers — list servers with logs for a run
  • Frontend component:
    • Added AppRunTextLogs UI component with run/server selection, live streaming, download, and copy-to-clipboard
  • SearchIndexExecutor stats fixes:
    • Added periodic sink sync every 2 seconds and entity total adjustment when success+failed exceeds initial total

This will update automatically on new commits.

@mohityadav766 mohityadav766 requested a review from a team as a code owner March 19, 2026 19:09
Copilot AI review requested due to automatic review settings March 19, 2026 19:09
@github-actions github-actions bot added the `backend` and `safe to test` labels Mar 19, 2026
Copilot AI (Contributor) left a comment

Pull request overview

This PR adds per-run log capture and UI viewing/downloading for internal Applications (notably SearchIndex/reindex), and refines SearchIndexExecutor stats reporting.

Changes:

  • Backend: Introduces a Logback appender + buffer to capture app-run logs to disk and exposes new AppResource endpoints to fetch/download/stream logs.
  • UI: Adds a new “Logs” tab for internal apps and a log viewer that supports run selection, server selection, SSE streaming for active runs, and downloads.
  • SearchIndex: Improves progress/stat consistency by periodically syncing sink stats and ensuring totals don’t drift below observed success+failed counts.

Reviewed changes

Copilot reviewed 16 out of 16 changed files in this pull request and generated 14 comments.

Show a summary per file
File Description
openmetadata-ui/src/main/resources/ui/src/rest/applicationAPI.ts Adds REST helpers to fetch and download app-run text logs.
openmetadata-ui/src/main/resources/ui/src/constants/constants.ts Adds a socket event constant for app-run logs (currently unused).
openmetadata-ui/src/main/resources/ui/src/components/Settings/Applications/MarketPlaceAppDetails/MarketPlaceAppDetails.interface.ts Adds LOGS tab enum value.
openmetadata-ui/src/main/resources/ui/src/components/Settings/Applications/AppRunTextLogs/AppRunTextLogs.interface.ts Defines props/response typings for the new log viewer.
openmetadata-ui/src/main/resources/ui/src/components/Settings/Applications/AppRunTextLogs/AppRunTextLogs.component.tsx Implements the Logs UI (run/server selection, SSE stream, download).
openmetadata-ui/src/main/resources/ui/src/components/Settings/Applications/AppDetails/AppDetails.component.tsx Adds “Logs” tab for internal, scheduled apps.
openmetadata-service/src/main/resources/logback.xml Registers new APP_RUN_LOG appender on root logger.
openmetadata-service/src/main/java/org/openmetadata/service/apps/logging/RunLogBuffer.java Adds buffering + periodic flushing + stream listener support.
openmetadata-service/src/main/java/org/openmetadata/service/apps/logging/AppRunLogAppender.java Implements Logback appender and log retention/listing utilities.
openmetadata-service/src/main/java/org/openmetadata/service/resources/apps/AppResource.java Adds endpoints to get/download/list/stream app-run logs.
openmetadata-service/src/main/java/org/openmetadata/service/apps/scheduler/OmAppJobListener.java Starts/stops log capture for app runs via MDC + thread prefix matching.
openmetadata-service/src/main/java/org/openmetadata/service/socket/WebSocketManager.java Adds an app-run logs channel constant (currently unused).
openmetadata-service/src/main/java/org/openmetadata/service/apps/bundles/searchIndex/SearchIndexExecutor.java Periodic sink stat syncing + total record consistency adjustments.
openmetadata-service/src/test/java/org/openmetadata/service/apps/logging/RunLogBufferTest.java Adds unit tests for buffering/flushing/line caps.
openmetadata-service/src/test/java/org/openmetadata/service/apps/logging/AppRunLogAppenderTest.java Adds unit tests for appender behaviors (server listing, retention, concurrency).
openmetadata-service/src/test/java/org/openmetadata/service/apps/bundles/searchIndex/SearchIndexStatsTest.java Adds tests for new stats consistency behaviors.

Copilot AI (Contributor) left a comment

Pull request overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 8 comments.

Comment on lines +139 to +146
public boolean exists(String appName, long runTimestamp, String serverId) {
String key = s3Key(appName, runTimestamp, serverId);
try {
s3Client.headObject(HeadObjectRequest.builder().bucket(bucketName).key(key).build());
return true;
} catch (NoSuchKeyException e) {
return false;
}

Copilot AI Mar 20, 2026


exists() only catches NoSuchKeyException, but AWS SDK v2 headObject commonly throws S3Exception with status code 404 for missing keys (depending on client configuration/endpoints). As written, missing objects may surface as uncaught exceptions instead of returning false. Consider catching S3Exception and treating 404/NoSuchKey as non-existent, while rethrowing other errors.
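A minimal sketch of the classification the reviewer suggests. This is not the PR's code: `treatAsMissing` is a hypothetical helper, and the `statusCode`/`errorCode` parameters stand in for AWS SDK v2's `S3Exception.statusCode()` and `awsErrorDetails().errorCode()` so the decision logic can be shown without the SDK dependency.

```java
// Hedged sketch: treat a HEAD failure as "object missing" when the SDK
// surfaces either a 404 status or the NoSuchKey error code; anything else
// should be rethrown by the caller.
public class S3NotFoundCheck {
  static boolean treatAsMissing(int statusCode, String errorCode) {
    return statusCode == 404 || "NoSuchKey".equals(errorCode);
  }
}
```

In `exists()`, this check would sit inside a `catch (S3Exception e)` branch: return `false` when `treatAsMissing(...)` is true, rethrow otherwise.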

Comment on lines +192 to +217
@Override
public void flush() throws IOException {
if (buffer.size() == 0) {
return;
}
byte[] newContent = buffer.toByteArray();
buffer.reset();

byte[] existing = new byte[0];
try {
existing =
client
.getObject(GetObjectRequest.builder().bucket(bucket).key(key).build())
.readAllBytes();
} catch (NoSuchKeyException e) {
// first write
}

byte[] combined = new byte[existing.length + newContent.length];
System.arraycopy(existing, 0, combined, 0, existing.length);
System.arraycopy(newContent, 0, combined, existing.length, newContent.length);

client.putObject(
PutObjectRequest.builder().bucket(bucket).key(key).build(),
RequestBody.fromBytes(combined));
}

Copilot AI Mar 20, 2026


S3AppendOutputStream.flush() reads the entire existing object and rewrites it on every flush to simulate append. For longer runs this becomes increasingly expensive (O(n²) bytes transferred over time) and can significantly increase S3 costs/latency. Consider switching to chunked/object-per-flush storage (then compose on read), multipart uploads, or buffering larger batches and reducing flush frequency to limit read/rewrites.

Comment on lines +799 to +810
try {
while (!Thread.currentThread().isInterrupted()) {
Thread.sleep(5000);
output.write(": heartbeat\n\n".getBytes());
output.flush();
}
} catch (InterruptedException e) {
Thread.currentThread().interrupt();
} finally {
activeBuffer.removeStreamListener(listener);
}
} else {

Copilot AI Mar 20, 2026


The SSE streaming loop for active runs only exits on thread interruption. When a run completes and AppRunLogAppender.stopCapture(...) removes/closes the buffer, this handler will keep sending heartbeats indefinitely and never emits the done event, which can leak request threads and connections. Add a termination condition inside the loop (e.g., break when AppRunLogAppender.getBuffer(name, String.valueOf(runTimestamp)) is null/closed) and send event: done before returning.
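A sketch of the termination condition the reviewer asks for. Assumptions: `pump` is a hypothetical stand-in for the handler's loop body, the `BooleanSupplier` stands in for a liveness check such as `AppRunLogAppender.getBuffer(name, ts) != null`, and output goes to a `StringBuilder` instead of the servlet stream; `maxBeats` exists only to keep the sketch finite.

```java
import java.util.function.BooleanSupplier;

// Hedged sketch: the heartbeat loop exits when the run's buffer goes away
// (runActive turns false) and always emits the SSE "done" event on the way out.
public class SseHeartbeat {
  static String pump(BooleanSupplier runActive, int maxBeats) {
    StringBuilder out = new StringBuilder();
    int beats = 0;
    while (runActive.getAsBoolean() && beats++ < maxBeats
        && !Thread.currentThread().isInterrupted()) {
      out.append(": heartbeat\n\n"); // SSE comment line keeps the connection warm
    }
    out.append("event: done\ndata: \n\n"); // signal completion before returning
    return out.toString();
  }
}
```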

@Parameter(description = "Server ID filter", schema = @Schema(type = "string"))
@QueryParam("serverId")
String serverId) {
repository.getByName(uriInfo, name, repository.getFields("id"));

Copilot AI Mar 20, 2026


These new log endpoints validate the app exists via repository.getByName(...), but they don’t perform any authorization check (and securityContext is otherwise unused). To avoid exposing app run logs to callers without view permission, route through getByNameInternal(...) / getInternal(...) or explicitly call authorizer.authorize(...) with the appropriate VIEW operation before reading/streaming logs.

Suggested change
repository.getByName(uriInfo, name, repository.getFields("id"));
App app = repository.getByName(uriInfo, name, repository.getFields("id"));
authorizer.authorize(
getSubjectContext(securityContext),
new OperationContext(APPLICATION, MetadataOperation.VIEW_BASIC),
app.getEntityReference());

Comment on lines +144 to +155
static final GenericContainer<?> minio =
new GenericContainer<>("minio/minio:latest")
.withCommand("server /data")
.withExposedPorts(9000)
.withEnv("MINIO_ROOT_USER", MINIO_ACCESS_KEY)
.withEnv("MINIO_ROOT_PASSWORD", MINIO_SECRET_KEY)
.waitingFor(
new HttpWaitStrategy()
.forPath("/minio/health/ready")
.forPort(9000)
.withStartupTimeout(Duration.ofMinutes(1)));


Copilot AI Mar 20, 2026


The MinIO Testcontainers image is pinned to latest, which makes the test suite non-deterministic and can break unexpectedly when MinIO releases new versions. Pin to a specific MinIO image tag (and optionally document/centralize the version) for reproducible CI runs.

Comment on lines +269 to +270
responseCode == 400 || responseCode == 500,
"Path traversal should be rejected, got: " + responseCode);

Copilot AI Mar 20, 2026


This test currently treats a 500 response as acceptable for a path traversal attempt. Since the API should reject invalid serverId with a deterministic 4xx (and the resource code throws BadRequestException for invalid server IDs), tighten the assertion to expect 400 so the test will catch regressions that accidentally turn input validation failures into 500s.

Suggested change
responseCode == 400 || responseCode == 500,
"Path traversal should be rejected, got: " + responseCode);
responseCode == 400,
"Invalid serverId should return 400, got: " + responseCode);

Comment on lines +351 to +367
<pre
data-testid="lazy-log"
ref={logContainerRef}
style={{
height: '60vh',
overflow: 'auto',
margin: 0,
padding: '12px',
backgroundColor: '#222',
color: '#fff',
fontSize: '12px',
fontFamily: '"Monaco", "Menlo", "Consolas", monospace',
lineHeight: 1.6,
borderRadius: '4px',
whiteSpace: 'pre',
tabSize: 4,
}}>

Copilot AI Mar 20, 2026


This component renders logs with a raw `<pre>` element and hardcoded inline styles (including hardcoded hex colors). The UI already has a log viewer implementation (@melloware/react-logviewer / LazyLog) and shared styling (e.g., lazy-log-container) used in AppLogsViewer, which also provides search, selectable lines, and consistent theming. Consider reusing the existing log viewer component/styles and moving any styling to the existing LESS/theme tokens instead of inline hex colors.

Comment on lines +104 to +112
void localStorageRejectsPathTraversal() {
LocalAppRunLogStorage storage = new LocalAppRunLogStorage(tempDir.toString());
try {
storage.readLogs("../../etc", 1L, "passwd");
// Should not reach here
assertFalse(true, "Expected IllegalArgumentException");
} catch (IllegalArgumentException e) {
assertTrue(e.getMessage().contains("Invalid path"));
}

Copilot AI Mar 20, 2026


This path traversal test uses a manual try/catch with assertFalse(true, ...) to indicate failure. Use assertThrows(IllegalArgumentException.class, ...) (and then assert on the message) to make the intent clearer and ensure the test fails correctly if the exception is not thrown.

Copilot AI review requested due to automatic review settings March 20, 2026 09:32

gitar-bot bot commented Mar 20, 2026

Code Review: ⚠️ Changes requested (12 resolved / 15 findings)

Reindex logger adds S3 support and fixes multiple log streaming issues, but three important findings remain: formatLine drops exception stack traces, the SSE endpoint loads entire log files into memory, and SSE listeners silently swallow RuntimeExceptions on client disconnect.

⚠️ Bug: formatLine drops exception stack traces from captured logs

📄 openmetadata-service/src/main/java/org/openmetadata/service/apps/logging/AppRunLogAppender.java:90-99

The formatLine method only uses event.getFormattedMessage() and completely ignores event.getThrowableProxy(). In Logback, exception stack traces are stored separately in the throwable proxy, not in the formatted message. This means all LOG.error("...", exception) calls will have their stack traces silently dropped from the captured app run logs, making it very difficult to debug failed runs — which is the primary use case for this feature.

Suggested fix
static String formatLine(ILoggingEvent event) {
    String timestamp = FORMATTER.format(Instant.ofEpochMilli(event.getTimeStamp()));
    StringBuilder sb = new StringBuilder();
    sb.append(String.format("%s [%s] %-5s %s - %s",
        timestamp, event.getThreadName(), event.getLevel(),
        event.getLoggerName(), event.getFormattedMessage()));
    if (event.getThrowableProxy() != null) {
      sb.append("\n");
      sb.append(ch.qos.logback.classic.spi.ThrowableProxyUtil.asString(
          event.getThrowableProxy()));
    }
    return sb.toString();
}
⚠️ Performance: SSE stream endpoint reads entire log file into memory String

📄 openmetadata-service/src/main/java/org/openmetadata/service/resources/apps/AppResource.java:757-766

In streamAppRunTextLogs, storage.readLogs() at line 759 reads the entire log file into a single in-memory String, then splits it by newline to write line-by-line as SSE events. With maxLinesPerRun defaulting to 100,000 lines, this could easily be tens of MB per request. The readLogsStream() method exists specifically for streaming, and is already used correctly in the download endpoint.

Suggested fix
// Replace readLogs + split with streaming read:
if (resolvedServerId != null
    && storage.exists(name, runTimestamp, resolvedServerId)) {
  try (var reader = new java.io.BufferedReader(
      new java.io.InputStreamReader(
          storage.readLogsStream(name, runTimestamp, resolvedServerId),
          java.nio.charset.StandardCharsets.UTF_8))) {
    String line;
    while ((line = reader.readLine()) != null) {
      output.write(("data: " + line + "\n\n")
          .getBytes(java.nio.charset.StandardCharsets.UTF_8));
    }
  }
  output.flush();
}
💡 Edge Case: SSE listener RuntimeException on disconnect swallowed silently

📄 openmetadata-service/src/main/java/org/openmetadata/service/apps/logging/RunLogBuffer.java:156-164 📄 openmetadata-service/src/main/java/org/openmetadata/service/resources/apps/AppResource.java:786-796

In the SSE streaming endpoint, when a client disconnects, the listener's output.write() throws IOException which is wrapped in RuntimeException (line 795). This propagates to RunLogBuffer.notifyListeners() which catches it and removes the listener (line 161-163). However, the exception message logged at DEBUG level says "Stream listener error, removing" — this is expected behavior on disconnect, not an error. More importantly, if a listener throws during notifyListeners, it breaks the loop and subsequent listeners for the same batch won't be notified.

Suggested fix
// In RunLogBuffer.notifyListeners, catch per-listener
// to avoid breaking the loop:
private void notifyListeners(String batchText) {
  List<Consumer<String>> toRemove = new ArrayList<>();
  for (Consumer<String> listener : streamListeners) {
    try {
      listener.accept(batchText);
    } catch (Exception e) {
      LOG.debug("Removing disconnected stream listener");
      toRemove.add(listener);
    }
  }
  streamListeners.removeAll(toRemove);
}
✅ 12 resolved
Security: Path traversal via unsanitized name/serverId in log endpoints

📄 openmetadata-service/src/main/java/org/openmetadata/service/apps/logging/AppRunLogAppender.java:160 📄 openmetadata-service/src/main/java/org/openmetadata/service/resources/apps/AppResource.java:601 📄 openmetadata-service/src/main/java/org/openmetadata/service/resources/apps/AppResource.java:662 📄 openmetadata-service/src/main/java/org/openmetadata/service/resources/apps/AppResource.java:745
The name (path param) and serverId (query param) are passed directly to Paths.get(logDirectory, appName, runTimestamp + "-" + serverId + ".log") without any sanitization. An attacker can supply values like ../../etc/passwd to read arbitrary files on the server. This affects all four new endpoints: getAppRunTextLogs, downloadAppRunTextLogs, streamAppRunTextLogs, and getAppRunLogServers.

While repository.getByName() is called first (which validates the app name exists in the DB), the serverId query parameter has no such validation and can contain path traversal sequences. Even for name, if any app name in the DB contains special characters, it could be exploited.

Fix: Resolve the constructed path to its canonical form and verify it still starts with the expected log directory before performing any I/O.
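The canonical-path check described above can be sketched as follows. `resolveLogPath` is a hypothetical helper name, not the PR's actual method; it normalizes the constructed path and rejects anything that escapes the log root before any I/O happens.

```java
import java.nio.file.Path;

// Hedged sketch: normalize the resolved path and verify it is still under
// the log root directory, rejecting traversal sequences like "../../etc".
public class SafeLogPath {
  static Path resolveLogPath(Path logRoot, String appName, String fileName) {
    Path resolved = logRoot.resolve(appName).resolve(fileName).normalize();
    if (!resolved.startsWith(logRoot.normalize())) {
      throw new IllegalArgumentException("Invalid path: escapes log directory");
    }
    return resolved;
  }
}
```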

Bug: MDC cleanup skipped if jobWasExecuted throws before cleanup

📄 openmetadata-service/src/main/java/org/openmetadata/service/apps/scheduler/OmAppJobListener.java:201
In OmAppJobListener.jobWasExecuted, the MDC cleanup (MDC.remove(...) at lines 207-210) and AppRunLogAppender.stopCapture() (line 205) are inside the try block but not in a finally block. If any code between lines 140-205 throws an exception (e.g., NPE from null runRecord, JSON parsing failure, WebSocket error), the MDC entries leak on the Quartz scheduler thread for its entire lifetime. This could cause subsequent unrelated jobs on the same thread to have their log events incorrectly routed to the wrong buffer, and the RunLogBuffer (including its flusher ScheduledExecutorService) would never be closed — leaking a thread.

Bug: TOCTOU race in RunLogBuffer.append allows exceeding maxLines

📄 openmetadata-service/src/main/java/org/openmetadata/service/apps/logging/RunLogBuffer.java:52
The append method does if (totalLineCount.get() >= maxLines) return then totalLineCount.incrementAndGet() — a classic check-then-act race. Since append is called from the Logback appender on arbitrary application threads concurrently, multiple threads can pass the guard simultaneously and push the line count beyond maxLines. The practical impact is low (a soft cap on log lines slightly exceeded), but if strict enforcement is needed, use compareAndSet or getAndIncrement atomically.
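If strict enforcement is wanted, the check-then-act pair can be replaced with a CAS loop. This is a sketch under assumed names (`LineCap`, `tryReserveLine`), not the PR's `RunLogBuffer` code.

```java
import java.util.concurrent.atomic.AtomicLong;

// Hedged sketch: a CAS loop makes the maxLines guard atomic, so concurrent
// appender threads cannot all pass the check and exceed the cap.
public class LineCap {
  private final AtomicLong totalLineCount = new AtomicLong();
  private final long maxLines;

  LineCap(long maxLines) {
    this.maxLines = maxLines;
  }

  // Returns true only if the caller atomically won a slot below the cap.
  boolean tryReserveLine() {
    long cur;
    do {
      cur = totalLineCount.get();
      if (cur >= maxLines) {
        return false; // cap reached; drop the line
      }
    } while (!totalLineCount.compareAndSet(cur, cur + 1));
    return true;
  }
}
```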

Edge Case: Logback config registers appender twice (XML + programmatic)

📄 openmetadata-service/src/main/resources/logback.xml:26 📄 openmetadata-service/src/main/resources/logback.xml:34 📄 openmetadata-service/src/main/java/org/openmetadata/service/apps/logging/AppRunLogAppender.java:102
The APP_RUN_LOG appender is declared in logback.xml (attached to root logger), but ensureRegistered() in AppRunLogAppender also programmatically creates and attaches a second instance to the root logger. This means every log event is processed by two AppRunLogAppender instances, resulting in duplicate lines in the log buffers. The XML config should either omit the appender-ref for APP_RUN_LOG from the root logger (relying on programmatic registration) or ensureRegistered() should check for existing appenders by name before adding a new one.

Bug: getBuffer called with wrong arity in streamAppRunTextLogs

📄 openmetadata-service/src/main/java/org/openmetadata/service/resources/apps/AppResource.java:770 📄 openmetadata-service/src/main/java/org/openmetadata/service/apps/logging/AppRunLogAppender.java:166
At line 770, AppRunLogAppender.getBuffer(String.valueOf(runTimestamp)) is called with a single argument, but the method signature requires two: getBuffer(String appName, String runTimestamp). This will cause a compilation error, making the entire SSE streaming endpoint non-functional.

The correct call (as used on line 614) should include the name parameter.

...and 7 more resolved from earlier reviews

🤖 Prompt for agents
Code Review: Reindex logger adds S3 support and fixes multiple log streaming issues, but three important findings remain: `formatLine` drops exception stack traces, the SSE endpoint loads entire log files into memory, and SSE listeners silently swallow RuntimeExceptions on client disconnect.

1. ⚠️ Bug: formatLine drops exception stack traces from captured logs
   Files: openmetadata-service/src/main/java/org/openmetadata/service/apps/logging/AppRunLogAppender.java:90-99

   The `formatLine` method only uses `event.getFormattedMessage()` and completely ignores `event.getThrowableProxy()`. In Logback, exception stack traces are stored separately in the throwable proxy, not in the formatted message. This means all `LOG.error("...", exception)` calls will have their stack traces silently dropped from the captured app run logs, making it very difficult to debug failed runs — which is the primary use case for this feature.

   Suggested fix:
   static String formatLine(ILoggingEvent event) {
       String timestamp = FORMATTER.format(Instant.ofEpochMilli(event.getTimeStamp()));
       StringBuilder sb = new StringBuilder();
       sb.append(String.format("%s [%s] %-5s %s - %s",
           timestamp, event.getThreadName(), event.getLevel(),
           event.getLoggerName(), event.getFormattedMessage()));
       if (event.getThrowableProxy() != null) {
         sb.append("\n");
         sb.append(ch.qos.logback.classic.spi.ThrowableProxyUtil.asString(
             event.getThrowableProxy()));
       }
       return sb.toString();
   }

2. ⚠️ Performance: SSE stream endpoint reads entire log file into memory String
   Files: openmetadata-service/src/main/java/org/openmetadata/service/resources/apps/AppResource.java:757-766

   In `streamAppRunTextLogs`, `storage.readLogs()` at line 759 reads the entire log file into a single in-memory String, then splits it by newline to write line-by-line as SSE events. With `maxLinesPerRun` defaulting to 100,000 lines, this could easily be tens of MB per request. The `readLogsStream()` method exists specifically for streaming, and is already used correctly in the download endpoint.

   Suggested fix:
   // Replace readLogs + split with streaming read:
   if (resolvedServerId != null
       && storage.exists(name, runTimestamp, resolvedServerId)) {
     try (var reader = new java.io.BufferedReader(
         new java.io.InputStreamReader(
             storage.readLogsStream(name, runTimestamp, resolvedServerId),
             java.nio.charset.StandardCharsets.UTF_8))) {
       String line;
       while ((line = reader.readLine()) != null) {
         output.write(("data: " + line + "\n\n")
             .getBytes(java.nio.charset.StandardCharsets.UTF_8));
       }
     }
     output.flush();
   }

3. 💡 Edge Case: SSE listener RuntimeException on disconnect swallowed silently
   Files: openmetadata-service/src/main/java/org/openmetadata/service/apps/logging/RunLogBuffer.java:156-164, openmetadata-service/src/main/java/org/openmetadata/service/resources/apps/AppResource.java:786-796

   In the SSE streaming endpoint, when a client disconnects, the listener's `output.write()` throws IOException which is wrapped in RuntimeException (line 795). This propagates to `RunLogBuffer.notifyListeners()` which catches it and removes the listener (line 161-163). However, the exception message logged at DEBUG level says "Stream listener error, removing" — this is expected behavior on disconnect, not an error. More importantly, if a listener throws during `notifyListeners`, it breaks the loop and subsequent listeners for the same batch won't be notified.

   Suggested fix:
   // In RunLogBuffer.notifyListeners, catch per-listener
   // to avoid breaking the loop:
   private void notifyListeners(String batchText) {
     List<Consumer<String>> toRemove = new ArrayList<>();
     for (Consumer<String> listener : streamListeners) {
       try {
         listener.accept(batchText);
       } catch (Exception e) {
         LOG.debug("Removing disconnected stream listener");
         toRemove.add(listener);
       }
     }
     streamListeners.removeAll(toRemove);
   }


Copilot AI (Contributor) left a comment

Pull request overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 6 comments.

@Parameter(description = "Run timestamp", schema = @Schema(type = "number"))
@PathParam("runTimestamp")
Long runTimestamp) {
repository.getByName(uriInfo, name, repository.getFields("id"));

Copilot AI Mar 20, 2026


getAppRunLogServers returns server IDs for a run without any authorization check. Even if the content is just metadata, it can leak operational details; please authorize VIEW access to the App before returning this information.

Suggested change
repository.getByName(uriInfo, name, repository.getFields("id"));
App app = repository.getByName(uriInfo, name, repository.getFields("id"));
SubjectContext subjectContext = getSubjectContext(securityContext);
OperationContext operationContext =
new OperationContext(APPLICATION, MetadataOperation.VIEW_ALL);
authorizer.authorize(subjectContext, operationContext, app.getEntityReference());

Comment on lines +1523 to +1527
long now = System.currentTimeMillis();
if (now - lastSinkSyncTime >= SINK_SYNC_INTERVAL_MS) {
lastSinkSyncTime = now;
syncSinkStatsFromBulkSink();
}

Copilot AI Mar 20, 2026


periodicSyncSinkStats is invoked from multiple consumer threads, but the now - lastSinkSyncTime check and the lastSinkSyncTime = now update are not atomic. Under concurrency, multiple threads can pass the check and trigger syncSinkStatsFromBulkSink() far more often than intended. Use an AtomicLong with CAS (or a synchronized/locked section) to enforce the interval reliably.
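The AtomicLong-with-CAS approach the comment suggests can be sketched like this. `SyncThrottle` and `shouldSync` are hypothetical names; the interval value mirrors the PR's 2-second `SINK_SYNC_INTERVAL_MS` but is just a constructor argument here.

```java
import java.util.concurrent.atomic.AtomicLong;

// Hedged sketch: a single compareAndSet decides which thread performs the
// periodic sync, making the interval check-and-update atomic under concurrency.
public class SyncThrottle {
  private final AtomicLong lastSyncTime = new AtomicLong();
  private final long intervalMs;

  SyncThrottle(long intervalMs) {
    this.intervalMs = intervalMs;
  }

  // Returns true for exactly one caller per elapsed interval.
  boolean shouldSync(long nowMs) {
    long prev = lastSyncTime.get();
    return nowMs - prev >= intervalMs && lastSyncTime.compareAndSet(prev, nowMs);
  }
}
```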


@Container
static final GenericContainer<?> minio =
new GenericContainer<>("minio/minio:latest")

Copilot AI Mar 20, 2026


The MinIO Testcontainers image is pinned to latest, which makes tests non-reproducible and can introduce sudden breakages when the upstream image changes. Pin this to a specific MinIO release tag (and update intentionally when needed).

Suggested change
new GenericContainer<>("minio/minio:latest")
new GenericContainer<>("minio/minio:RELEASE.2024-01-18T21-02-27Z")

@QueryParam("serverId")
String serverId) {
repository.getByName(uriInfo, name, repository.getFields("id"));


Copilot AI Mar 20, 2026


streamAppRunTextLogs streams potentially sensitive log content but does not authorize the request (it only validates the app exists). Please add an explicit authorization check for VIEW access to the App before opening the SSE stream.

Suggested change
// Authorize VIEW access on Apps before streaming potentially sensitive logs
authorizer.authorize(
securityContext, new OperationContext(APPLICATION, MetadataOperation.VIEW));

Comment on lines +784 to +788
java.util.function.Consumer<String> listener =
batchText -> {
try {
for (String logLine : batchText.split("\n")) {
output.write(

Copilot AI Mar 20, 2026


The SSE listener writes to the same output stream that the request thread is also writing to (heartbeats / done events). Because RunLogBuffer notifies listeners from its scheduled flusher thread, this results in concurrent writes to output, which is not thread-safe and can interleave/corrupt the SSE stream or throw sporadic IO errors. Consider funneling all writes through a single thread (e.g., listener enqueues lines into a BlockingQueue that the SSE loop drains) or synchronizing all output writes on a shared lock.
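The BlockingQueue option can be sketched as below. Assumptions: `SingleWriterSse`, `onBatch`, and `drainOnce` are hypothetical names, and a `StringBuilder` stands in for the SSE `OutputStream`; the point is that only one thread ever touches the output.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.TimeUnit;

// Hedged sketch: listener threads only enqueue; the request thread is the
// sole writer, draining the queue and falling back to heartbeats on timeout.
public class SingleWriterSse {
  final BlockingQueue<String> queue = new LinkedBlockingQueue<>();
  final StringBuilder wire = new StringBuilder(); // stands in for the SSE OutputStream

  // Called from the buffer's flusher thread: no direct output writes here.
  void onBatch(String batchText) {
    queue.offer(batchText);
  }

  // Called in a loop on the request thread: the only place output is written.
  void drainOnce(long timeoutMs) {
    String batch;
    try {
      batch = queue.poll(timeoutMs, TimeUnit.MILLISECONDS);
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      return;
    }
    if (batch == null) {
      wire.append(": heartbeat\n\n"); // idle: keep the connection alive
    } else {
      for (String line : batch.split("\n")) {
        wire.append("data: ").append(line).append("\n\n");
      }
    }
  }
}
```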

@Parameter(description = "Server ID filter", schema = @Schema(type = "string"))
@QueryParam("serverId")
String serverId) {
repository.getByName(uriInfo, name, repository.getFields("id"));

Copilot AI Mar 20, 2026


downloadAppRunTextLogs also bypasses the usual authorizer.authorize(...) checks before streaming log content. Please add an explicit authorization check for VIEW access on the App before allowing download.

Suggested change
repository.getByName(uriInfo, name, repository.getFields("id"));
App app = repository.getByName(uriInfo, name, repository.getFields("id"));
SubjectContext subjectContext = getSubjectContext(securityContext);
OperationContext operationContext =
new OperationContext(APPLICATION, MetadataOperation.VIEW_BASIC);
authorizer.authorize(subjectContext, operationContext, getResourceContext(app));

@mohityadav766
Copy link
Member Author

closing this

Comment on lines +90 to +99
static String formatLine(ILoggingEvent event) {
String timestamp = FORMATTER.format(Instant.ofEpochMilli(event.getTimeStamp()));
return String.format(
"%s [%s] %-5s %s - %s",
timestamp,
event.getThreadName(),
event.getLevel(),
event.getLoggerName(),
event.getFormattedMessage());
}

⚠️ Bug: formatLine drops exception stack traces from captured logs

The formatLine method only uses event.getFormattedMessage() and completely ignores event.getThrowableProxy(). In Logback, exception stack traces are stored separately in the throwable proxy, not in the formatted message. This means all LOG.error("...", exception) calls will have their stack traces silently dropped from the captured app run logs, making it very difficult to debug failed runs — which is the primary use case for this feature.

Suggested fix:

static String formatLine(ILoggingEvent event) {
    String timestamp = FORMATTER.format(Instant.ofEpochMilli(event.getTimeStamp()));
    StringBuilder sb = new StringBuilder();
    sb.append(String.format("%s [%s] %-5s %s - %s",
        timestamp, event.getThreadName(), event.getLevel(),
        event.getLoggerName(), event.getFormattedMessage()));
    if (event.getThrowableProxy() != null) {
      sb.append('\n');
      // ThrowableProxyUtil renders the full stack trace, including causes
      sb.append(ch.qos.logback.classic.spi.ThrowableProxyUtil
          .asString(event.getThrowableProxy()));
    }
    return sb.toString();
}


Comment on lines +757 to +766
if (resolvedServerId != null
&& storage.exists(name, runTimestamp, resolvedServerId)) {
String content = storage.readLogs(name, runTimestamp, resolvedServerId);
for (String line : content.split("\n")) {
output.write(
("data: " + line + "\n\n")
.getBytes(java.nio.charset.StandardCharsets.UTF_8));
}
output.flush();
}

⚠️ Performance: SSE stream endpoint reads entire log file into memory String

In streamAppRunTextLogs, storage.readLogs() at line 759 reads the entire log file into a single in-memory String, then splits it by newline to write line-by-line as SSE events. With maxLinesPerRun defaulting to 100,000 lines, this could easily be tens of MB per request. The readLogsStream() method exists specifically for streaming, and is already used correctly in the download endpoint.

Suggested fix:

// Replace readLogs + split with streaming read:
if (resolvedServerId != null
    && storage.exists(name, runTimestamp, resolvedServerId)) {
  try (var reader = new java.io.BufferedReader(
      new java.io.InputStreamReader(
          storage.readLogsStream(name, runTimestamp, resolvedServerId),
          java.nio.charset.StandardCharsets.UTF_8))) {
    String line;
    while ((line = reader.readLine()) != null) {
      output.write(("data: " + line + "\n\n")
          .getBytes(java.nio.charset.StandardCharsets.UTF_8));
    }
  }
  output.flush();
}


Comment on lines +156 to +164
private void notifyListeners(String batchText) {
for (Consumer<String> listener : streamListeners) {
try {
listener.accept(batchText);
} catch (Exception e) {
LOG.debug("Stream listener error, removing: {}", e.getMessage());
streamListeners.remove(listener);
}
}

💡 Edge Case: SSE listener RuntimeException on disconnect swallowed silently

In the SSE streaming endpoint, when a client disconnects, the listener's output.write() throws IOException, which is wrapped in a RuntimeException (line 795). This propagates to RunLogBuffer.notifyListeners(), which catches it and removes the listener (lines 161-163). However, the message logged at DEBUG level says "Stream listener error, removing" — this is expected behavior on disconnect, not an error. More importantly, if a listener throws during notifyListeners, removing it from the collection while iterating can break the loop, so subsequent listeners for the same batch won't be notified.

Suggested fix:

// In RunLogBuffer.notifyListeners, catch per-listener
// to avoid breaking the loop:
private void notifyListeners(String batchText) {
  List<Consumer<String>> toRemove = new ArrayList<>();
  for (Consumer<String> listener : streamListeners) {
    try {
      listener.accept(batchText);
    } catch (Exception e) {
      LOG.debug("Removing disconnected stream listener");
      toRemove.add(listener);
    }
  }
  streamListeners.removeAll(toRemove);
}



Labels

backend, safe to test

2 participants