* Log large packet receive/send.
Seeing cases of servers reporting the need for segmentation/re-assembly
of packets. So, log packet receive/send for RTP/RTCP to check whether
anything is seeing packets larger than 1400 bytes.
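A minimal sketch of the kind of check this adds (helper name and
placement hypothetical; the real logging lives in the RTP/RTCP
receive/send paths):
    package sketch

    import "log"

    // maxSafePacketSize approximates the MTU budget below which UDP
    // packets typically avoid IP fragmentation.
    const maxSafePacketSize = 1400

    // logIfLarge is a hypothetical helper: log any RTP/RTCP packet
    // that exceeds the budget, with enough context to find the source.
    func logIfLarge(direction, kind string, pkt []byte) {
        if len(pkt) > maxSafePacketSize {
            log.Printf("large %s packet on %s: %d bytes", kind, direction, len(pkt))
        }
    }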
* log downtrack RTCP too
This patch updates the check for auto creating rooms to also
consider the RoomCreate grant per token instead of just the
global config option.
With this patch, applications can decide on their own whether, and for
which users, rooms may be auto-created. This allows applications that
rely on auto creation (saving an API call) to co-exist with those that
might want to mint tokens for subscribe-only users.
Specifically, LaSuite Meet relies on the auto-create behavior; however,
enabling the global config option would make a MatrixRTC deployment
vulnerable to abuse, as users on remote homeservers get tokens in order
to subscribe.
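A minimal sketch of the adjusted check, with type and field names
assumed rather than taken from the actual config/grant code:
    package sketch

    // VideoGrant mirrors the relevant token grant (hypothetical subset).
    type VideoGrant struct {
        RoomCreate bool
    }

    // canAutoCreate: a room may be auto-created when the global config
    // option is enabled OR when the requesting token carries RoomCreate.
    func canAutoCreate(globalAutoCreate bool, grant *VideoGrant) bool {
        return globalAutoCreate || (grant != nil && grant.RoomCreate)
    }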
* add AssignmentHook to AssignJob; propagate websocket write errors
- Replace the `url *string` parameter on `Worker.AssignJob` with a
middleware-style `AssignmentHook` so callers can intercept the
`JobAssignment` send (e.g. to set Url, or to gate hedged attempts so
only one assignment is written; see the sketch below).
- Remove the `sendRequest` helper. Inline `WriteServerMessage` and
propagate the error: `AssignJob` returns immediately on a failed
availability or assignment write, leaving the job out of
`runningJobs`; `TerminateJob` still updates local bookkeeping when
the wire write fails but surfaces the write error to the caller.
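The exact hook signature isn't shown here, but per the description it
would look something like:
    package sketch

    // JobAssignment stands in for the protocol message (subset).
    type JobAssignment struct {
        Url string
    }

    // AssignmentHook wraps the underlying wire write: a caller can
    // mutate the assignment (e.g. set Url) or skip the write entirely
    // (e.g. so only one of several hedged attempts is assigned).
    type AssignmentHook func(a *JobAssignment, send func(*JobAssignment) error) error

    // passthrough is the no-op hook: just perform the write.
    func passthrough(a *JobAssignment, send func(*JobAssignment) error) error {
        return send(a)
    }
Passing the write as a function keeps the hook middleware-style: the
caller decides whether and when the assignment actually hits the wire.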
* tidy
* Add TURN permission handler.
- Deny permissions to private/link-local/multicast and internal IPs
- Add a configurable list of additional CIDRs to deny permissions to
(sketched below)
* unused
* add config for allowing private IPs, used in testing
* add a TTL to the username and use it for auth
* allow list for restricted peer CIDRs
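Taken together, these changes amount to a permission check roughly like
this sketch (names, config plumbing, and the example CIDR are
assumptions):
    package sketch

    import "net/netip"

    // restrictedCIDRs is the configurable deny list; the entry here is
    // a placeholder.
    var restrictedCIDRs = []netip.Prefix{
        netip.MustParsePrefix("100.64.0.0/10"),
    }

    // permitPeer denies TURN permissions to private, link-local,
    // loopback and multicast peers (unless allowPrivate is set, as in
    // testing), plus anything in the configured CIDR deny list.
    func permitPeer(addr netip.Addr, allowPrivate bool) bool {
        if !allowPrivate && (addr.IsPrivate() || addr.IsLoopback() ||
            addr.IsLinkLocalUnicast() || addr.IsLinkLocalMulticast() ||
            addr.IsMulticast()) {
            return false
        }
        for _, p := range restrictedCIDRs {
            if p.Contains(addr) {
                return false
            }
        }
        return true
    }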
* test: verify upstream and downstream connection stats
Adds TestConnectionStats integration test where two clients connect,
each publishes audio + video, and the test asserts that both
publisher-side (LocalMediaTrack.GetTrackStats) and subscriber-side
(DownTrack.GetTrackStats) report non-zero packets and bytes.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
* sfu: add DownTrack.OnStatsUpdate hook and use it in stats test
Adds a public OnStatsUpdate setter on DownTrack mirroring the existing
pattern on WebRTCReceiver. The new callback fires alongside the
configured DownTrackListener (production path is unaffected) and is
intended for tests/observers to validate the AnalyticsStat data flowing
through the listener.
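The setter presumably looks something like this (types reduced to a
minimal subset):
    package sketch

    // AnalyticsStat stands in for livekit.AnalyticsStat.
    type AnalyticsStat struct{}

    type DownTrack struct {
        onStatsUpdate func(dt *DownTrack, stat *AnalyticsStat)
    }

    // OnStatsUpdate registers a test/observer callback that fires
    // alongside the configured DownTrackListener, leaving the
    // production path untouched.
    func (d *DownTrack) OnStatsUpdate(fn func(dt *DownTrack, stat *AnalyticsStat)) {
        d.onStatsUpdate = fn
    }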
Augments TestConnectionStats to:
- hook WebRTCReceiver.OnStatsUpdate for each published track and assert
the captured AnalyticsStat has non-zero packets/bytes (upstream).
- hook the new DownTrack.OnStatsUpdate for each subscribed track and
make the same assertion (downstream).
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
The media loss proxy is not used, so it is okay, but it was an
unintentional delete in https://github.com/livekit/livekit/pull/3252/changes
Was checking the code due to a report that the way Chrome 147+ does RTCP
reports breaks a few services. Don't think that affects LK, but found
this while reading the code.
* Fix publish-only limitations being incorrectly applied receive-side in a single PC
* `StaticConfigurations` disabled some codecs for publish only, which worked in dual PC
* In single PC, the server incorrectly disabled these codecs in both directions
* Dual PC mode is unchanged; single PC handles per-direction filtering correctly
* Filter recv-side codecs to the publish list in the single-PC SDP answer (sketched below)
* Confirm H264 is present in offer in test
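A sketch of the per-direction filtering (names assumed; the real
publish list comes from `StaticConfigurations`):
    package sketch

    // publishCodecs holds the MIME types enabled for publishing.
    var publishCodecs = map[string]bool{
        "video/VP8":  true,
        "video/H264": true,
    }

    // filterRecvCodecs restricts only the recv direction of the answer
    // to the publish list, rather than disabling the codec in both
    // directions.
    func filterRecvCodecs(offered []string) []string {
        kept := make([]string, 0, len(offered))
        for _, mime := range offered {
            if publishCodecs[mime] {
                kept = append(kept, mime)
            }
        }
        return kept
    }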
* Legacy TrackInfo.Simulcast flag.
When AddTrack did not send SimulcastCodecs, the legacy `Simulcast` flag
was not set. Fix it by setting the flag when a second layer is
published.
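In sketch form, assuming the proto field name from the description:
    package sketch

    // TrackInfo mirrors the relevant proto fields (subset).
    type TrackInfo struct {
        Simulcast bool
    }

    // onLayerPublished sets the legacy flag once a second layer is
    // published for a track whose AddTrack carried no SimulcastCodecs.
    func onLayerPublished(ti *TrackInfo, publishedLayers int) {
        if publishedLayers > 1 {
            ti.Simulcast = true
        }
    }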
* staticcheck
* use the existing PrimaryReceiver function
- prevent some escapes to the heap
- avoid copying by using a ring buffer for receiver reports (probably
should be removed, as this is for debugging only and the data so far has
shown nothing more than clients sending bad data; sketched below)
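A sketch of the ring-buffer idea (size and fields are placeholders):
    package sketch

    // receiverReport's fields are elided; the point is that entries
    // are written in place instead of being copied into a growing
    // slice.
    type receiverReport struct{}

    type rrRing struct {
        buf  [32]receiverReport
        next int
    }

    // slot hands out a pointer to the next entry, overwriting the
    // oldest, so callers fill reports in place rather than appending
    // copies to a slice.
    func (r *rrRing) slot() *receiverReport {
        s := &r.buf[r.next]
        r.next = (r.next + 1) % len(r.buf)
        return s
    }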
* Close peer connection unconditionally to unblock set local/remote
description operations.
Have been chasing a leak where participants have a lot of connectivity
issues and analysed a goref with Claude. Output below.
Jo Turk quickly patched sctp for the reported issue -
https://github.com/pion/sctp/pull/465.
This PR moves the peer connection close to before waiting for the events
queue to be drained, as the events queue could be blocked on a hanging
`SetLocal/RemoteDescription`.
The scenario is a bit far-fetched as a lot of things have to happen, but
it does point to a scenario where things could hang. Remains to be seen
if this helps. Note that closing the peer connection early means the
contained objects (like data channels) may all be closed as part of the
peer connection close. But the explicit clean-up path is kept (it should
effectively become a no-op) to minimise changes.
------------------------------------------------------------------
The wedge is in pion/sctp's blocking-write gate, called synchronously from inside the PC's operations queue. Five things have to be true at the same time, and on this build they all are:
1. SCTPTransport.Start is synchronous in the SetRemoteDescription op
The stuck stack:
    PeerConnection.SetRemoteDescription.func2 (peerconnection.go:1363)
      → startRTP → startSCTP
      → SCTPTransport.Start (sctptransport.go:141)
      → DataChannel.open (datachannel.go:178)
      → datachannel.Dial → Client → Stream.WriteSCTP
      → Association.sendPayloadData (association.go:3141) ← blocks here
SCTPTransport.Start synchronously sends the DCEP "OPEN" for each pre-negotiated channel. The operations.start goroutine runs SetRemoteDescription's logic; it does not return until Start does.
2. The wait has no deadline
Stream.WriteSCTP (stream.go:289) calls sendPayloadData(s.writeDeadline, ...). s.writeDeadline is the default zero-value deadline.Deadline — never armed, because DataChannel.Dial doesn't call Stream.SetWriteDeadline. So the <-ctx.Done() arm of the wait select can
never fire.
3. EnableDataChannelBlockWrite(true) puts SCTP into a serialized-write gate
At livekit-server/pkg/rtc/transport.go:362 livekit calls se.EnableDataChannelBlockWrite(true). That flips the sendPayloadData path to:
// association.go:3138-3148
if a.blockWrite {
    for a.writePending {
        a.lock.Unlock()
        select {
        case <-ctx.Done():    // never (no deadline)
        case <-a.writeNotify: // only fires when writeLoop fully drains pendingQueue
        }
        a.lock.Lock()
    }
    a.writePending = true
}
4. writeNotify only fires after the writeLoop drains everything
The only place notifyBlockWritable is called is gatherOutbound (association.go:3085-3088), and only when len(chunks) > 0 && a.pendingQueue.size() == 0 — i.e., the writeLoop actually managed to move all pending chunks to inflight. If cwnd is full and SACKs stop
arriving, the writeLoop wakes up, sees zero room, sends nothing, and writePending stays true.
5. There is no association-level abort timer for data writes
At association.go:764:
assoc.t3RTX = newRTXTimer(timerT3RTX, assoc, noMaxRetrans, rtoMax)
noMaxRetrans means the retransmission timer never gives up. INIT has maxInitRetrans, but data does not. There is no equivalent of TCP's tcp_retries2 → ETIMEDOUT → ABORT. So once the path is dead post-handshake, t3RTX keeps firing into the void and the association
never transitions out of established on its own.
What it takes to wake it up
Only an external close: somebody has to terminate the underlying DTLS conn (which makes Association.readLoop's netConn.Read fail, which closes closeWriteLoopCh, which lets timerLoop exit). But — and this is the kicker — readLoop's defer at association.go:976-996
closes everything except it does not call notifyBlockWritable. So even if readLoop unwinds, any goroutine parked on <-a.writeNotify stays parked unless it was watching ctx (which here it isn't).
So the trigger sequence on this pod was almost certainly:
1. Peer establishes ICE+DTLS+SCTP, association goes established.
2. Peer disappears (ICE silently fails, NAT rebinding, OS sleep, kill -9, etc.).
3. The first DCEP-OPEN for one of livekit's pre-negotiated channels is queued; cwnd never opens because no SACKs return.
4. writePending is now true for the lifetime of the process, with no deadline, no ctx, no kill.
5. The PC's operations queue is wedged, SetRemoteDescription never returns, livekit-server's handleRemoteOfferReceived event handler is parked, the participant is never torn down, and the SCTP timerLoop pins the entire participant graph in memory until OOM-kill.
Realistic fixes (in order of how clean they are)
1. Upstream: in pion/sctp, broadcast notifyBlockWritable() (or close writeNotify) inside readLoop's defer cleanup, so a closed association unblocks any pending writers. This is the right fix.
2. livekit-server: wrap pc.SetRemoteDescription(...) with a timeout, and on timeout call pc.Close() — Close ultimately tears down the DTLS conn, which lets readLoop exit (point 1 still needs to be true for the writer goroutine to actually unblock, though).
3. Workaround: call stream.SetWriteDeadline(...) on the SCTP stream before issuing the DCEP open, so the ctx arm of the select can fire. Requires reaching past webrtc.DataChannel though.
4. Heaviest hammer: don't pre-negotiate the data channels inline with SetRemoteDescription — open them lazily after PC reaches connected so a stuck open never blocks signaling.
Without (1), even (2) leaves the writer goroutine itself parked forever — but at least the PC and its participant-side state would be released; only the SCTP goroutine subtree (much smaller) would leak.
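Fix (2) could be as small as this sketch (pion/webrtc v4 signatures;
the timeout value and error text are placeholders):
    package sketch

    import (
        "errors"
        "time"

        "github.com/pion/webrtc/v4"
    )

    // setRemoteDescriptionWithTimeout runs SetRemoteDescription in a
    // goroutine and, if it does not return in time, closes the peer
    // connection. Close tears down the DTLS conn, which lets sctp's
    // readLoop exit (per point 1, the writer goroutine itself still
    // needs the upstream fix to unblock).
    func setRemoteDescriptionWithTimeout(pc *webrtc.PeerConnection, sd webrtc.SessionDescription, d time.Duration) error {
        done := make(chan error, 1) // buffered so the goroutine never leaks on send
        go func() { done <- pc.SetRemoteDescription(sd) }()
        select {
        case err := <-done:
            return err
        case <-time.After(d):
            _ = pc.Close()
            return errors.New("SetRemoteDescription timed out; peer connection closed")
        }
    }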
* revert probe stop change
* handle nil offer
`iceServersForParticipant` builds UDP TURN URLs by interpolating the
node IP directly into a format string:
fmt.Sprintf("turn:%s:%d?transport=udp", ip, port)
When `NodeIP.V6` is set, `ToStringSlice()` includes the bare IPv6
address, producing URLs like:
turn:2a05:d014:ee4:1201:7039:38c:f652:a252:443?transport=udp
RFC 3986 §3.2.2 requires IPv6 addresses in URIs to be enclosed in
square brackets. Without them the port is ambiguous and WebRTC clients
(e.g. libdatachannel) reject the URL with "Invalid ICE server port".
Use `net.JoinHostPort` which handles bracketing for IPv6 and is a
no-op for IPv4, producing well-formed URLs:
turn:[2a05:d014:ee4:1201:7039:38c:f652:a252]:443?transport=udp
turn:1.2.3.4:443?transport=udp
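The fix in miniature (a runnable sketch, not the actual
`iceServersForParticipant` code):
    package main

    import (
        "fmt"
        "net"
        "strconv"
    )

    // turnUDPURL uses net.JoinHostPort, which brackets IPv6 literals
    // and leaves IPv4 addresses untouched.
    func turnUDPURL(ip string, port int) string {
        return fmt.Sprintf("turn:%s?transport=udp", net.JoinHostPort(ip, strconv.Itoa(port)))
    }

    func main() {
        fmt.Println(turnUDPURL("2a05:d014:ee4:1201:7039:38c:f652:a252", 443))
        // turn:[2a05:d014:ee4:1201:7039:38c:f652:a252]:443?transport=udp
        fmt.Println(turnUDPURL("1.2.3.4", 443))
        // turn:1.2.3.4:443?transport=udp
    }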
* Use Muted in TrackInfo to propagate the published track's muted state.
When the track is muted as a receiver is created, the receiver
potentially was not getting the muted property. That would result in the
quality scorer expecting packets.
Use TrackInfo consistently for mute and apply the mute on start-up of a
receiver (sketched below).
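A sketch of the receiver start-up path, with names assumed:
    package sketch

    // TrackInfo mirrors the proto subset used here.
    type TrackInfo struct {
        Muted bool
    }

    type receiver struct {
        upstreamPaused bool
    }

    // newReceiver applies the authoritative mute state from TrackInfo
    // at creation, instead of relying on a mute signal that may have
    // been delivered before the receiver existed.
    func newReceiver(ti *TrackInfo) *receiver {
        return &receiver{upstreamPaused: ti.Muted}
    }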
* update mute of subscriptions
* fix: ensure num_participants is accurate in webhook events (#4265)
Three fixes for stale/incorrect num_participants in webhook payloads:
1. Move participant map insertion before MarkDirty in join path so
updateProto() counts the new participant.
2. Use fresh room.ToProto() for participant_joined webhook instead of
a stale snapshot captured at session start.
3. Remove direct NumParticipants-- in leave path (inconsistent with
updateProto's IsDependent check), force immediate proto update,
and wait for completion before triggering onClose callbacks.
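Fix (1) is essentially an ordering change; in sketch form, with names
assumed:
    package sketch

    import "sync"

    type Room struct {
        lock         sync.Mutex
        participants map[string]struct{}
        markDirty    func()
    }

    func newRoom(markDirty func()) *Room {
        return &Room{participants: make(map[string]struct{}), markDirty: markDirty}
    }

    // join inserts into the participant map before marking the room
    // dirty, so the refreshed proto produced by updateProto() counts
    // the new participant.
    func (r *Room) join(identity string) {
        r.lock.Lock()
        r.participants[identity] = struct{}{}
        r.lock.Unlock()
        r.markDirty() // previously ran before the map insertion
    }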
* fix: use ToProtoConsistent for webhook events instead of forcing immediate updates
* Update go deps
Generated by renovateBot
* update api usage
---------
Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>
Co-authored-by: David Zhao <dz@livekit.io>