livekit

mirror of https://github.com/livekit/livekit.git synced 2026-05-24 23:26:11 +00:00

Author	SHA1	Message	Date
Raja Subramanian	9702d3b541	A couple of more opportunities in stream allocator. (#1906 ) 1. When re-allocating for a track in DEFICIENT state, try to use available headroom to accommodate change before trying to steal bits from other tracks. 2. If the changing track gives back bits (because of muting or moving to a lower layer subscription), use the returned bits to try and boost deficient track(s).	2023-07-26 15:35:07 +05:30
Raja Subramanian	0484a68342	Plug a couple of holes in stream transitions. (#1905 ) * Plug a couple of holes in stream transitions. 1. Missed negative sign meant stealing bits from other tracks was not working. 2. When a track change (mute, unmute, subscription change) cannot be allocated, explicitly pause so that stream state update happens. Refactor stream state update a bit to make it a bit cleaner. * correct comment	2023-07-26 13:36:58 +05:30
Raja Subramanian	5ae1387c68	Return a copy of down tracks from spreader. (#1902 ) As shadow copy can change, do not return as is. Also use the broacast function to broadcast up track changes to down tracks.	2023-07-25 19:00:43 +05:30
Raja Subramanian	ffd6dc2210	Packet level ddebug logs. (#1900 ) Only for debugging for a bit. Not for deploy.	2023-07-25 13:53:21 +05:30
Raja Subramanian	43fa6f57d1	A very simple leaky bucket pacer. (#1899 )	2023-07-23 10:11:35 +05:30
Raja Subramanian	7e6aa00426	Remove unused fields left over from refactor (#1897 )	2023-07-21 16:23:00 +05:30
Raja Subramanian	dd995899bf	Handle extreme case of sender report lagging. (#1892 )	2023-07-19 12:50:03 +05:30
Raja Subramanian	cf8cf1a87f	Forgot to log important bits :-( (#1891 )	2023-07-19 10:22:51 +05:30
Raja Subramanian	66de9ff4a0	Add debug log for RTCP sender report. (#1890 ) * Add debug log for RTCP sender report. Temporary to collect more data. Hitting scenarios under congestion where the sender report gets off sync. Need some data to pore through and understand and implement changes. * Debugw	2023-07-18 23:21:06 +05:30
Raja Subramanian	f41b93657e	Log a bit more in sender report warp report. (#1888 )	2023-07-18 09:14:41 +05:30
David Zhao	5d1d454a98	Fix missed label arg in logger (#1886 )	2023-07-16 20:05:41 -07:00
Raja Subramanian	11e1eb00fa	Attempt to avoid out-of-order max subscribed layer notifications. (#1882 ) * Check for request layer lock only in the goroutine * check before sending PLI * max layer notifier worker * test cleanup * clean up * do notification in the callback	2023-07-16 23:28:20 +05:30
Raja Subramanian	4c02a6d717	Time stamp adjustments v2 (I think) (#1875 ) * WIP commit * WIP commit * WIP commit * Some clean up - Removed a chatty debug log - some spelling, punctuation correction in comments - missed an `Abs` in check, add it.	2023-07-14 11:47:07 +05:30
Raja Subramanian	e746fe14e1	Mark active when switching to parked layer. (#1873 ) * Mark active when switching to parked layer. Parked layer lock is not a switch. It is just a restart at the same layer. * make explicit bool for switching	2023-07-13 10:42:23 +05:30
Raja Subramanian	8dc2c005c3	Add ability to roll back video layer selection. (#1871 ) * Add ability to roll back video layer selection. Not currently useful, but it is possible to do things like not applying a layer switch if the switch point time stamp is too far back. Add ability to roll back a layer switch and invoke rollback if a packet was selected for forwarding, but a subsequent error or decision to drop the packet can rollback layer switch if that was the switching packet. In current code, the paths where a packet can be dropped after selection does not happen at switch points. So, it was okay to apply the selection unconditionally. But, adding the call to rollback in the current code also in all paths where packet is dropped after selection for consistent code flow. * separate switch for temporal layer	2023-07-12 14:12:00 +05:30
Raja Subramanian	5459bd2931	Push track quality to poor on a bandwidth constrained pause. (#1867 ) * Push track quality to poor on a bandwidth constrained pause. * add tests * scale distance by divisor * fix test distance to desired * wait longer for subscription manager to reconcile	2023-07-11 15:29:35 +05:30
Raja Subramanian	e6f5f2f344	Prevent anachronous sample reading. (#1863 ) * Prevenet anachronous sample reading. Not so pretty way of solving this. Please let me know if you have thoughts. Passing in time allows testing easier. But, that also leads to time reversal problems. Example scenario 1. Connection stats worker gets a time and initiates quality calculation. 2. A layer transition is recorded after that. 3. By the time, scorer is called to calculate score with time from Step 1, there is time reversal and results in anachronous sample. One option is to use a scorer lock in connection stats module and wrap all calls to scorer in that lock, but that does not prevent the passed in time stamps themselves getting out of order. Also, stand alond use of scorer in some other context will be problematic. Doing the hybrid thing of taking current time in scorer if passed in time is zero so that scorer lock domain controls it. * use zero time everywhere in normal flow * make APIs with and without time passed in as Paul suggested	2023-07-10 08:39:52 +05:30
Raja Subramanian	bf3732b898	Remove noisy debug logs. (#1858 )	2023-07-08 11:58:56 +05:30
cnderrauber	873c87f24b	Fix nack issue for svc codecs (#1856 ) * Fix nack issue for svc codecs * Fix test	2023-07-07 15:46:18 +08:00
Raja Subramanian	e3954d1d64	Use timed aggregator. (#1843 ) * Use timed aggregator. For aggregate bitrate and average distance from desired. Also, clean up debug added to track leak. * update deps	2023-07-01 10:21:15 +05:30
Raja Subramanian	06f9b574cb	Delete down track from receiver in close always. (#1842 ) * Delete down track from receiver in close always. I think with the parallel close in goroutines, it so happens that peer connection can get closed first and unbind the track. The delete down track and RTCP reader close was inside if `bound` block. So, they were not running leaving a dangling down track in the receiver. * fix tests * fix test	2023-06-30 20:44:57 +05:30
Raja Subramanian	496656627e	Logging more to understand layer transition leak better. (#1840 )	2023-06-30 11:59:53 +05:30
Raja Subramanian	69a1e572be	Attempt to reduce disruption due to probe. (#1839 ) * Make congestion controller probe config * Wait for enough estimate samples * fixes * format * limit number of times a packet is ACKed * ramp up probe duration * go format * correct comment * restore default * add float64 type to generated CLI	2023-06-30 11:09:46 +05:30
Raja Subramanian	eaf70d5549	Pacer in down stream path. (#1835 ) * Pacer interface to send packets * notify outside lock * use select * use pass through pacer * add error to OnSent * Remove log which could get noisy * Starting TWCC work (#1727) * add packet time * WIP commit * WIP commit * WIP commit * minor comments * Some measurements (#1736) * WIP commit * some notes * WIP commit * variable name change and do not post to closed channel * unlock * clean up * comment * Hooking up some more bits for TWCC (#1752) * wake under lock * Pacer in down stream path. Splitting out only the pacer from a feature branch to introduce the concept of pacer. Currently, there should be no difference in functionality as a pass through pacer is used. Another implementation exists which is just put it in a queue and send it from one goroutine. A potential implementation to try would be data paced by bandwidth estimate. That could include priority queues and such. But, the main goal here is to introduce notion of pacer in the down stream path and prepare for more congestion control possibilities down the line. * Don't need peak detector * remove throttling of write IO errors	2023-06-28 13:22:44 +05:30
Raja Subramanian	2b0a470474	Less flapping in probe. (#1834 ) - Increase max interval between probes to 2 minutes. - Use a minimum probe rate of 200 kbps. This is to ensure that the probe rate is decent and can produce a stronger signal.	2023-06-28 12:48:38 +05:30
Raja Subramanian	cea41e4189	Discount out-of-order packets in downstream score. (#1831 ) * Discount out-of-order packets in downstream score. More notes inline. * correct comment * clean up comment	2023-06-27 17:44:53 +05:30
cnderrauber	5b975af55f	Refine dependency descriptor based selection forwarder (#1808 ) * Don't update dependency info if unordered packet received * Trace all active svc chains for downtrack * Try to keep lower decode target decodable * remove comments * Test case * clean code * solve comments	2023-06-27 15:11:06 +08:00
Raja Subramanian	8ac394c5bb	Removing commented out short cut path, don't need more debug data. (#1822 )	2023-06-23 14:18:55 +05:30
Raja Subramanian	2438058474	Drop error logs due to pipe close (#1813 )	2023-06-21 14:11:17 +05:30
Raja Subramanian	84994b39ab	Make the samples string more readable. (#1810 )	2023-06-21 11:35:38 +05:30
Raja Subramanian	27051e9999	It is possible that pipe is closed before blank frame send, do not warn (#1807 )	2023-06-20 11:58:01 +05:30
Raja Subramanian	2383234f6e	Simplify sliding window collapse. (#1802 ) * Simplify sliding window collapse. Keep the same value collapsing simple. Add it to sliding window as long as same value is received for longer than collapse threshold. But, add a prune with three conditions to process the siliding window to ensure only valid samples are kept. * flip the order of validity window and same value pruning * increase collapse threshold to 0.5 seconds during non-probe	2023-06-17 18:56:38 +05:30
Raja Subramanian	395f403132	Small stream allocator tweaks. (#1800 ) 1. Probe end time needs to include the probe cluster running time also. 2. Apply collapse window only within the sliding window. This is to prevent cases of some old data declaring congestion. For example, an estimate could have fallen 15 seconds ago and there might have been a bunch of estimates at that fallen value. And the whole sliding window could have that value at some point. But, a further drop may trigger congestion detection. But, that might be acting too fast, i. e. on one instance of value fall. Change it so that we detect if there is a fall within the sliding window and apply collapse based on that.	2023-06-17 12:35:29 +05:30
Raja Subramanian	908b7a9bb1	Promote some migration logs to Infow (#1798 )	2023-06-16 19:00:17 +05:30
Raja Subramanian	6946d0a3a1	Do not mute forwarder when paused to bandwidth congestion. (#1796 ) * Do not mute forwarder when paused to bandwidth congestion. Detailed notes in code. * remove word	2023-06-16 12:08:01 +05:30
Raja Subramanian	afa7733748	Promote switch logs to Infow. (#1790 )	2023-06-12 17:30:56 +05:30
Raja Subramanian	9809b8bc3a	Use nack queue params. (#1789 ) * Use nack queue params. * fix test	2023-06-12 13:01:02 +05:30
cnderrauber	c91889edfd	Add dependency descriptor stream tracker for svc codecs (#1788 ) * Add dependency descriptor stream tracker for svc codecs * Solve comments	2023-06-12 15:07:47 +08:00
Raja Subramanian	3d696ac39f	Keep next timestamp on switch closer to ref. (#1784 ) If ref is coming in slow (due to pacing), it is possible that expected is ahead. Pulling next too far towards expected causes warps in a subsequent report. Keep switches closer to ref.	2023-06-10 11:38:46 +05:30
Raja Subramanian	4805dec1f0	Create channel observer on probe reset. (#1783 ) On a state change, it was possible an aborted probe was pending finalize. When probe controller is reset, the probe channel observer was not reset. Create a new non-probe channel observer on state change to get a fresh start. Also limit probe finalize wait to 10 seconds max. It is possible that the estimate is very low and we have sent a bunch of probes. Calculating wait based on that could lead to finalize waiting for a long time (could be minutes).	2023-06-10 10:54:55 +05:30
Raja Subramanian	0e7bdeabcb	Simplify probe done handling. (#1782 ) * Simplify probe done handling. Seeing a case where the channel abserver is not re-created after an aborted probe. Simplifying probe done (no callbacks, making it synchronous). * log more	2023-06-10 02:07:28 +05:30
Raja Subramanian	72ed5b19f7	Use receiver report stats for loss/rtt/jitter. (#1781 ) * Use receiver report stats for loss/rtt/jitter. Reversing a bit of https://github.com/livekit/livekit/pull/1664. That PR did two snapshots (one based on what SFU is sending and one based on combination of what SFU is sending reconciled with stats reported from client via RTCP Receiver Report). That PR reported SFU only view to analytics. But, that view does not have information about loss seen by client in the downstream. Also, that does not have RTT/jitter information. The rationale behind using SFU only view is that SFU should report what it sends irrespective of client is receiving or not. But, that view did not have proper loss/RTT/jitter. So, switch back to reporting SFU + receiver report reconciled view. The down side is that when receiver reports are not receiver, packets sent/bytes sent will not be reported to analytics. An option is to report SFU only view if there are no receiver reports. But, it becomes complex because of the offset. Receiver report would acknowledge certain range whereas SFU only view could be different because of propagation delay. To simplify, just using the reconciled view to report to analytics. Using the available view will require a bunch more work to produce accurate data. (NOTE: all this started due to a bug where RTCP was not restarted on a track resume which killed receiver reports and we went on this path to distinguish between publisher stopping vs RTCP receiver report not happening) One optimisation to here here concerns the check to see if publisher is sending data. Using a full DeltaInfo for that is an overkill. Can do a lighter weight for that later. * return available streams * fix test	2023-06-09 23:31:25 +05:30
Raja Subramanian	f518f5d743	Log head SN when packet cannot be fetched (#1780 )	2023-06-09 12:13:06 +05:30
Raja Subramanian	22813cd2be	Recreate channel observer irrespective of probe success/fail. (#1778 )	2023-06-08 01:40:07 +05:30
Raja Subramanian	b591140d66	Ignore receiver report till initialized (#1773 )	2023-06-06 21:43:49 +05:30
Raja Subramanian	7ed3af193a	No proof that this helps (#1772 )	2023-06-06 11:28:13 +05:30
Raja Subramanian	076d8cad73	Promote switch log to Infow (#1771 )	2023-06-06 11:20:57 +05:30
Raja Subramanian	f5c5d4e079	Wait for a more stable measurement of sample rate. (#1764 )	2023-06-03 14:26:26 +05:30
Raja Subramanian	c2ae34151c	Enable some debug logs to debug freeze (#1761 ) * Enable some debug logs to debug freeze * log receiver sender report also	2023-06-02 16:31:19 +05:30
David Zhao	b5c8fe5294	Perform unsubscribe in parallel to avoid blocking (#1760 ) * Perform unsubscribe in parallel to avoid blocking When unsubscribing from tracks, we flush a blank frame in order to prepare the transceivers for re-use. This process is blocking for ~200ms. If the unsubscribes are performed serially, it would prevent other subscribe operation from continuing. This PR parallelizes that operation, and ensures subsequent subscribe operations could reuse the existing transceivers. * also perform in parallel when uptrack close * fix a few log fields	2023-06-02 00:13:18 -07:00

1 2 3 4 5 ...

501 Commits