The buffer is not for padding packets. So, calculate
adjusted sequence numbers before comparing against size.
Also, it is possible that an invalidated slot is accessed
because the padding range could not be excluded. That was
causing the time stamp to reset to 0. Will remove the error log
after this goes out and the condition does not show up
for a few days.
Seeing a time stamp jump that I am not able to explain.
Basically, it looks like the time stamp doubles at some
point. There is no code which doubles the timestamp.
An erroneous roll over/wrap around would be understandable,
but doubling is very strange.
So, logging only audio packets. Will disable as soon
as I have some samples from canary.
Seeing some unexplained jumps in sender report time stamp
in canary. Wonder if the calculated clock rate is way off
during some interval. Logging clock deviations to understand
better.
* More fine-grained filtering of NACKs after a key frame.
There are applications with periodic key frames.
So, a packet lost before a key frame will not be retransmitted.
But, the decoder could wait (jitter buffer, play out time) and cause
a stutter.
The idea behind disabling NACKs after a key frame was to have another
knob to throttle retransmission bit rate. But, with spaced out
retransmissions and max retransmissions per sequence number, there are
already throttles. This would provide more throttling, but affects
some applications. So, disabling the filtering of NACKs after a key frame.
Introducing another flag to disallow layers. This would still be quite
useful, i.e. under congestion the stream allocator would move the
target lower. But, because of congestion, the higher layer would have lost
a bunch of packets. The client would NACK those, and retransmitting those
higher layer packets would congest the channel more. The new flag (default
enabled) disallows higher layer retransmissions. This was happening
before this change also, just splitting out the flag for more control.
* split flag
* Log skew in clock rate.
Remember seeing sender report time stamp moving backward
across mute with replaceTrack(null). Not able to reproduce
it in JS sample app, but have seen it elsewhere.
Logging to understand it better. Wondering if the sender report
should be reset on time stamp moving backward or if we should drop
backwards moving reports.
* set threshold at 20%
* Error log of padding updating highest time to get backtrace.
* Do not update highest time on padding packet.
Padding packets use time stamp of last packet sent.
Padding packets could be sent when probing much after last packet
was sent. Updating highest time on that screws up sender report
calculations. We have ways of making sure sender reports do not
get too out-of-whack, but it logs during that repair.
That repair should be unnecessary unless the source is behaving weirdly
(things like the publisher sending all packets at the same time, an
incorrect publisher sample rate, etc.)
* Log highest time update on padding packet.
Seeing a strange case of what looks like highest time getting
updated on a padding packet. Can't see how it happens in code.
So, logging to check. Will be removing log after checking.
* log sequence number also
* Use 32-bit time stamp to get reference time stamp on a switch.
With relay, dyncast and migration, it is possible that different
layers of a simulcast get out of sync in terms of extended type,
i.e. layer 0 could keep running and its timestamp could have
wrapped around and bumped the extended timestamp, while another layer
could start and stop.
One possible solution is sending the extended timestamp across the relay.
But, that breaks down during migration if the publisher has started afresh
while the subscriber is still using the extended range.
So, use the 32-bit timestamp to infer the reference timestamp and patch it
with the expected extended time stamp to derive the extended reference.
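A sketch of that derivation, assuming 64-bit extended timestamps carry the 32-bit wire value in the low word (`extendTS` and its wrap handling are illustrative, not the actual code):

```go
package main

import "fmt"

// extendTS patches a 32-bit RTP timestamp from the wire into the extended
// (64-bit) space, using the expected extended timestamp to pick the cycle.
// If the naive patch lands more than half the 32-bit range away from the
// expected value, a wrap boundary was crossed and the cycle is adjusted.
func extendTS(ts uint32, expectedExt uint64) uint64 {
	ext := (expectedExt &^ uint64(0xFFFFFFFF)) | uint64(ts)
	switch {
	case ext > expectedExt && ext-expectedExt > 1<<31 && ext >= 1<<32:
		ext -= 1 << 32 // wire value is from just before a wrap
	case ext < expectedExt && expectedExt-ext > 1<<31:
		ext += 1 << 32 // wire value is from just after a wrap
	}
	return ext
}

func main() {
	// expected is just past a wrap; the wire timestamp is from just before it
	fmt.Printf("%#x\n", extendTS(0xFFFFFFF0, 0x100000010))
}
```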
* use calculated value
* make it test friendly
Seeing a large positive gap which I am not able to explain.
Wondering if at some other time a large negative jump is happening
and the large positive is just a correction.
* Log potential sequence number de-sync in receiver reports.
Seeing some cases of a roll over being missed. That ends up
as a largish range to search in an interval and reports missing packets
in the packet metadata cache.
Logging some details.
* just log in one place
Ideally, can remove the nil return when there are too many packets
as we have more information with extended sequence numbers, but
logging duration first to understand what is happening better.
When converting from RED -> Opus, if there is a loss, the SFU recovers
that loss if it can using a subsequent redundant packet. That path
was not setting the extended sequence number properly.
Also, ensuring use of the monotonic clock for first packet time
adjustment.
* Cap expected packets to padding diff.
On the receiver, no longer using packet metadata cache to calculate
interval stats. An optimisation to get rid of packet metadata cache
on receiver side.
Because of that, padding packets in an interval could exceed
expected packets. As the padding packet count is just a counter,
out-of-order padding packets will make the diff look larger than
expected packets in a window. Cap the expected at 0.
NOTE: This makes the count inaccurate in a window,
but that is okay occasionally. It will affect reported stats and quality
calculations, but it should be rare. For example, if 30 packets were
received in a window and 60 out-of-order padding packets were received,
it would be reported as 0 packets received. One option is to not
increment the padding count when they are out-of-order, but that would
mess up overall stats. Will make that change if we see this happen a lot.
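The capping itself is trivial; a sketch with illustrative names, using the 30/60 example from the note:

```go
package main

import "fmt"

// intervalExpected returns the number of media packets expected in a
// window. expected comes from sequence number progression; padding is a
// running counter, so out-of-order padding in a window can exceed
// expected. Rather than letting the difference go negative, cap it at 0
// and accept an occasionally inaccurate window.
func intervalExpected(expected, padding uint32) uint32 {
	if padding > expected {
		return 0
	}
	return expected - padding
}

func main() {
	// example from above: 30 packets in the window, 60 out-of-order padding
	fmt.Println(intervalExpected(30, 60)) // reported as 0 packets
	fmt.Println(intervalExpected(100, 40))
}
```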
* log unexpected padding packets
* Log resync at Infow.
Seeing potentially large sequence number jumps on a resync.
And it seems to happen on a lot of subscribe/unsubscribe.
Logging at Infow to understand better.
Probably need to find a way to avoid resync. But, logging for now to
check if I can catch one.
* Remove resync and log large sequence number jumps
* Use bit map.
Also, duplicate packet detection is important for dropping padding
only packets at the publisher side itself. In the last PR, it was
mentioned that it is only for stats.
* clean up
* Update deps
* Reduce packet meta data cache - part 1
Packet meta data cache takes a good amount of space.
That cache is 8K entries deep and each entry is 8 bytes.
So, that takes 64KB per RTP stream.
It is mostly needed for down stream to line up with receiver reports.
So, removing cache from up stream (RTPStatsReceiver) as part 1.
Will look at optimising the down stream in part 2.
* Remove caching from RTPStatsReceiver
* clean up a bit more
* maintain history and fix test
* Throttle packet errors/warns a bit.
In very bad network conditions, it is possible that packets
arrive out-of-order or show choppy behaviour.
Use some counters and temper the logs.
* slight change in comment
Streaming could start after the 16-bit sequence number has rolled over.
So, have to add that base back to what is received in the receiver report.
Otherwise, it looks like no packets were received in the window,
leading to poor quality.
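A sketch of adding the base back, assuming the receiver report's extended highest sequence number is `cycles<<16 | seq` per RFC 3550, and that the sender knows its own wrap count at subscription start (names are illustrative):

```go
package main

import "fmt"

// adjustRRExtHighest converts the extended highest sequence number reported
// by a receiver into the sender's extended space. The receiver only counts
// wraps it has seen itself; if the stream had already wrapped baseCycles
// times before the subscription started, those wraps must be added back,
// or windowed loss math sees no packets received at all.
func adjustRRExtHighest(rrExtHighest uint32, baseCycles uint64) uint64 {
	return (baseCycles << 16) + uint64(rrExtHighest)
}

func main() {
	// receiver saw 1 wrap and seq 5; sender had wrapped 3 times before start
	fmt.Printf("%#x\n", adjustRRExtHighest(0x00010005, 3))
}
```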
* Split RTPStats into receiver and sender.
For the receiver, short types are the input and the extended type needs
to be calculated. For the sender (subscriber), it can operate purely in
the extended type.
This makes the subscriber side a little simpler and should make it more
efficient, as it can do simple comparisons in extended type space.
There was also an issue with the subscriber using the shorter type and
calculating the extended type. When the subscriber starts after the
publisher has already rolled over in sequence number OR timestamp,
subsequent publisher side sender reports used to adjust subscriber
time stamps were out of whack. Using the extended type on the subscriber
side does not face that.
* fix test
* extended types from sequencer
* log
* Fix time stamp adjustment when starting with dummy packets.
- Populated extended values in ExtPacket on dummy packet.
- Have to pass reference time stamp offset to first packet time
adjustment.
* display participant version info
* Sequencer small optimisations
1. Use a range map to exclude padding-only packets. Should take less
   space as we are not using a slice holding a pointer to actual data
   per packet.
2. Avoid `time.Now()` when adding each packet. Just use the arrival time
   as it should be close enough. `time.Now()` was showing up in the
   profile.
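A minimal sketch of the range map idea (all names hypothetical; the actual implementation differs): runs of dropped padding-only packets compress to two integers instead of one slice entry per packet.

```go
package main

import "fmt"

// seqRange is an inclusive run of dropped padding-only sequence numbers.
type seqRange struct{ start, end uint64 }

// rangeMap stores dropped runs in order instead of one cache slot per
// packet, so long padding bursts cost two integers rather than N pointers.
type rangeMap struct{ ranges []seqRange }

// exclude records a dropped run; assumes runs are added in increasing
// order and merges adjacent ones.
func (m *rangeMap) exclude(start, end uint64) {
	if n := len(m.ranges); n > 0 && m.ranges[n-1].end+1 == start {
		m.ranges[n-1].end = end
		return
	}
	m.ranges = append(m.ranges, seqRange{start, end})
}

// offset returns how many excluded sequence numbers precede seq, i.e. how
// much to subtract when renumbering an outgoing packet.
func (m *rangeMap) offset(seq uint64) uint64 {
	var off uint64
	for _, r := range m.ranges {
		if seq <= r.end {
			break
		}
		off += r.end - r.start + 1
	}
	return off
}

func main() {
	m := &rangeMap{}
	m.exclude(40, 40)
	m.exclude(41, 41) // merges into 40-41
	m.exclude(50, 52)
	fmt.Println(m.offset(42), m.offset(53)) // 2 and 5 excluded below
}
```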
* remove debug
* correct comment
Profiling showed updating jitter going through the snapshot maps.
With the reduction to one, there should only be one snapshot
and hopefully that gains some cycles back.
* Do NACKs and reports always.
With padding packet drops, it is possible that a lot of packets
go by without RTCP RR.
Do NACKs and RTCP RR always.
* remove local variable
Seeing the case of a stream starting with padding packets
on migration. As the publisher in that case is always sending packets,
it is possible that the new node gets padding packets at the start.
Processing them in the buffer leads to trying to drop those padding
packets and adding an exclusion range. That fails because the extended
sequence number is not available for unprocessed packets.
It is okay to drop them as they will be dropped anyway.
But, they are useful for bandwidth estimation. So, headers are processed
even if the packet is RTPStats unprocessed.
* Handle duplicate padding packet in the up stream.
The following sequence would have produced incorrect results:
- Sequence number 39 - regular packet - offset = 0
- Sequence number 40 - padding only - drop - offset = 1
- Sequence number 40 - padding only duplicate - was not dropped (this is
  the bug) - apply offset - sequence number becomes 39 and clashes with
  the previous packet
- Sequence number 41 - regular packet - apply offset - goes through as 40.
- Sequence number 40 again - does not get dropped - will pass through as 39.
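A sketch of the duplicate handling, with illustrative names; it assumes in-order arrival for the offset bookkeeping, where the real code uses exclusion ranges:

```go
package main

import "fmt"

// translator renumbers forwarded packets after padding-only drops.
type translator struct {
	dropped map[uint64]bool // extended seq numbers of dropped padding packets
	offset  uint64          // how many packets have been dropped so far
}

// translate returns the adjusted sequence number, or ok=false if the
// packet must be dropped. A duplicate of an already-dropped padding packet
// must be dropped again, not renumbered, or it collides with an earlier
// forwarded packet (the bug described above).
func (t *translator) translate(extSeq uint64, paddingOnly bool) (adjusted uint64, ok bool) {
	if t.dropped[extSeq] {
		return 0, false // duplicate of a dropped packet
	}
	if paddingOnly {
		t.dropped[extSeq] = true
		t.offset++
		return 0, false
	}
	return extSeq - t.offset, true
}

func main() {
	t := &translator{dropped: map[uint64]bool{}}
	s1, _ := t.translate(39, false) // regular -> 39
	t.translate(40, true)           // padding -> dropped, offset = 1
	_, ok := t.translate(40, true)  // duplicate -> dropped again
	s2, _ := t.translate(41, false) // regular -> 40
	fmt.Println(s1, ok, s2)
}
```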
* fix duplicate dropping
* fix tests
* accept repeat last value as padding injection could cause that
* use exclusion ranges
* more UT and more specific errors