6376 Commits

Author SHA1 Message Date
antirez
83923afa8c Track number of logically expired keys still in memory.
This commit adds two new fields in the INFO output, stats section:

expired_stale_perc:0.34
expired_time_cap_reached_count:58

The first field is an estimate of the percentage of keys that are still in
memory but are already logically expired. The reason those keys are
not yet reclaimed is that the active expire cycle can't spend more
time reclaiming them, and at the same time nobody
is accessing such keys. However, as the active expire cycle runs, and even
though it will eventually have to return to the caller, because of the time
limit or because fewer than 25% of keys are logically expired in each
given database, it collects the stats needed to populate this INFO
field.

Note that expired_stale_perc is a running average, where the current
sample accounts for 5% and the history for 95%, so you'll see it
changing smoothly over time.
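
The smoothing described above can be sketched as follows. This is a minimal illustration, not the Redis implementation: the function names and the `weight` parameter are hypothetical, and only the 5%/95% split comes from the commit message.

```python
def update_stale_estimate(history: float, sample: float, weight: float = 0.05) -> float:
    """Blend the newest sample (5%) with the accumulated history (95%)."""
    return sample * weight + history * (1.0 - weight)


def simulate(samples, start: float = 0.0):
    """Feed a sequence of per-cycle samples and return the estimate after each one."""
    estimate = start
    trace = []
    for sample in samples:
        estimate = update_stale_estimate(estimate, sample)
        trace.append(estimate)
    return trace
```

With a constant input the estimate approaches the input gradually instead of jumping, which is why the INFO field changes smoothly.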

The other field, expired_time_cap_reached_count, counts the number
of times the expire cycle had to stop because of the time limit, even
though it was still finding a sizeable number of keys yet to expire.
This allows people handling operations to understand whether the Redis
server, during mass-expiration events, is usually able to collect keys
fast enough. It is normal for this field to increment during mass
expires, but otherwise it should increment very rarely. When instead it
constantly increments, it means that the current workload is using
a very significant percentage of CPU time to expire keys.

This feature was created thanks to the hints of Rashmi Ramesh and
Bart Robinson from Twitter. In private email exchanges, they noted how
important it was to improve the observability of this parameter in the
Redis server. Indeed, in big deployments, the keys that are logically
expired but not yet reclaimed in each server may
account for a very large amount of wasted memory.
2018-02-19 11:22:34 +01:00
antirez
256ddbf6dc Remove non semantical spaces from module.c. 2018-02-15 21:47:50 +01:00
antirez
280c3e3987 Fix typo in notifyKeyspaceEvent() comment. 2018-02-15 21:47:42 +01:00
Dvir Volk
7c4623b0d3 Add doc comment about notification flags 2018-02-15 21:47:38 +01:00
Dvir Volk
f4e7502e4f Fix indentation and comment style in testmodule 2018-02-15 21:46:44 +01:00
Dvir Volk
3c8456c641 Use one static client for all keyspace notification callbacks 2018-02-15 21:46:38 +01:00
Dvir Volk
aaaff8bd1c Remove the NOTIFY_MODULE flag and simplify the module notification flow if there aren't subscribers 2018-02-15 21:46:31 +01:00
Dvir Volk
0be51b8f54 Document flags for notifications 2018-02-15 21:45:41 +01:00
Dvir Volk
3b95c89cdb removed some trailing whitespaces 2018-02-15 21:45:37 +01:00
Dvir Volk
84c6f1e3ca removed hellonotify.c 2018-02-15 21:45:32 +01:00
Dvir Volk
53b85e53e3 fixed test 2018-02-15 21:45:27 +01:00
Dvir Volk
b43f66c9d4 finished implementation of notifications. Tests unfinished 2018-02-15 21:45:22 +01:00
antirez
eddf5deb38 More verbose logging when slave sends errors to master.
See #3832.
2018-02-15 21:43:23 +01:00
oranagra
c09cc0a9b7 when a slave experiences an error on commands that come from master, print to the log
since the slave isn't replying to its master, these errors go unnoticed.
since we don't expect the master to send garbage to the slave, this should be safe.
(as long as we don't log OOM errors there)
2018-02-15 21:43:17 +01:00
charsyam
5c374f94ef getting rid of duplicated code 2018-02-13 16:21:01 +01:00
Guy Benoish
a64f36e556 enlarged buffer given to ld2string 2018-02-13 15:51:36 +01:00
antirez
f170580195 Make it explicit with a comment why we kill the old AOF rewrite.
See #3858.
2018-02-13 15:46:53 +01:00
Guy Benoish
0c030dea73 rewriteAppendOnlyFileBackground() failure fix
It is possible to do BGREWRITEAOF even if appendonly=no. This is by design.
stopAppendonly() didn't turn off aof_rewrite_scheduled (it can be turned on
again by BGREWRITEAOF even while appendonly is off anyway).
After configuring `appendonly yes` it will see that the state is AOF_OFF,
there's no RDB fork, so it will do rewriteAppendOnlyFileBackground() which
will fail since the aof_child_pid is set (was scheduled and started by cron).

Solution:
stopAppendonly() will turn off the schedule flag (regardless of who asked for it).
startAppendonly() will terminate any existing fork and start a new one (so it is the most recent).
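
The failure and the fix can be modeled with a toy state machine. This is only a sketch under assumptions: the class, its methods, and the pid counter are hypothetical stand-ins for the flags named in the message (aof_rewrite_scheduled, aof_child_pid), and it models nothing beyond what the message states.

```python
class AofState:
    """Toy model of the AOF flags discussed above (not Redis code)."""

    def __init__(self):
        self.appendonly = False
        self.rewrite_scheduled = False  # stands in for aof_rewrite_scheduled
        self.child_pid = None           # stands in for aof_child_pid
        self.killed = []                # pids of rewrites that were terminated
        self._next_pid = 1000

    def _fork_rewrite(self) -> bool:
        """Start a background rewrite; fails if one is already running."""
        if self.child_pid is not None:
            return False
        self.child_pid = self._next_pid
        self._next_pid += 1
        return True

    def bgrewriteaof(self):
        """Allowed even while appendonly is off: just schedule a rewrite."""
        self.rewrite_scheduled = True

    def cron(self):
        """Periodic task: start a scheduled rewrite if none is running."""
        if self.rewrite_scheduled and self._fork_rewrite():
            self.rewrite_scheduled = False

    def stop_appendonly(self):
        self.appendonly = False
        self.rewrite_scheduled = False  # fix, part 1: drop any pending schedule

    def start_appendonly(self) -> bool:
        if self.child_pid is not None:  # fix, part 2: replace the stale rewrite
            self.killed.append(self.child_pid)
            self.child_pid = None
        self.appendonly = self._fork_rewrite()
        return self.appendonly
```

Without the second part of the fix, `start_appendonly()` would call `_fork_rewrite()` while `child_pid` is still set and fail, which is the situation the message describes.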
2018-02-13 15:46:50 +01:00
Oran Agra
5807397460 fix to latency monitor reporting wrong max latency
in some cases LATENCY HISTORY reported latency that was
higher than the max latency reported by LATENCY LATEST / DOCTOR
2018-02-13 15:31:43 +01:00
antirez
f17d82961d Redis 4.0.8. 4.0.8 2018-02-02 17:39:14 +01:00
antirez
f603940f7c Rax updated to latest antirez/rax commit. 2018-02-02 11:10:30 +01:00
antirez
2c1fc582c7 Redis 4.0.7. 4.0.7 2018-01-24 11:16:18 +01:00
jianqingdu
2b99d77a57 fix missing va_end call when syncWrite() fails
fix the missing va_end call when syncWrite() fails in sendSynchronousCommand()
2018-01-24 10:58:57 +01:00
Yusaku Kaneta
5f9b9e1194 Fix the firstkey, lastkey, and keystep of moduleCommand 2018-01-24 10:58:39 +01:00
Mark Nunberg
ba2d3e8e6e redismodule.h: Check ModuleNameBusy before calling it
Older versions might not have this function.
2018-01-24 10:48:42 +01:00
antirez
05c1f18d6a Fix integration test NOREPLICAS error time dependent false positive. 2018-01-24 10:24:22 +01:00
antirez
4acd6973bf Fix migrateCommand() access of not initialized byte. 2018-01-18 12:41:23 +01:00
Guy Benoish
548e4fe088 Replication buffer fills up on high rate traffic.
When feeding the master with high-rate traffic, the slave's feed is much slower.
This causes the replication buffer to grow (indefinitely), which leads to slave disconnection.
The problem is that writeToClient() decides to stop writing after NET_MAX_WRITES_PER_EVENT
writes (in order to be fair to clients).
We should ignore this when the client is a slave.
It's better if clients wait longer; the alternative is that the slave has no chance to stay in
sync in this situation.
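
A rough sketch of the idea, not the actual writeToClient() code: the function name, the `is_slave` parameter, and the constant below are hypothetical, and the cap is assumed here to be a per-event byte budget.

```python
from collections import deque

# Hypothetical stand-in for NET_MAX_WRITES_PER_EVENT (assumed to be a byte budget).
MAX_BYTES_PER_EVENT = 64 * 1024


def write_to_client(pending: deque, is_slave: bool,
                    budget: int = MAX_BYTES_PER_EVENT) -> int:
    """Drain queued reply chunks and return the bytes written in this event."""
    written = 0
    while pending:
        chunk = pending.popleft()
        written += len(chunk)
        # Ordinary clients yield once the budget is exceeded, for fairness.
        # Slaves are exempt so the replication buffer can actually drain.
        if written > budget and not is_slave:
            break
    return written
```

For an ordinary client the loop stops shortly after the budget is crossed and the rest waits for the next event; for a slave the whole queue is flushed in one go.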
2018-01-18 12:16:50 +01:00
antirez
efa7063c52 Cluster: improve anti-affinity algo in redis-trib.rb.
See #3462 and related PRs.

We use a simple algorithm to calculate the level of affinity violation,
and then an optimizer that performs random swaps until things improve.
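
The two-step approach (score the violations, then swap at random) might look like the sketch below. It is written in Python rather than the Ruby of redis-trib.rb, and the data layout, scoring rule, and function names are all assumptions made for illustration; the real scoring in redis-trib.rb may differ.

```python
import random
from collections import Counter


def affinity_score(shards) -> int:
    """Count nodes that share a host with another node of the same shard.

    Each shard is a list of host names: index 0 is the master, the rest
    are its replicas (an assumed layout).
    """
    score = 0
    for hosts in shards:
        for count in Counter(hosts).values():
            if count > 1:
                score += count - 1
    return score


def optimize(shards, max_iter: int = 1000, seed: int = 0):
    """Swap replicas between random shards, keeping swaps that don't hurt.

    Assumes at least two shards, each with at least one replica.
    """
    rng = random.Random(seed)
    shards = [list(s) for s in shards]  # work on a copy
    score = affinity_score(shards)
    for _ in range(max_iter):
        if score == 0:
            break
        a, b = rng.sample(range(len(shards)), 2)
        i = rng.randrange(1, len(shards[a]))  # never move the master
        j = rng.randrange(1, len(shards[b]))
        shards[a][i], shards[b][j] = shards[b][j], shards[a][i]
        new_score = affinity_score(shards)
        if new_score > score:
            # The swap made things worse: undo it.
            shards[a][i], shards[b][j] = shards[b][j], shards[a][i]
        else:
            score = new_score
    return shards, score
```

Accepting swaps that leave the score unchanged lets the search move sideways instead of getting stuck, at the cost of some wasted iterations.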
2018-01-18 12:16:46 +01:00
antirez
48568ab6d7 Remove useless comment from serverCron().
The behavior is well specified by the code itself.
2018-01-18 12:16:42 +01:00
heqin
0201dea577 fixbug for #4545 dead loop aof rewrite 2018-01-18 12:16:37 +01:00
antirez
926beaa3c4 Hopefully more clear comment to explain the change in #4607. 2018-01-18 12:16:31 +01:00
qinchao
019ad3e2e3 fix assert problem in ZIP_DECODE_PREVLENSIZE,
see issue: https://github.com/antirez/redis/issues/4587
2018-01-18 12:16:23 +01:00
Oran Agra
8d9dff84ce PSYNC2 fix - promoted slave should hold on to its backlog
after a slave is promoted (assuming it has no slaves
and it booted over an hour ago), it will lose its replication
backlog at the next replication cron, rather than waiting for slaves
to connect to it.
so on a simple master/slave failover, if the new slave doesn't connect
immediately, it may be too late and PSYNC2 will fail.
2018-01-18 12:16:05 +01:00
zhaozhao.zz
fba2e169f9 aof: format code and comment 2018-01-18 12:15:57 +01:00
antirez
7777be7b0f Put more details in the comment introduced by #4601. 2018-01-18 12:15:53 +01:00
zhaozhao.zz
91c1568b1a lazyfree: fix memory leak for lazyfree-lazy-server-del 2018-01-18 12:15:47 +01:00
antirez
f9c2c1acc6 Fix getKeysUsingCommandTable() in the case of negative arity.
This fixes a crash with Redis Cluster when OBJECT is mis-used, because
getKeysUsingCommandTable() will call serverPanic() detecting we are
accessing an invalid argument in the case "OBJECT foo" is called.

This bug was introduced when OBJECT HELP was introduced, because the key
argument is set fixed at index 2 in the command table, however now
OBJECT may be called with an insufficient number of arguments to extract
the key.

The "Right Thing" would be to have a specific function to extract keys
from the OBJECT command, however this is kind of overkill, so I
preferred to make getKeysUsingCommandTable() more robust and just return
no keys when it's not possible to honor the command table, because new
commands are often added and there are also a number of them with a HELP
subcommand violating the normal form, and crashing for this trivial
reason or having many command-specific key extraction functions is not
great.
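
The more forgiving behavior can be illustrated with a small sketch. It is a Python approximation, not the C function: the signature is hypothetical, the table values used for OBJECT in the usage note are assumptions, and `keystep` is assumed to be at least 1 whenever `firstkey` is non-zero.

```python
def get_keys_using_command_table(argv, firstkey, lastkey, keystep, arity):
    """Return the argv indexes holding keys, or [] if the table can't be honored."""
    if firstkey == 0:
        return []  # the command takes no keys
    # A negative lastkey counts from the end of the argument vector.
    last = lastkey if lastkey >= 0 else len(argv) + lastkey
    keys = []
    for idx in range(firstkey, last + 1, keystep):
        if idx >= len(argv):
            if arity < 0:
                # Variable-arity command called with too few arguments:
                # report no keys instead of crashing.
                return []
            raise RuntimeError("command table points past the provided arguments")
        keys.append(idx)
    return keys
```

For example, with the key fixed at index 2 and a negative arity, `["OBJECT", "foo"]` yields no keys while `["OBJECT", "ENCODING", "k"]` yields index 2.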
2018-01-18 12:15:23 +01:00
antirez
61135f1806 Document new protocol options in #4568 into redis.conf. 2018-01-18 12:15:12 +01:00
antirez
e77fba4d03 proto-max-querybuf-len -> client-query-buffer-limit.
We already had client buffer limits exported as configuration options.
Stick with the naming scheme already used.

See #4568.
2018-01-18 12:15:00 +01:00
antirez
87fe813b3a New config options about protocol prefixed with "proto".
Related to #4568.
2018-01-18 12:14:53 +01:00
gnuhpc
2e0d241420 Fix a typo(maybe instruction?) in crash log 2018-01-18 12:14:48 +01:00
Dvir Volk
9f7e214e8c Added RM_UnlinkKey - a low level analog to UNLINK command 2018-01-18 12:14:42 +01:00
zhaozhao.zz
947077bbcb redis-benchmark: bugfix - handle zero liveclients in right way 2018-01-18 12:14:22 +01:00
Oran Agra
ff2e628f4e Add config options for max-bulk-len and max-querybuf-len mainly to support RESTORE of large keys 2018-01-18 12:13:50 +01:00
Oran Agra
aefa9caacf fix processing of large bulks (above 2GB)
- protocol parsing (processMultibulkBuffer) was limited to 32bit positions in the buffer
  readQueryFromClient potential overflow
- rioWriteBulkCount used int, although rioWriteBulkString gave it size_t
- several places in sds.c that used int for string length or index.
- bugfix in RM_SaveAuxField (return was 1 or -1 and not length)
- RM_SaveStringBuffer was limited to 32bit length
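
To see why an int is not enough for lengths above 2GB, the following sketch mimics storing a large length in a 32-bit signed integer. The helper name is hypothetical; it only uses Python's standard `ctypes` module to reproduce the two's-complement wraparound.

```python
import ctypes


def as_int32(n: int) -> int:
    """Value a 32-bit signed C int would hold after being assigned n."""
    return ctypes.c_int32(n).value


GIB = 1024 ** 3
small_len = as_int32(1 * GIB)  # fits: stays positive
edge_len = as_int32(2 * GIB)   # one past INT32_MAX: wraps to the minimum value
large_len = as_int32(3 * GIB)  # wraps to a negative number
```

A 3GB bulk length stored this way becomes negative, so any buffer position or size check built on it misbehaves; using size_t or a 64-bit type for lengths avoids the wrap.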
2018-01-18 12:13:25 +01:00
heqin
896cf1a9d9 fixbug for #4545 dead loop aof rewrite 2018-01-18 12:12:58 +01:00
antirez
5abb12e04f Hyperloglog: refresh hdr variable correctly.
This is a fix for the #3819 improvements. The o->ptr may change because
of hllSparseSet() calls, so 'hdr' must be correctly re-fetched.
2018-01-09 18:51:04 +01:00
antirez
c39a0f7c2a Hyperloglog: Support for PFMERGE sparse encoding as target.
This is a fix for #3819.
2018-01-09 18:51:01 +01:00
antirez
8a012df9c6 Hyperloglog: refactoring of sparse/dense add function.
The commit splits the add functions into a set() and add() set of
functions, so that it's possible to set registers in an independent way
just having the index and count.
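
A minimal sketch of that split, using a plain list as a dense register array. Everything here is an assumption for illustration: the function names, the SHA-1 based hash (a swapped-in stand-in, not the hash Redis uses), and the register layout.

```python
import hashlib

P = 14        # bits used for the register index
M = 1 << P    # number of registers
Q = 64 - P    # remaining hash bits used for the run-length count


def pattern_len(element: bytes):
    """Map an element to (register index, count) from a 64-bit hash."""
    h = int.from_bytes(hashlib.sha1(element).digest()[:8], "little")
    index = h & (M - 1)
    rest = (h >> P) | (1 << Q)  # sentinel bit guarantees the loop terminates
    count = 1
    while rest & 1 == 0:
        count += 1
        rest >>= 1
    return index, count


def hll_set(registers, index: int, count: int) -> int:
    """Raise one register to count if it is larger; return 1 if it changed."""
    if count > registers[index]:
        registers[index] = count
        return 1
    return 0


def hll_add(registers, element: bytes) -> int:
    """Hash the element, then delegate the register update to hll_set()."""
    index, count = pattern_len(element)
    return hll_set(registers, index, count)


def hll_merge_into(target, source) -> None:
    """Merging needs only (index, count) pairs, so it can reuse hll_set()."""
    for index, count in enumerate(source):
        if count:
            hll_set(target, index, count)
```

Because `hll_set()` never sees the original element, a merge can update the target register by register, which is the kind of independent register update the split makes possible.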

Related to #3819, otherwise a fix is not possible.
2018-01-09 18:50:58 +01:00