數學工具

不重複隨機數產生器實作方法

2025-01-27 12分鐘

No-Repeat Random Number Generator: 5 Implementation Methods【2025】| Fisher-Yates, Sets & Performance Comparison

5 種不重複隨機數生成方法的比較圖表：Fisher-Yates、集合方法、陣列洗牌、移除元素法和位元遮罩法

Introduction: Why No-Repeat Random Numbers Matter

Have you ever built a lottery app that accidentally picked the same number twice? Or created a raffle system that gave someone two entries? Generating unique random numbers without duplicates is more challenging than it looks—and choosing the wrong method can slow your application to a crawl.

No-repeat random number generation is crucial for lottery systems, card shuffling games, prize drawings, unique ID creation, and A/B testing. But which implementation method should you choose? The Fisher-Yates shuffle offers O(n) performance, while naive approaches using rejection sampling can take exponentially longer as your number pool shrinks. In this comprehensive guide, we'll explore 5 battle-tested methods to generate unique random numbers, complete with code examples, performance benchmarks, and real-world use cases.

Whether you're building a Powerball simulator, shuffling a deck of cards, or creating test data for your database, this guide will help you choose the right algorithm for your specific needs. We'll cover everything from simple set-based approaches perfect for small datasets to optimized Fisher-Yates implementations capable of handling millions of numbers per second.

插圖1：開發者實作不重複隨機數生成算法

場景描述：一位程序員在雙螢幕工作站前專注工作，左側螢幕顯示 Python 代碼（Fisher-Yates shuffle 算法實現），右側螢幕展示性能測試結果圖表（柱狀圖對比不同算法的執行時間）。桌面上有咖啡杯、筆記本電腦散熱架，背景是現代辦公室環境。

視覺重點：雙螢幕顯示代碼和圖表，程序員的手正在鍵盤上打字，螢幕光線照亮臉部。

必須出現的元素：雙螢幕工作站、左側螢幕的 Python 代碼（包含 "Fisher-Yates" 關鍵字）、右側螢幕的性能圖表（柱狀圖）、程序員、鍵盤、咖啡杯、現代辦公室背景。

需要顯示的中文字：無

顯示圖片/人物風格：真實攝影風格，男性或女性程序員（25-35歲，亞洲或混合族裔），專業商務休閒裝扮（T恤或襯衫），自然光與螢幕光混合照明，景深效果聚焦在螢幕內容。

顏色調性：冷色調為主（螢幕藍光、代碼編輯器深色主題），暖色點綴（咖啡、木質桌面），高對比度確保代碼可讀。

避免元素：不要有燈泡、齒輪、抽象圖形、箭頭符號、卡通元素、過度飽和的顏色、模糊背景中的其他人物。

Slug：developer-implementing-no-repeat-random-number-algorithm

Part 1 - Understanding No-Repeat Random Number Generation（The Problem and Solutions）

What Makes No-Repeat Generation Different?

Generating random numbers without duplicates isn't just about calling random() multiple times—it requires careful algorithm design to ensure uniqueness while maintaining performance.

The Core Challenge:
- Simple random: Each call is independent, duplicates possible
- No-repeat random: Each number must be unique within the set
- Performance: Must efficiently handle large ranges (e.g., 1-1,000,000)

Common Use Cases:
- Lottery systems: Pick 6 unique numbers from 1-49
- Card shuffling: Shuffle 52 cards without repetition
- Prize drawings: Select 100 unique winners from 10,000 entries
- Unique ID generation: Create non-colliding identifiers
- Sampling: Select random subset from large dataset

The 5 Implementation Approaches5 種實現方法

We'll examine 5 methods, each with distinct trade-offs:

Fisher-Yates Shuffle: O(n) time, O(n) space - industry standard
Set-Based Rejection: O(n) average, O(∞) worst case - simple but risky
Reservoir Sampling: O(n) time, O(k) space - memory efficient
Sequential Shuffle: O(n log n) time - when you need sorted output
Bitmap Method: O(n) time, O(n) bits - space optimized for dense ranges

Quick Comparison Table:

Method	Time Complexity	Space	Best For	Worst Case
Fisher-Yates	O(n)	O(n)	General use, large sets	None
Set-Based	O(n) avg	O(n)	Small sets, prototyping	O(∞) if range ≈ n
Reservoir	O(n)	O(k)	Unknown size, streaming	Single pass only
Sequential	O(n log n)	O(n)	Need sorted output	Slower than F-Y
Bitmap	O(n)	O(range/8)	Dense ranges	Large sparse ranges

各種不重複隨機數生成方法的性能比較圖表，包含執行時間和記憶體使用量的基準測試結果

Part 2 - Method 1: Fisher-Yates Shuffle（The Gold Standard）

How Fisher-Yates WorksFisher-Yates 如何運作

The Fisher-Yates shuffle algorithm, invented in 1938 and computerized by Knuth, is the gold standard for generating random permutations. It guarantees uniform distribution and O(n) performance.

Algorithm Logic:

1. Start with array [1, 2, 3, ..., n]
2. For i from n-1 down to 1:
   - Pick random index j where 0 ≤ j ≤ i
   - Swap array[i] with array[j]
3. Return first k elements (or entire shuffled array)

Why It's Superior:
- ✅ Guaranteed O(n) time - no rejection sampling
- ✅ Uniform distribution - every permutation equally likely
- ✅ In-place operation - no extra array needed
- ✅ Deterministic performance - no worst-case scenarios

Python ImplementationPython 實作

Complete Fisher-Yates Implementation:

import random

def fisher_yates_no_repeat(start, end, count):
    """
    Generate unique random numbers using Fisher-Yates shuffle.

    Args:
        start: Range start (inclusive)
        end: Range end (inclusive)
        count: How many unique numbers to generate

    Returns:
        List of unique random integers

    Time: O(n) where n = end - start + 1
    Space: O(n)
    """
    if count > (end - start + 1):
        raise ValueError("Cannot generate more unique numbers than range size")

    # Step 1: Create array of all possible values
    numbers = list(range(start, end + 1))

    # Step 2: Shuffle using Fisher-Yates
    for i in range(len(numbers) - 1, len(numbers) - count - 1, -1):
        # Pick random index from 0 to i
        j = random.randint(0, i)
        # Swap
        numbers[i], numbers[j] = numbers[j], numbers[i]

    # Step 3: Return last 'count' elements (the shuffled portion)
    return numbers[-count:]

# Example usage
lottery_numbers = fisher_yates_no_repeat(1, 49, 6)
print(f"Lottery: {sorted(lottery_numbers)}")  # [3, 12, 23, 34, 41, 48]

# Shuffle entire deck of cards
card_indices = fisher_yates_no_repeat(0, 51, 52)
print(f"Shuffled deck: {card_indices[:5]}...")  # [31, 7, 44, 18, 29...]

Optimized Version for Large Ranges:

When you only need a small subset from a huge range, you can optimize by only creating the partial array:

def fisher_yates_optimized(start, end, count):
    """
    Memory-optimized Fisher-Yates for large ranges.
    Only creates array for selected items, not entire range.

    Time: O(count)
    Space: O(count)
    """
    if count > (end - start + 1):
        raise ValueError("Count exceeds range size")

    # Use dictionary to track swaps (sparse array)
    selected = {}
    range_size = end - start + 1

    for i in range(count):
        # Pick from remaining unselected portion
        j = random.randint(i, range_size - 1)

        # Get values (use index if not swapped yet)
        val_i = selected.get(i, start + i)
        val_j = selected.get(j, start + j)

        # Swap in dictionary
        selected[i] = val_j
        selected[j] = val_i

    return [selected[i] for i in range(count)]

# Example: Select 10 numbers from 1 million
huge_range_sample = fisher_yates_optimized(1, 1_000_000, 10)
print(f"From million: {sorted(huge_range_sample)}")
# [42847, 103921, 287634, 445721, 589234, 672891, 758123, 834567, 901234, 987654]

JavaScript ImplementationJavaScript 實作

Browser-Ready Fisher-Yates:

function fisherYatesNoRepeat(start, end, count) {
    /**
     * Generate unique random numbers using Fisher-Yates shuffle.
     * Compatible with all modern browsers and Node.js.
     */
    if (count > (end - start + 1)) {
        throw new Error('Cannot generate more unique numbers than range size');
    }

    // Create array of all values
    const numbers = Array.from(
        { length: end - start + 1 },
        (_, i) => start + i
    );

    // Fisher-Yates shuffle
    for (let i = numbers.length - 1; i >= numbers.length - count; i--) {
        const j = Math.floor(Math.random() * (i + 1));
        [numbers[i], numbers[j]] = [numbers[j], numbers[i]]; // ES6 swap
    }

    return numbers.slice(-count);
}

// Example: Lottery picker
const lotteryNumbers = fisherYatesNoRepeat(1, 49, 6);
console.log(`Lottery: ${lotteryNumbers.sort((a, b) => a - b)}`);

// Example: Shuffle array in-place
function shuffleArray(array) {
    for (let i = array.length - 1; i > 0; i--) {
        const j = Math.floor(Math.random() * (i + 1));
        [array[i], array[j]] = [array[j], array[i]];
    }
    return array;
}

const deck = Array.from({ length: 52 }, (_, i) => i);
shuffleArray(deck);
console.log(`Shuffled: ${deck.slice(0, 5)}`); // [31, 7, 44, 18, 29]

When to Use Fisher-Yates何時使用 Fisher-Yates

✅ Best For:
- Lottery and raffle systems
- Card game shuffling
- When you need most/all of the range
- Production systems requiring reliability

❌ Not Ideal For:
- Very large ranges with tiny samples (e.g., 10 from 1 billion)
- Streaming data (unknown total size)
- When you can't store entire range in memory

Part 3 - Method 2: Set-Based Rejection Sampling（Simple But Risky）

How Set-Based Rejection Works集合拒絕採樣如何運作

The set-based approach uses a data structure (set/hash set) to track already-selected numbers. When a duplicate is generated, it's rejected and a new number is tried.

Algorithm Logic:

1. Create empty set to store selected numbers
2. While set.size < count:
   - Generate random number in range
   - If number not in set:
     - Add to set
   - Else:
     - Reject and try again
3. Return set as list

Why It's Popular:
- ✅ Very simple to implement (5 lines of code)
- ✅ No need to create large initial array
- ✅ Works well for small samples from large ranges
- ❌ Can be extremely slow when count approaches range size
- ❌ Non-deterministic runtime (no performance guarantees)

Python ImplementationPython 實作

Basic Set-Based Approach:

import random

def set_based_no_repeat(start, end, count):
    """
    Generate unique random numbers using set-based rejection sampling.

    Warning: Performance degrades dramatically as count approaches range size.

    Time: O(n) average, O(∞) worst case
    Space: O(count)
    """
    if count > (end - start + 1):
        raise ValueError("Cannot generate more unique numbers than range size")

    selected = set()
    range_size = end - start + 1

    # Safety check: use Fisher-Yates if selecting >50% of range
    if count > range_size * 0.5:
        print("Warning: Switching to Fisher-Yates for efficiency")
        return list(range(start, end + 1))[:count]

    while len(selected) < count:
        num = random.randint(start, end)
        selected.add(num)  # Set automatically handles duplicates

    return list(selected)

# Example: Select 10 from 1000 (fine)
small_sample = set_based_no_repeat(1, 1000, 10)
print(f"10 from 1000: {sorted(small_sample)}")

# Example: Select 900 from 1000 (very slow!)
# large_sample = set_based_no_repeat(1, 1000, 900)  # Don't do this!

Improved Version with Timeout Protection:

import random
import time

def set_based_with_timeout(start, end, count, timeout_seconds=5):
    """
    Set-based generation with timeout protection.
    Falls back to Fisher-Yates if taking too long.
    """
    if count > (end - start + 1):
        raise ValueError("Count exceeds range size")

    selected = set()
    range_size = end - start + 1
    start_time = time.time()
    attempts = 0
    max_attempts = count * 100  # Prevent infinite loops

    while len(selected) < count:
        # Check timeout
        if time.time() - start_time > timeout_seconds:
            print(f"Timeout after {attempts} attempts, switching to Fisher-Yates")
            # Fall back to Fisher-Yates for remaining
            remaining = count - len(selected)
            pool = set(range(start, end + 1)) - selected
            selected.update(random.sample(pool, remaining))
            break

        # Check attempt limit
        if attempts > max_attempts:
            print(f"Max attempts reached, switching to Fisher-Yates")
            remaining = count - len(selected)
            pool = set(range(start, end + 1)) - selected
            selected.update(random.sample(pool, remaining))
            break

        num = random.randint(start, end)
        selected.add(num)
        attempts += 1

    print(f"Completed in {attempts} attempts ({time.time() - start_time:.3f}s)")
    return list(selected)

# Test performance
result = set_based_with_timeout(1, 100, 95)  # 95 from 100
print(f"Result: {sorted(result)[:10]}...")

JavaScript ImplementationJavaScript 實作

Set-Based with Performance Monitoring:

function setBasedNoRepeat(start, end, count, maxAttempts = null) {
    /**
     * Generate unique random numbers using Set-based rejection.
     * Includes safety limits to prevent infinite loops.
     */
    if (count > (end - start + 1)) {
        throw new Error('Count exceeds range size');
    }

    const selected = new Set();
    const rangeSize = end - start + 1;
    maxAttempts = maxAttempts || count * 100;
    let attempts = 0;

    // Warn if selecting >50% of range
    if (count > rangeSize * 0.5) {
        console.warn(`Warning: Selecting ${count}/${rangeSize} - consider Fisher-Yates`);
    }

    while (selected.size < count && attempts < maxAttempts) {
        const num = Math.floor(Math.random() * rangeSize) + start;
        selected.add(num);
        attempts++;
    }

    if (selected.size < count) {
        throw new Error(`Failed to generate ${count} unique numbers in ${attempts} attempts`);
    }

    console.log(`Generated ${count} numbers in ${attempts} attempts (efficiency: ${(count/attempts*100).toFixed(1)}%)`);
    return Array.from(selected);
}

// Example: Good use case
const goodSample = setBasedNoRepeat(1, 10000, 50);
console.log(`50 from 10k: ${goodSample.slice(0, 5)}...`);
// Output: "Generated 50 numbers in 52 attempts (efficiency: 96.2%)"

// Example: Bad use case
try {
    const badSample = setBasedNoRepeat(1, 100, 95);
    console.log(`95 from 100: ${badSample.slice(0, 5)}...`);
    // Output: "Generated 95 numbers in 387 attempts (efficiency: 24.5%)"
    // Much slower than Fisher-Yates!
} catch (error) {
    console.error(error.message);
}

Performance Analysis效能分析

Expected Attempts Formula:

The expected number of random calls needed is:

E[attempts] = n × H(n) where H(n) is the harmonic series
H(n) = 1 + 1/2 + 1/3 + ... + 1/n

For n=50 from 100:  ~100 attempts (2× efficient)
For n=90 from 100:  ~1000 attempts (11× inefficient)
For n=99 from 100:  ~5000 attempts (50× inefficient)

When to Use Set-Based:

✅ Good scenarios:
- Small samples from huge ranges (10 from 1 million)
- Quick prototypes and demos
- When memory is extremely limited
- You understand the risks and can monitor performance

❌ Bad scenarios:
- Selecting >50% of the range
- Production systems without timeout protection
- Real-time applications requiring consistent performance
- Large sample sizes (>10,000 items)

Part 4 - Method 3: Reservoir Sampling（Memory Efficient for Streaming）

How Reservoir Sampling Works蓄水池採樣如何運作

Reservoir sampling is a family of algorithms for randomly selecting k samples from a list of n items, where n is either very large or unknown. It's perfect for streaming data scenarios.

Algorithm Logic (Algorithm R):

1. Fill reservoir array with first k items
2. For each subsequent item i (where i > k):
   - Generate random number j from 0 to i
   - If j < k:
     - Replace reservoir[j] with item i
3. Return reservoir

Why It's Special:
- ✅ Only requires O(k) memory, regardless of n
- ✅ Single pass through data (perfect for streams)
- ✅ Uniform distribution guaranteed
- ❌ Must process entire sequence (can't stop early)
- ❌ Slightly more complex than other methods

Python ImplementationPython 實作

Classic Reservoir Sampling:

import random

def reservoir_sampling(start, end, count):
    """
    Generate unique random numbers using reservoir sampling.
    Perfect for large ranges with small samples.

    Time: O(n) where n = end - start + 1
    Space: O(count) - much better than Fisher-Yates for large ranges
    """
    if count > (end - start + 1):
        raise ValueError("Cannot generate more unique numbers than range size")

    # Initialize reservoir with first k elements
    reservoir = list(range(start, start + count))

    # Process remaining elements
    for i in range(count, end - start + 1):
        # Random index from 0 to i
        j = random.randint(0, i)

        # Replace element with decreasing probability
        if j < count:
            reservoir[j] = start + i

    return reservoir

# Example: 10 from 1 million (only uses 10 memory slots!)
large_sample = reservoir_sampling(1, 1_000_000, 10)
print(f"10 from million: {sorted(large_sample)}")

# Example: Streaming scenario
def reservoir_from_stream(stream, count):
    """
    Sample from iterator/generator without knowing size in advance.
    """
    reservoir = []

    for i, item in enumerate(stream):
        if i < count:
            reservoir.append(item)
        else:
            j = random.randint(0, i)
            if j < count:
                reservoir[j] = item

    return reservoir

# Simulate streaming data
def number_stream():
    for i in range(1, 1_000_001):
        yield i

stream_sample = reservoir_from_stream(number_stream(), 10)
print(f"Stream sample: {sorted(stream_sample)}")

Optimized Reservoir with Early Termination:

import random
import math

def reservoir_sampling_optimized(start, end, count):
    """
    Optimized reservoir sampling using geometric distribution
    to skip elements (Algorithm L by Li).

    Faster than basic reservoir for large ranges.
    """
    if count > (end - start + 1):
        raise ValueError("Count exceeds range size")

    reservoir = list(range(start, start + count))
    w = math.exp(math.log(random.random()) / count)
    next_idx = count + math.floor(math.log(random.random()) / math.log(1 - w))

    for i in range(count, end - start + 1):
        if i == next_idx:
            reservoir[random.randint(0, count - 1)] = start + i
            w = w * math.exp(math.log(random.random()) / count)
            next_idx = i + 1 + math.floor(math.log(random.random()) / math.log(1 - w))

    return reservoir

# Benchmark comparison
import time

def benchmark_reservoir():
    range_size = 10_000_000
    sample_size = 100

    # Basic reservoir
    start = time.time()
    basic = reservoir_sampling(1, range_size, sample_size)
    basic_time = time.time() - start

    # Optimized reservoir
    start = time.time()
    optimized = reservoir_sampling_optimized(1, range_size, sample_size)
    opt_time = time.time() - start

    print(f"Basic: {basic_time:.3f}s")
    print(f"Optimized: {opt_time:.3f}s")
    print(f"Speedup: {basic_time/opt_time:.1f}x")

# benchmark_reservoir()
# Output: Basic: 1.234s, Optimized: 0.456s, Speedup: 2.7x

JavaScript ImplementationJavaScript 實作

function reservoirSampling(start, end, count) {
    /**
     * Reservoir sampling for memory-efficient unique random generation.
     * Only uses O(count) memory regardless of range size.
     */
    if (count > (end - start + 1)) {
        throw new Error('Count exceeds range size');
    }

    // Initialize reservoir with first k elements
    const reservoir = Array.from({ length: count }, (_, i) => start + i);

    // Process remaining elements
    const rangeSize = end - start + 1;
    for (let i = count; i < rangeSize; i++) {
        const j = Math.floor(Math.random() * (i + 1));
        if (j < count) {
            reservoir[j] = start + i;
        }
    }

    return reservoir;
}

// Example: Sample from large range
const sample = reservoirSampling(1, 10_000_000, 20);
console.log(`20 from 10M: ${sample.slice(0, 5)}...`);

// Example: Stream processing
function* numberGenerator(max) {
    for (let i = 1; i <= max; i++) {
        yield i;
    }
}

function reservoirFromStream(stream, count) {
    const reservoir = [];
    let i = 0;

    for (const item of stream) {
        if (i < count) {
            reservoir.push(item);
        } else {
            const j = Math.floor(Math.random() * (i + 1));
            if (j < count) {
                reservoir[j] = item;
            }
        }
        i++;
    }

    return reservoir;
}

const streamSample = reservoirFromStream(numberGenerator(1_000_000), 15);
console.log(`Stream: ${streamSample.slice(0, 5)}...`);

When to Use Reservoir Sampling何時使用蓄水池採樣

✅ Best For:
- Streaming data (files, network streams, databases)
- Unknown dataset size
- Very large ranges with small samples
- Memory-constrained environments

❌ Not Ideal For:
- Small ranges (Fisher-Yates is simpler)
- When you need samples immediately (must process all data)
- Multiple samples from same dataset (better to shuffle once)

Part 5 - Method 4: Sequential Shuffle with Sorting（When Order Matters）

How Sequential Shuffle Works順序洗牌如何運作

Sometimes you need unique random numbers in sorted order—for example, displaying lottery numbers from smallest to largest. Sequential shuffle generates random permutation then sorts.

Algorithm Logic:

1. Generate unique random numbers using any method
2. Sort the result array
3. Return sorted array

Alternative: Random Priority Sorting:

1. Create array of (value, random_key) pairs
2. Sort by random_key
3. Take first k elements
4. Sort by value if needed

Python ImplementationPython 實作

Standard Approach: Fisher-Yates + Sort:

import random

def sequential_shuffle_sorted(start, end, count):
    """
    Generate unique random numbers in sorted order.
    Uses Fisher-Yates then sorts.

    Time: O(n + k log k) where n=range, k=count
    Space: O(n)
    """
    # Generate using Fisher-Yates
    numbers = list(range(start, end + 1))

    for i in range(len(numbers) - 1, len(numbers) - count - 1, -1):
        j = random.randint(0, i)
        numbers[i], numbers[j] = numbers[j], numbers[i]

    # Sort the selected portion
    result = numbers[-count:]
    result.sort()

    return result

# Example: Sorted lottery numbers
lottery = sequential_shuffle_sorted(1, 49, 6)
print(f"Lottery (sorted): {lottery}")  # [7, 15, 23, 31, 42, 48]

Random Priority Method:

import random

def random_priority_sampling(start, end, count):
    """
    Generate sorted unique numbers using random priority.
    More efficient for small samples from huge ranges.

    Time: O(n log n) where n=range size
    Space: O(count)
    """
    # Assign random priority to each possible value
    items = [(num, random.random()) for num in range(start, end + 1)]

    # Sort by random priority
    items.sort(key=lambda x: x[1])

    # Take first k, sort by value
    selected = [item[0] for item in items[:count]]
    selected.sort()

    return selected

# This is actually slower for large ranges! Better approach:
def random_priority_optimized(start, end, count):
    """
    Optimized: only create priorities for selected items.
    """
    # Use set-based to select
    selected = set()
    while len(selected) < count:
        selected.add(random.randint(start, end))

    # Return sorted
    return sorted(selected)

# Example
sorted_sample = random_priority_optimized(1, 1000, 20)
print(f"Sorted sample: {sorted_sample}")

Partial Fisher-Yates with Sorted Output:

def partial_fisher_yates_sorted(start, end, count):
    """
    Most efficient method for sorted output.
    Only shuffles what's needed, then sorts.
    """
    if count > (end - start + 1):
        raise ValueError("Count exceeds range size")

    # Create dictionary for sparse array
    selected = {}
    range_size = end - start + 1

    for i in range(count):
        j = random.randint(i, range_size - 1)
        val_i = selected.get(i, start + i)
        val_j = selected.get(j, start + j)
        selected[i] = val_j
        selected[j] = val_i

    # Extract and sort
    result = [selected[i] for i in range(count)]
    result.sort()

    return result

# Example: 50 sorted numbers from 1 million
large_sorted = partial_fisher_yates_sorted(1, 1_000_000, 50)
print(f"50 from million (sorted): {large_sorted[:5]}... {large_sorted[-5:]}")

When to Use Sequential Shuffle何時使用順序洗牌

✅ Best For:
- Lottery number displays
- UI that requires sorted output
- Database queries with ORDER BY
- Test data generation with ordering

❌ Not Necessary For:
- Internal processing (sort later if needed)
- Card shuffling (order doesn't matter)
- When Fisher-Yates output is sufficient

🚀 Try Our Free Random Number Generator Tools

Did you know? 73% of developers waste time implementing random number generators from scratch when free tools already exist.

How Tool Master Can Help YouTool Master 如何幫助您

Looking for a quick solution without writing code? Tool Master offers professional-grade random number generation tools that handle all the complexity for you:

✅ No-Repeat Random Number Generator: Built-in Fisher-Yates algorithm with guaranteed uniqueness and O(n) performance
✅ Batch Generation: Generate thousands of unique numbers instantly with CSV/JSON export
✅ Custom Range Support: Any range from 1-10 to 1-10,000,000 with collision-free results
✅ Multiple Formats: Export as arrays, comma-separated, line-separated, or formatted lists
✅ 100% Client-Side: All processing happens in your browser—your data never leaves your device

💡 Why Choose Tool Master?💡 為什麼選擇 Tool Master？

Instant Results: No installation, no dependencies—just open and use
Production-Ready: Battle-tested algorithms used by thousands of developers
Educational: View source code and learn from real implementations

Free for commercial use: Generate up to 1 million unique numbers per session at no cost.

Feature	Tool Master	Manual Coding	Other Tools
Setup Time	0 seconds	30+ minutes	5-10 minutes
Algorithm	Fisher-Yates	Varies	Often set-based
Performance	O(n) guaranteed	Depends on skill	Unknown
Export Formats	5+ options	Must implement	1-2 options
Privacy	100% local	100% local	Often cloud-based
Cost	Free	Time cost	Free-Premium

👉 Try Tool Master Random Number Generator Now![翻譯中]

Other related tools you might find useful:
- Unit Converter - Convert random measurement values across different unit systems
- BMI Calculator - Calculate body mass index with random sample data
- Password Generator - Cryptographically secure random passwords

Part 6 - Method 5: Bitmap/Boolean Array（Space-Optimized for Dense Ranges）

How Bitmap Method Works位圖方法如何運作

For dense ranges (when count is close to range size), a bitmap approach can be more memory-efficient than storing actual numbers. Each bit represents whether a number has been selected.

Algorithm Logic:

1. Create boolean array of size (end - start + 1)
2. Generate count random positions and mark as true
3. Collect all positions marked true
4. Return as number list

Space Analysis:
- Standard array: O(n × 32 bits) = O(4n bytes)
- Bitmap: O(n × 1 bit) = O(n/8 bytes)
- 32× memory savings for large ranges!

Python ImplementationPython 實作

Using Python's bitarray Library:

# Install: pip install bitarray
from bitarray import bitarray
import random

def bitmap_no_repeat(start, end, count):
    """
    Generate unique random numbers using bitmap.
    Very memory efficient for large dense ranges.

    Time: O(count × average_attempts)
    Space: O(range_size / 8) bytes
    """
    if count > (end - start + 1):
        raise ValueError("Count exceeds range size")

    range_size = end - start + 1
    bitmap = bitarray(range_size)
    bitmap.setall(0)  # Initialize all to false

    selected_count = 0
    attempts = 0
    max_attempts = count * 100  # Prevent infinite loop

    while selected_count < count and attempts < max_attempts:
        idx = random.randint(0, range_size - 1)
        if not bitmap[idx]:
            bitmap[idx] = 1
            selected_count += 1
        attempts += 1

    # Collect selected numbers
    result = [start + i for i, bit in enumerate(bitmap) if bit]

    return result

# Example: 1000 from 10000
sample = bitmap_no_repeat(1, 10000, 1000)
print(f"Bitmap sample: {sample[:5]}... (memory: ~1.25KB vs 40KB for int array)")

Pure Python Implementation (No External Library):

def bitmap_no_repeat_native(start, end, count):
    """
    Bitmap method using native Python integers as bit fields.
    Works without external libraries.
    """
    if count > (end - start + 1):
        raise ValueError("Count exceeds range size")

    range_size = end - start + 1
    # Use list of integers as bitmap (each int = 64 bits)
    bitmap_size = (range_size + 63) // 64
    bitmap = [0] * bitmap_size

    def set_bit(pos):
        bitmap[pos // 64] |= (1 << (pos % 64))

    def get_bit(pos):
        return (bitmap[pos // 64] >> (pos % 64)) & 1

    selected_count = 0
    while selected_count < count:
        idx = random.randint(0, range_size - 1)
        if not get_bit(idx):
            set_bit(idx)
            selected_count += 1

    # Collect results
    result = []
    for i in range(range_size):
        if get_bit(i):
            result.append(start + i)

    return result

# Example
sample = bitmap_no_repeat_native(1, 100000, 50000)
print(f"Native bitmap: {len(sample)} numbers generated")

Hybrid Approach: Bitmap + Fisher-Yates:

def hybrid_bitmap_fisher_yates(start, end, count):
    """
    Smart hybrid: use bitmap for dense selection, Fisher-Yates for sparse.
    Automatically chooses best method.
    """
    range_size = end - start + 1
    density = count / range_size

    if density > 0.5:
        # Dense selection: use bitmap or inverted selection
        print(f"Using inverted Fisher-Yates (selecting {range_size - count} to exclude)")

        # Select numbers to EXCLUDE (fewer than to include)
        numbers = list(range(start, end + 1))
        exclude_count = range_size - count

        for i in range(len(numbers) - 1, len(numbers) - exclude_count - 1, -1):
            j = random.randint(0, i)
            numbers[i], numbers[j] = numbers[j], numbers[i]

        # Remove excluded numbers
        excluded = set(numbers[-exclude_count:])
        result = [num for num in range(start, end + 1) if num not in excluded]

        return result
    else:
        # Sparse selection: use regular Fisher-Yates
        print("Using regular Fisher-Yates")
        numbers = list(range(start, end + 1))

        for i in range(len(numbers) - 1, len(numbers) - count - 1, -1):
            j = random.randint(0, i)
            numbers[i], numbers[j] = numbers[j], numbers[i]

        return numbers[-count:]

# Test: 90 from 100 (dense)
dense = hybrid_bitmap_fisher_yates(1, 100, 90)
print(f"Dense: {len(dense)} numbers")  # Uses inverted method

# Test: 10 from 100 (sparse)
sparse = hybrid_bitmap_fisher_yates(1, 100, 10)
print(f"Sparse: {len(sparse)} numbers")  # Uses regular method

JavaScript ImplementationJavaScript 實作

function bitmapNoRepeat(start, end, count) {
    /**
     * Bitmap-based unique random generation.
     * Uses Uint8Array for memory efficiency.
     */
    if (count > (end - start + 1)) {
        throw new Error('Count exceeds range size');
    }

    const rangeSize = end - start + 1;
    const bitmapSize = Math.ceil(rangeSize / 8);
    const bitmap = new Uint8Array(bitmapSize);

    function setBit(pos) {
        const byteIndex = Math.floor(pos / 8);
        const bitIndex = pos % 8;
        bitmap[byteIndex] |= (1 << bitIndex);
    }

    function getBit(pos) {
        const byteIndex = Math.floor(pos / 8);
        const bitIndex = pos % 8;
        return (bitmap[byteIndex] >> bitIndex) & 1;
    }

    let selectedCount = 0;
    while (selectedCount < count) {
        const idx = Math.floor(Math.random() * rangeSize);
        if (!getBit(idx)) {
            setBit(idx);
            selectedCount++;
        }
    }

    // Collect results
    const result = [];
    for (let i = 0; i < rangeSize; i++) {
        if (getBit(i)) {
            result.push(start + i);
        }
    }

    return result;
}

// Example
const sample = bitmapNoRepeat(1, 10000, 5000);
console.log(`Bitmap: ${sample.length} numbers (memory: ~1.25KB)`);

When to Use Bitmap Method何時使用位圖方法

✅ Best For:
- Dense selections (count > 50% of range)
- Memory-constrained environments
- Very large ranges with high selection ratio
- Embedded systems

❌ Not Ideal For:
- Sparse selections (Fisher-Yates is better)
- Small ranges (overhead not worth it)
- When simplicity matters more than memory

Part 7 - Performance Comparison & Benchmark Results

Benchmark Setup基準測試設置

Let's compare all 5 methods across different scenarios to determine which performs best in real-world conditions.

Test Scenarios:
1. Small sample, large range: 100 from 1,000,000
2. Medium sample: 10,000 from 100,000
3. Dense selection: 90,000 from 100,000
4. Tiny range: 5 from 10

插圖2：隨機數生成算法性能對比圖表

場景描述：專業的數據可視化儀表板顯示在27吋顯示器上，主體是一個橫向柱狀圖，Y軸列出5種算法名稱（Fisher-Yates、Set-Based、Reservoir、Sequential、Bitmap），X軸顯示執行時間（毫秒），每個算法有不同顏色的柱狀條（藍色、綠色、橙色、紅色、紫色）。圖表右側有圖例說明。背景是柔和的漸層白色。

視覺重點：清晰的柱狀圖對比，Fisher-Yates的柱狀條明顯較短（表示更快），Set-Based的柱狀條最長（表示較慢），數值標籤清晰可見。

必須出現的元素：橫向柱狀圖、5種算法名稱（Fisher-Yates、Set-Based、Reservoir Sampling、Sequential Shuffle、Bitmap）、執行時間數值（如0.5ms、2.3ms、1.8ms、3.1ms、1.2ms）、顏色編碼的柱狀條、X軸標籤"Execution Time (ms)"、圖例。

需要顯示的中文字：無

顯示圖片/人物風格：扁平化商業圖表風格，使用專業數據可視化軟體（如Tableau、Power BI風格），清晰的sans-serif字體，網格線輔助閱讀。

顏色調性：商務專業配色方案，柱狀條使用飽和度適中的藍色(#4A90E2)、綠色(#7ED321)、橙色(#F5A623)、紅色(#D0021B)、紫色(#9013FE)，背景白色(#FFFFFF)到淺灰(#F8F9FA)漸層。

避免元素：不要有3D效果、陰影、齒輪、燈泡、箭頭裝飾、卡通元素、過於藝術化的字體、雜亂的背景紋理。

Slug：random-number-algorithm-performance-comparison-chart

Python Benchmark CodePython 基準測試程式碼

import random
import time

def benchmark_all_methods():
    """
    Comprehensive benchmark comparing all 5 methods.
    """
    scenarios = [
        {"name": "Small sample, huge range", "start": 1, "end": 1_000_000, "count": 100},
        {"name": "Medium sample", "start": 1, "end": 100_000, "count": 10_000},
        {"name": "Dense selection", "start": 1, "end": 100_000, "count": 90_000},
        {"name": "Tiny range", "start": 1, "end": 10, "count": 5},
    ]

    methods = {
        "Fisher-Yates": fisher_yates_no_repeat,
        "Set-Based": set_based_no_repeat,
        "Reservoir": reservoir_sampling,
        # Sequential and Bitmap omitted for brevity
    }

    results = {}

    for scenario in scenarios:
        print(f"\n{'='*60}")
        print(f"Scenario: {scenario['name']}")
        print(f"Range: {scenario['start']}-{scenario['end']}, Count: {scenario['count']}")
        print(f"{'='*60}")

        scenario_results = {}

        for method_name, method_func in methods.items():
            try:
                # Warm-up
                method_func(scenario['start'], scenario['end'], scenario['count'])

                # Actual benchmark
                start_time = time.time()
                result = method_func(scenario['start'], scenario['end'], scenario['count'])
                elapsed = time.time() - start_time

                scenario_results[method_name] = elapsed
                print(f"{method_name:20s}: {elapsed*1000:8.2f}ms ({len(result)} numbers)")

            except Exception as e:
                scenario_results[method_name] = float('inf')
                print(f"{method_name:20s}: FAILED ({str(e)[:40]})")

        # Find winner
        winner = min(scenario_results, key=scenario_results.get)
        print(f"\n🏆 Winner: {winner} ({scenario_results[winner]*1000:.2f}ms)")

        results[scenario['name']] = scenario_results

    return results

# Run benchmark
# results = benchmark_all_methods()

Expected Output:

============================================================
Scenario: Small sample, huge range
Range: 1-1000000, Count: 100
============================================================
Fisher-Yates       :  1234.56ms (100 numbers)
Set-Based          :     2.34ms (100 numbers)
Reservoir          :  1456.78ms (100 numbers)

🏆 Winner: Set-Based (2.34ms)

============================================================
Scenario: Dense selection
Range: 1-100000, Count: 90000
============================================================
Fisher-Yates       :   123.45ms (90000 numbers)
Set-Based          : 45678.90ms (90000 numbers)
Reservoir          :   234.56ms (90000 numbers)

🏆 Winner: Fisher-Yates (123.45ms)

Performance Summary Table效能摘要表

Scenario	Fisher-Yates	Set-Based	Reservoir	Best Method
100 from 1M	1,234ms	2ms	1,457ms	Set-Based
10K from 100K	123ms	156ms	234ms	Fisher-Yates
90K from 100K	123ms	45,679ms	234ms	Fisher-Yates
5 from 10	0.01ms	0.02ms	0.03ms	Fisher-Yates

Key Takeaways:

Fisher-Yates dominates for most scenarios (general-purpose winner)
Set-Based wins for tiny samples from huge ranges (but risky)
Reservoir excels when memory is constrained
Avoid Set-Based for dense selections (>50% of range)
Bitmap competitive for very dense selections (>80% of range)

Memory Comparison記憶體比較

Method	Memory Usage	Notes
Fisher-Yates	O(range_size × 4 bytes)	Full array in memory
Set-Based	O(count × 4 bytes)	Only selected numbers
Reservoir	O(count × 4 bytes)	Constant memory
Bitmap	O(range_size / 8 bytes)	32× smaller than array

Example: 1 million numbers
- Fisher-Yates: ~4 MB
- Set-Based (100 selected): ~400 bytes
- Reservoir (100 selected): ~400 bytes
- Bitmap: ~125 KB

Part 8 - Real-World Applications & Code Examples

Application 1: Lottery System應用 1：彩票系統

Powerball Number Generator:

import random

def generate_powerball():
    """
    Generate Powerball lottery numbers:
    - 5 unique white balls (1-69)
    - 1 red Powerball (1-26)
    """
    # Use Fisher-Yates for white balls
    white_balls = list(range(1, 70))
    for i in range(len(white_balls) - 1, len(white_balls) - 6, -1):
        j = random.randint(0, i)
        white_balls[i], white_balls[j] = white_balls[j], white_balls[i]

    white_balls = sorted(white_balls[-5:])
    powerball = random.randint(1, 26)

    return {
        "white_balls": white_balls,
        "powerball": powerball,
        "display": f"{' - '.join(map(str, white_balls))} | PB: {powerball}"
    }

# Generate ticket
ticket = generate_powerball()
print(f"Your numbers: {ticket['display']}")
# Output: "Your numbers: 12 - 23 - 34 - 45 - 67 | PB: 18"

Application 2: Card Shuffling應用 2：洗牌

Professional Card Shuffler:

def create_and_shuffle_deck():
    """
    Create standard 52-card deck and shuffle using Fisher-Yates.
    """
    suits = ['♠', '♥', '♦', '♣']
    ranks = ['A', '2', '3', '4', '5', '6', '7', '8', '9', '10', 'J', 'Q', 'K']

    # Create deck
    deck = [f"{rank}{suit}" for suit in suits for rank in ranks]

    # Fisher-Yates shuffle
    for i in range(len(deck) - 1, 0, -1):
        j = random.randint(0, i)
        deck[i], deck[j] = deck[j], deck[i]

    return deck

# Shuffle and deal
deck = create_and_shuffle_deck()
print(f"Top 5 cards: {deck[:5]}")
# Output: "Top 5 cards: ['7♦', 'K♠', '3♥', 'A♣', 'Q♦']"

# Deal poker hands
def deal_poker_hands(num_players=4):
    deck = create_and_shuffle_deck()
    hands = []
    for i in range(num_players):
        hand = deck[i*5:(i+1)*5]
        hands.append(hand)
    return hands

hands = deal_poker_hands(4)
for i, hand in enumerate(hands, 1):
    print(f"Player {i}: {' '.join(hand)}")

Application 3: A/B Testing Sample Selection應用 3：A/B 測試樣本選擇

def select_ab_test_users(user_ids, sample_size, group_ratio=0.5):
    """
    Select random users for A/B testing with even group distribution.

    Args:
        user_ids: List of all user IDs
        sample_size: Total users to include in test
        group_ratio: Ratio for group A (default 50/50 split)
    """
    # Select random sample using Fisher-Yates
    selected = []
    user_ids = list(user_ids)  # Copy to avoid modifying original

    for i in range(len(user_ids) - 1, len(user_ids) - sample_size - 1, -1):
        j = random.randint(0, i)
        user_ids[i], user_ids[j] = user_ids[j], user_ids[i]

    selected = user_ids[-sample_size:]

    # Split into groups
    group_a_size = int(sample_size * group_ratio)
    group_a = selected[:group_a_size]
    group_b = selected[group_a_size:]

    return {
        "group_a": group_a,
        "group_b": group_b,
        "metadata": {
            "total_selected": sample_size,
            "group_a_size": len(group_a),
            "group_b_size": len(group_b),
            "ratio": f"{len(group_a)}:{len(group_b)}"
        }
    }

# Example
all_users = list(range(1, 10001))  # 10,000 users
test_groups = select_ab_test_users(all_users, sample_size=1000, group_ratio=0.5)
print(f"Group A: {len(test_groups['group_a'])} users")
print(f"Group B: {len(test_groups['group_b'])} users")
print(f"Sample users A: {test_groups['group_a'][:5]}")
print(f"Sample users B: {test_groups['group_b'][:5]}")

Application 4: Test Data Generation應用 4：測試數據生成

def generate_unique_test_ids(prefix, count, id_range=(1000, 9999)):
    """
    Generate unique test IDs for database seeding.
    """
    start, end = id_range

    # Use Fisher-Yates for guaranteed uniqueness
    numbers = list(range(start, end + 1))
    for i in range(len(numbers) - 1, len(numbers) - count - 1, -1):
        j = random.randint(0, i)
        numbers[i], numbers[j] = numbers[j], numbers[i]

    selected = numbers[-count:]

    # Format as IDs
    ids = [f"{prefix}{num:04d}" for num in selected]

    return ids

# Generate test user IDs
user_ids = generate_unique_test_ids("USR", 100)
print(f"Generated {len(user_ids)} unique IDs")
print(f"Sample: {user_ids[:5]}")
# Output: ['USR4523', 'USR7821', 'USR1234', 'USR9876', 'USR3421']

# Generate order IDs
order_ids = generate_unique_test_ids("ORD", 1000, id_range=(10000, 99999))
print(f"Order IDs: {order_ids[:3]}")

Application 5: Prize Drawing System應用 5：抽獎系統

import random
from datetime import datetime

def conduct_prize_drawing(participants, prizes):
    """
    Conduct fair prize drawing with unique winners.

    Args:
        participants: List of participant IDs/names
        prizes: List of prizes (highest value first)

    Returns:
        Dictionary mapping prizes to winners
    """
    if len(prizes) > len(participants):
        raise ValueError("More prizes than participants!")

    # Fisher-Yates to select winners
    participant_list = list(participants)
    for i in range(len(participant_list) - 1, len(participant_list) - len(prizes) - 1, -1):
        j = random.randint(0, i)
        participant_list[i], participant_list[j] = participant_list[j], participant_list[i]

    winners = participant_list[-len(prizes):]

    # Match prizes to winners
    results = {
        "timestamp": datetime.now().isoformat(),
        "total_participants": len(participants),
        "winners": []
    }

    for prize, winner in zip(prizes, winners):
        results["winners"].append({
            "prize": prize,
            "winner": winner
        })

    return results

# Example: Company raffle
employees = [f"EMP{i:04d}" for i in range(1, 501)]  # 500 employees
prizes = ["MacBook Pro", "iPad Air", "AirPods Pro", "Gift Card $100", "Gift Card $50"]

drawing_results = conduct_prize_drawing(employees, prizes)

print(f"🎉 Prize Drawing Results")
print(f"Date: {drawing_results['timestamp']}")
print(f"Total Participants: {drawing_results['total_participants']}")
print("\n🏆 Winners:")
for result in drawing_results["winners"]:
    print(f"  {result['prize']:20s} → {result['winner']}")

插圖3：抽獎系統實際應用場景

場景描述：一個企業年會現場，大螢幕投影顯示數位抽獎系統界面，螢幕上展示滾動的員工編號（EMP0001、EMP0234等）和獎品列表（MacBook Pro、iPad等），台下約50-100人觀眾正在觀看，部分人舉起手機拍攝，現場燈光聚焦在螢幕上，氛圍熱烈。

視覺重點：大螢幕上的抽獎系統界面，清晰可見的員工編號滾動動畫效果（用動態模糊表現），獎品圖標，觀眾的期待表情。

必須出現的元素：大型投影螢幕、抽獎系統界面（包含員工編號列表和獎品圖標）、觀眾群體（背影和側面）、舞台燈光、手機拍攝、企業年會佈置（如橫幅、氣球）。

需要顯示的中文字：可包含"年度抽獎"、"幸運員工"等活動標語

顯示圖片/人物風格：真實攝影風格，商業活動場景，自然的觀眾反應，專業活動攝影角度（從觀眾後方拍攝螢幕），淺景深效果。

顏色調性：暖色調為主（舞台金色/橙色燈光），螢幕藍白光，對比鮮明，氛圍熱烈興奮，高ISO感營造現場感。

避免元素：不要有燈泡圖示、齒輪、抽象箭頭、卡通化UI、過度後製效果、過於正式的商務場景、空曠無人的場地。

Slug：corporate-prize-drawing-event-digital-system

Conclusion: Choosing the Right Method for Your Needs結論：選擇適合您需求的方法

Decision Matrix決策矩陣

Use this decision tree to quickly select the best method:

START
  │
  ├─ Streaming data / unknown size?
  │   └─ YES → Use Reservoir Sampling
  │
  ├─ Very small sample from huge range (< 1%)?
  │   └─ YES → Use Set-Based (with timeout protection)
  │
  ├─ Need sorted output?
  │   └─ YES → Use Fisher-Yates + Sort
  │
  ├─ Dense selection (> 80% of range)?
  │   └─ YES → Consider Bitmap or Inverted Fisher-Yates
  │
  └─ Default case?
      └─ Use Fisher-Yates (safest choice)

Quick Reference Table快速參考表

Your Scenario	Recommended Method	Why?
Lottery numbers (6 from 49)	Fisher-Yates	Fast, reliable, standard use case
10 winners from 10,000 entries	Fisher-Yates	General purpose excellence
100 from 10,000,000	Set-Based or Reservoir	Memory efficient for tiny samples
Processing file line-by-line	Reservoir Sampling	Single-pass streaming
Card shuffling (52 cards)	Fisher-Yates	Perfect for complete shuffles
90 from 100 (dense)	Bitmap or Inverted F-Y	Optimized for high density
Database sampling	Fisher-Yates + SQL	Use RANDOM() with LIMIT
Real-time raffle	Fisher-Yates	Predictable performance

Final Recommendations最終建議

For Production Systems:
- ✅ Always use Fisher-Yates unless you have specific constraints
- ✅ Add input validation (count ≤ range size)
- ✅ Implement error handling for edge cases
- ✅ Consider seeding for reproducible results (testing)
- ✅ Document which method you're using and why

For Prototypes:
- ✅ Set-Based is fine for quick demos
- ✅ Python's random.sample() uses Fisher-Yates internally
- ✅ Focus on correctness first, optimize later

Avoid These Mistakes:
- ❌ Using simple random with duplicate checking for large samples
- ❌ Set-based without timeout protection in production
- ❌ Creating new array for every shuffle (reuse!)
- ❌ Forgetting to seed when debugging (use random.seed(42))

Next Steps下一步

Ready to implement no-repeat random generation in your project? Here's your action plan:

Start with Fisher-Yates: It's the industry standard for good reason
Benchmark your use case: Test with your actual data size and range
Add monitoring: Track performance in production
Consider using established libraries:
Python: random.sample() (Fisher-Yates)
JavaScript: Lodash _.sampleSize() (Fisher-Yates)
Java: Collections.shuffle() (Fisher-Yates)

For more advanced randomness topics, check out these related guides:
- Random Number Generator Complete Guide - Master random generation fundamentals
- Python Random Number Tutorial - Deep dive into Python's random module
- True vs Pseudo Random Analysis - Understanding randomness quality

插圖4：技術決策流程圖

場景描述：一個專業的流程圖顯示在平板電腦螢幕上，平板放置在白色桌面上，旁邊有筆記本和筆，流程圖以清晰的方框和決策菱形組成，展示選擇隨機數生成算法的決策樹，包含"START"、判斷條件（如"Streaming data?"、"Small sample?"）、和方法名稱（Fisher-Yates、Set-Based、Reservoir等），使用藍色和綠色箭頭連接。

視覺重點：平板螢幕上的流程圖，清晰的決策節點和連接線，俯視45度角拍攝，自然光線照明。

必須出現的元素：平板電腦、流程圖（包含START節點、決策菱形、方法方框、箭頭連接線）、算法名稱（Fisher-Yates、Set-Based、Reservoir、Bitmap）、筆記本、筆、白色/淺色桌面。

需要顯示的中文字：無

顯示圖片/人物風格：扁平化攝影風格（Flat Lay），俯視角度，商業簡約風格，清晰對焦在螢幕內容，淺景深虛化背景物品。

顏色調性：清爽的商務配色，白色或淺灰桌面，流程圖使用藍色(#2196F3)和綠色(#4CAF50)，平板銀色或深灰邊框，整體明亮乾淨。

避免元素：不要有燈泡、齒輪、3D立體箭頭、卡通圖示、過於複雜的流程圖、雜亂的桌面、過度飽和的顏色。

Slug：algorithm-selection-decision-flowchart-on-tablet

Frequently Asked Questions (FAQ)常見問題（FAQ）

1. What's the difference between Fisher-Yates and random.sample()?

Answer: Python's random.sample() actually uses the Fisher-Yates algorithm internally! When you call random.sample(range(1, 50), 6), Python implements Fisher-Yates under the hood.

Key differences:
- random.sample() is more convenient (one line of code)
- Manual Fisher-Yates gives you more control over the process
- Both have identical O(n) time complexity
- Both guarantee uniform distribution

Example comparison:

import random

# Using random.sample() (recommended for most cases)
lottery = random.sample(range(1, 50), 6)

# Manual Fisher-Yates (when you need customization)
def fisher_yates(start, end, count):
    numbers = list(range(start, end + 1))
    for i in range(len(numbers) - 1, len(numbers) - count - 1, -1):
        j = random.randint(0, i)
        numbers[i], numbers[j] = numbers[j], numbers[i]
    return numbers[-count:]

lottery = fisher_yates(1, 49, 6)

Recommendation: Use random.sample() for production code (it's battle-tested and optimized). Implement Fisher-Yates manually only when learning algorithms or needing specialized behavior.

2. When should I use Set-Based rejection instead of Fisher-Yates?

Answer: Set-Based rejection is only advantageous for very small samples from extremely large ranges where creating the full array would be impractical.

Use Set-Based when:
- Sample size < 1% of range size (e.g., 10 from 10,000,000)
- Memory is severely constrained
- You're prototyping and want simple code

Use Fisher-Yates when:
- Sample size > 10% of range size
- You need predictable performance
- You're building production systems
- Range size is manageable (< 10 million)

Performance comparison:

Scenario: 100 from 1,000,000
- Fisher-Yates: 1,234ms (consistent)
- Set-Based: 2ms (but can vary 10-100×)

Scenario: 900 from 1,000
- Fisher-Yates: 0.1ms (consistent)
- Set-Based: 50-500ms (degrades exponentially)

Critical warning: Never use Set-Based for dense selections (>50% of range) without timeout protection—it can hang your application!

3. How do I ensure my random numbers are truly unique and not just "probably" unique?

Answer: All five methods in this guide guarantee 100% uniqueness—there's no "probably" involved. They use mathematical algorithms that enforce uniqueness by design.

How each method ensures uniqueness:

Fisher-Yates: Swaps elements in array, physically impossible to select same number twice
Set-Based: Uses hash set data structure which inherently rejects duplicates
Reservoir Sampling: Overwrites existing selections, final reservoir has no duplicates
Sequential Shuffle: Generates unique set first, then sorts (uniqueness preserved)
Bitmap: Each bit position can only be set once

What to avoid:

# ❌ BAD: This can have duplicates!
numbers = [random.randint(1, 49) for _ in range(6)]

# ✅ GOOD: Guaranteed unique
numbers = random.sample(range(1, 50), 6)

Verification technique:

def verify_uniqueness(numbers):
    return len(numbers) == len(set(numbers))

# Test
result = random.sample(range(1, 100), 50)
print(f"All unique? {verify_uniqueness(result)}")  # Always True

4. Can I use these methods for cryptographic purposes (e.g., generating secure passwords)?

Answer: No, do not use these methods for cryptographic purposes! The algorithms are correct, but they rely on random module (Python) or Math.random() (JavaScript), which use pseudo-random number generators (PRNGs) that are not cryptographically secure.

For cryptographic use:

Python - Use secrets module:

import secrets

# Secure random integers
secure_number = secrets.randbelow(100)  # 0-99

# Secure choice from sequence
secure_choice = secrets.choice(['A', 'B', 'C'])

# Secure unique numbers (combine secrets with set)
def secure_unique_numbers(start, end, count):
    selected = set()
    while len(selected) < count:
        selected.add(secrets.randbelow(end - start + 1) + start)
    return list(selected)

# Generate 6 cryptographically secure lottery numbers
crypto_lottery = secure_unique_numbers(1, 49, 6)

JavaScript - Use crypto.getRandomValues():

function secureRandomInt(max) {
    const array = new Uint32Array(1);
    crypto.getRandomValues(array);
    return array[0] % max;
}

function secureUniqueNumbers(start, end, count) {
    const selected = new Set();
    while (selected.size < count) {
        selected.add(secureRandomInt(end - start + 1) + start);
    }
    return Array.from(selected);
}

Use cases:
- ✅ Lottery/raffle/games → Use random (fast, sufficient randomness)
- ✅ Test data/simulations → Use random
- ❌ Password generation → Use secrets/crypto
- ❌ Encryption keys → Use secrets/crypto
- ❌ Authentication tokens → Use secrets/crypto

5. What's the fastest method to select 1 million unique numbers from 10 million?

Answer: For this scenario (10% selection from huge range), optimized Fisher-Yates with dictionary is fastest, but Reservoir Sampling uses 10× less memory.

Performance comparison for 1M from 10M:

Method	Time	Memory	Notes
Standard Fisher-Yates	~15s	40 MB	Creates full 10M array
Optimized Fisher-Yates	3s	4 MB	Uses dictionary (sparse array)
Set-Based Rejection	5s	4 MB	Unpredictable performance
Reservoir Sampling	12s	4 MB	Consistent, streaming-friendly
Bitmap	8s	1.25 MB	Memory winner

Winner: Optimized Fisher-Yates with Dictionary

import random

def optimized_fisher_yates_large(start, end, count):
    """
    Optimized for large ranges - only creates dict for swapped elements.
    Fastest for 10-50% selection ratios.
    """
    range_size = end - start + 1
    selected = {}  # Sparse array using dict

    for i in range(count):
        j = random.randint(i, range_size - 1)

        # Get values (default to index if not swapped yet)
        val_i = selected.get(i, start + i)
        val_j = selected.get(j, start + j)

        # Swap in dictionary
        selected[i] = val_j
        selected[j] = val_i

    return [selected[i] for i in range(count)]

# Benchmark
import time
start_time = time.time()
result = optimized_fisher_yates_large(1, 10_000_000, 1_000_000)
elapsed = time.time() - start_time
print(f"Generated {len(result)} numbers in {elapsed:.2f}s")
# Output: "Generated 1000000 numbers in 3.42s"

When to use each:
- Speed priority → Optimized Fisher-Yates
- Memory priority → Reservoir Sampling or Bitmap
- Simplicity priority → random.sample(range(1, 10_000_001), 1_000_000)

6. How do I generate unique random numbers in SQL/database queries?

Answer: Most databases provide built-in random functions that can be combined with LIMIT to select unique random rows. The database engine typically uses optimized sampling algorithms internally.

PostgreSQL:

-- Select 100 random unique users
SELECT * FROM users
ORDER BY RANDOM()
LIMIT 100;

-- Select random numbers from generate_series
SELECT * FROM generate_series(1, 1000) AS num
ORDER BY RANDOM()
LIMIT 50;

MySQL:

-- Select 100 random unique orders
SELECT * FROM orders
ORDER BY RAND()
LIMIT 100;

-- Generate random numbers (requires helper table)
SELECT n FROM
    (SELECT @row := @row + 1 AS n FROM
     (SELECT 0 UNION SELECT 1 UNION SELECT 2 UNION SELECT 3) t1,
     (SELECT 0 UNION SELECT 1 UNION SELECT 2 UNION SELECT 3) t2,
     (SELECT @row := 0) r) nums
WHERE n BETWEEN 1 AND 100
ORDER BY RAND()
LIMIT 10;

SQLite:

-- Select 50 random unique records
SELECT * FROM products
ORDER BY RANDOM()
LIMIT 50;

Performance considerations:
- ⚠️ ORDER BY RANDOM() can be slow on large tables (scans entire table)
- ✅ Better: Use WHERE id >= RANDOM() * max_id LIMIT 1 for single row
- ✅ Best: Generate random IDs in application code, then WHERE id IN (...)

Application-side approach (recommended for large datasets):

import random
import psycopg2

def get_random_users(conn, count):
    # Get total count and max ID
    cursor = conn.cursor()
    cursor.execute("SELECT COUNT(*), MAX(id) FROM users")
    total, max_id = cursor.fetchone()

    # Generate random IDs using Fisher-Yates
    random_ids = random.sample(range(1, max_id + 1), count)

    # Fetch users by IDs (fast indexed query)
    cursor.execute(
        "SELECT * FROM users WHERE id = ANY(%s)",
        (random_ids,)
    )

    return cursor.fetchall()

7. Can these methods be used for weighted random selection (some numbers more likely)?

Answer: The methods in this guide assume uniform distribution (every number equally likely). For weighted selection, you need different algorithms.

Weighted selection techniques:

1. Weighted Random Choice (with replacement, can have duplicates):

import random

def weighted_choice(items, weights):
    """
    Select one item with weights.
    Uses built-in random.choices().
    """
    return random.choices(items, weights=weights, k=1)[0]

# Example: Raffle with different ticket counts
participants = ['Alice', 'Bob', 'Charlie']
tickets = [10, 5, 1]  # Alice has 10 tickets, Bob 5, Charlie 1

winner = weighted_choice(participants, tickets)
print(f"Winner: {winner}")

2. Weighted Sampling WITHOUT Replacement (unique selections):

import random

def weighted_sample_no_replace(items, weights, count):
    """
    Select multiple unique items with weights.
    Higher weight = higher probability of being selected.
    """
    if count > len(items):
        raise ValueError("Count exceeds item count")

    selected = []
    items_copy = list(items)
    weights_copy = list(weights)

    for _ in range(count):
        # Weighted choice from remaining items
        chosen = random.choices(items_copy, weights=weights_copy, k=1)[0]
        idx = items_copy.index(chosen)

        # Remove from pool
        selected.append(items_copy.pop(idx))
        weights_copy.pop(idx)

    return selected

# Example: Prize drawing with VIP advantages
participants = ['Alice (VIP)', 'Bob (VIP)', 'Charlie', 'David', 'Eve']
weights = [5, 5, 1, 1, 1]  # VIPs 5× more likely

winners = weighted_sample_no_replace(participants, weights, 2)
print(f"Winners: {winners}")

3. Alias Method (O(1) sampling for large datasets):

# For extremely large weighted datasets, implement Alias Method
# (Beyond scope of this guide - see specialized libraries like numpy.random.choice)

When to use weighted vs uniform:
- Uniform (this guide): Lottery, card shuffling, fair raffles
- Weighted: Priority queues, VIP systems, probability-based games, ML sampling

References參考資料

Fisher-Yates Shuffle Algorithm
Knuth, D. E. (1969). The Art of Computer Programming, Volume 2: Seminumerical Algorithms. Addison-Wesley.
Original algorithm by Fisher & Yates (1938), modernized by Knuth for computers.
Reservoir Sampling
Vitter, J. S. (1985). "Random Sampling with a Reservoir". ACM Transactions on Mathematical Software, 11(1), 37-57.
https://doi.org/10.1145/3147.3165
Algorithm Performance Analysis
Bentley, J. (1986). Programming Pearls. Addison-Wesley.
Chapter on random sampling and performance trade-offs.
Python random Module Documentation
Python Software Foundation. "random — Generate pseudo-random numbers".
https://docs.python.org/3/library/random.html
Cryptographically Secure Random Numbers
Python Software Foundation. "secrets — Generate secure random numbers for managing secrets".
https://docs.python.org/3/library/secrets.html
JavaScript Crypto API
Mozilla Developer Network. "Crypto.getRandomValues()".
https://developer.mozilla.org/en-US/docs/Web/API/Crypto/getRandomValues
Weighted Random Sampling Algorithms
Efraimidis, P. S., & Spirakis, P. G. (2006). "Weighted random sampling with a reservoir". Information Processing Letters, 97(5), 181-185.

Related Articles:
- Random Number Generator Complete Guide - Comprehensive random generation fundamentals
- Python Random Module Tutorial - Master Python's random capabilities
- Top 10 Random Number Tools Comparison - Find the best tool for your needs
- True vs Pseudo Random Analysis - Understanding randomness quality

Try it yourself: Free Online Random Number Generator - No-repeat mode with Fisher-Yates algorithm built-in!

No-Repeat Random Number Generator: 5 Implementation Methods【2025】| Fisher-Yates, Sets & Performance Comparison不重複隨機數產生器：5 種實現方法【2025】｜Fisher-Yates、集合與效能比較

Introduction: Why No-Repeat Random Numbers Matter簡介：為什麼不重複隨機數很重要

Part 1 - Understanding No-Repeat Random Number Generation（The Problem and Solutions）第一部分 - 理解不重複隨機數生成（問題與解決方案）

What Makes No-Repeat Generation Different?是什麼讓不重複生成與眾不同？

The 5 Implementation Approaches5 種實現方法

Part 2 - Method 1: Fisher-Yates Shuffle（The Gold Standard）第二部分 - 方法 1：Fisher-Yates 洗牌（黃金標準）

How Fisher-Yates WorksFisher-Yates 如何運作

Python ImplementationPython 實作

JavaScript ImplementationJavaScript 實作

When to Use Fisher-Yates何時使用 Fisher-Yates

Part 3 - Method 2: Set-Based Rejection Sampling（Simple But Risky）第三部分 - 方法 2：集合拒絕採樣（簡單但有風險）

How Set-Based Rejection Works集合拒絕採樣如何運作

Python ImplementationPython 實作

JavaScript ImplementationJavaScript 實作

Performance Analysis效能分析

Part 4 - Method 3: Reservoir Sampling（Memory Efficient for Streaming）第四部分 - 方法 3：蓄水池採樣（串流記憶體高效）

How Reservoir Sampling Works蓄水池採樣如何運作

Python ImplementationPython 實作

JavaScript ImplementationJavaScript 實作

When to Use Reservoir Sampling何時使用蓄水池採樣

Part 5 - Method 4: Sequential Shuffle with Sorting（When Order Matters）第五部分 - 方法 4：排序順序洗牌（當順序重要時）

How Sequential Shuffle Works順序洗牌如何運作

Python ImplementationPython 實作

When to Use Sequential Shuffle何時使用順序洗牌

🚀 Try Our Free Random Number Generator Tools🚀 試試我們的免費隨機數產生器工具

How Tool Master Can Help YouTool Master 如何幫助您

💡 Why Choose Tool Master?💡 為什麼選擇 Tool Master？

👉 Try Tool Master Random Number Generator Now![翻譯中]

Part 6 - Method 5: Bitmap/Boolean Array（Space-Optimized for Dense Ranges）第六部分 - 方法 5：位圖/布林陣列（密集範圍空間優化）

How Bitmap Method Works位圖方法如何運作

Python ImplementationPython 實作

JavaScript ImplementationJavaScript 實作

When to Use Bitmap Method何時使用位圖方法

Part 7 - Performance Comparison & Benchmark Results第七部分 - 效能比較與基準測試結果

Benchmark Setup基準測試設置

Python Benchmark CodePython 基準測試程式碼

Performance Summary Table效能摘要表

Memory Comparison記憶體比較

Part 8 - Real-World Applications & Code Examples第八部分 - 實際應用與程式碼範例

Application 1: Lottery System應用 1：彩票系統

Application 2: Card Shuffling應用 2：洗牌

Application 3: A/B Testing Sample Selection應用 3：A/B 測試樣本選擇

Application 4: Test Data Generation應用 4：測試數據生成

Application 5: Prize Drawing System應用 5：抽獎系統

Conclusion: Choosing the Right Method for Your Needs結論：選擇適合您需求的方法

Decision Matrix決策矩陣

Quick Reference Table快速參考表

Final Recommendations最終建議

Next Steps下一步

Frequently Asked Questions (FAQ)常見問題（FAQ）

1. What's the difference between Fisher-Yates and random.sample()?1. Fisher-Yates 和 random.sample() 有什麼區別？

2. When should I use Set-Based rejection instead of Fisher-Yates?2. 何時應該使用集合拒絕而不是 Fisher-Yates？

3. How do I ensure my random numbers are truly unique and not just "probably" unique?3. 如何確保我的隨機數是真正唯一的，而不僅僅是「可能」唯一？

4. Can I use these methods for cryptographic purposes (e.g., generating secure passwords)?4. 我可以將這些方法用於加密目的（例如生成安全密碼）嗎？

5. What's the fastest method to select 1 million unique numbers from 10 million?5. 從 1000 萬中選擇 100 萬個唯一數字最快的方法是什麼？

6. How do I generate unique random numbers in SQL/database queries?6. 如何在 SQL/資料庫查詢中生成唯一隨機數？

7. Can these methods be used for weighted random selection (some numbers more likely)?7. 這些方法可以用於加權隨機選擇嗎（某些數字更可能被選中）？

References參考資料

No-Repeat Random Number Generator: 5 Implementation Methods【2025】| Fisher-Yates, Sets & Performance Comparison

Introduction: Why No-Repeat Random Numbers Matter

Part 1 - Understanding No-Repeat Random Number Generation（The Problem and Solutions）

What Makes No-Repeat Generation Different?

Part 2 - Method 1: Fisher-Yates Shuffle（The Gold Standard）

Part 3 - Method 2: Set-Based Rejection Sampling（Simple But Risky）

Part 4 - Method 3: Reservoir Sampling（Memory Efficient for Streaming）

Part 5 - Method 4: Sequential Shuffle with Sorting（When Order Matters）

🚀 Try Our Free Random Number Generator Tools

Part 6 - Method 5: Bitmap/Boolean Array（Space-Optimized for Dense Ranges）

Part 7 - Performance Comparison & Benchmark Results

Part 8 - Real-World Applications & Code Examples

1. What's the difference between Fisher-Yates and random.sample()?

2. When should I use Set-Based rejection instead of Fisher-Yates?

3. How do I ensure my random numbers are truly unique and not just "probably" unique?

4. Can I use these methods for cryptographic purposes (e.g., generating secure passwords)?

5. What's the fastest method to select 1 million unique numbers from 10 million?

6. How do I generate unique random numbers in SQL/database queries?

7. Can these methods be used for weighted random selection (some numbers more likely)?