Search Results for "q learning algorithm" - Page 5

Showing 301 open source projects for "q learning algorithm"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • 1
    AmPEP and AxPEP

    AmPEP and AxPEP

    Sequence-based Antimicrobial Peptide Prediction by Random Forest

    Antimicrobial peptides (AMPs) are promising candidates in the fight against multidrug-resistant pathogens due to its broad range of activities and low toxicity. However, identification of AMPs through wet-lab experiment is still expensive and time consuming. AmPEP is an accurate computational method for AMP prediction using the random forest algorithm. The prediction model is based on the distribution patterns of amino acid properties along the sequence. Our optimal model, AmPEP with 1:3...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    PaddlePaddle models

    PaddlePaddle models

    Pre-trained and Reproduced Deep Learning Models

    Pre-trained and Reproduced Deep Learning Models ("Flying Paddle" official model library, including a variety of academic frontier and industrial scene verification of deep learning models) Flying Paddle's industrial-level model library includes a large number of mainstream models that have been polished by industrial practice for a long time and models that have won championships in international competitions; it provides many scenarios for semantic understanding, image classification,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Supervised Reptile

    Supervised Reptile

    Code for the paper "On First-Order Meta-Learning Algorithms"

    The supervised-reptile repository contains code associated with the paper “On First-Order Meta-Learning Algorithms”, which introduces Reptile, a meta-learning algorithm for learning model parameter initializations that adapt quickly to new tasks. The implementation here is aimed at supervised few-shot learning settings (e.g. Omniglot, Mini-ImageNet), not reinforcement learning, and includes scripts to run training and evaluation for few-shot classification. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    ML.NET Samples

    ML.NET Samples

    Samples for ML.NET, an open source and cross-platform machine learning

    ML.NET is a cross-platform open-source machine learning framework that makes machine learning accessible to .NET developers. In this GitHub repo, we provide samples that will help you get started with ML.NET and how to infuse ML into existing and new .NET apps. We're working on simplifying ML.NET usage with additional technologies that automate the creation of the model for you so you don't need to write the code by yourself to train a model, you simply need to provide your datasets. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Turn traffic into pipeline and prospects into customers Icon
    Turn traffic into pipeline and prospects into customers

    For account executives and sales engineers looking for a solution to manage their insights and sales data

    Docket is an AI-powered sales enablement platform designed to unify go-to-market (GTM) data through its proprietary Sales Knowledge Lake™ and activate it with intelligent AI agents. The platform helps marketing teams increase pipeline generation by 15% by engaging website visitors in human-like conversations and qualifying leads. For sales teams, Docket improves seller efficiency by 33% by providing instant product knowledge, retrieving collateral, and creating personalized documents. Built for GTM teams, Docket integrates with over 100 tools across the revenue tech stack and offers enterprise-grade security with SOC 2 Type II, GDPR, and ISO 27001 compliance. Customers report improved win rates, shorter sales cycles, and dramatically reduced response times. Docket’s scalable, accurate, and fast AI agents deliver reliable answers with confidence scores, empowering teams to close deals faster.
    Learn More
  • 5
    DeepCluster

    DeepCluster

    Deep Clustering for Unsupervised Learning of Visual Features

    DeepCluster is a classic self-supervised clustering-based representation learning algorithm that iteratively groups image features and uses the cluster assignments as pseudo-labels to train the network. In each round, features produced by the network are clustered (e.g. k-means), and the cluster IDs become supervision targets in the next epoch, encouraging the model to refine its representation to better separate semantic groups.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Smart Algorithm

    Smart Algorithm

    Repository implementing a variety of intelligent algorithms

    Smart-Algorithm is a repository implementing a variety of intelligent / metaheuristic optimization algorithms (e.g. Genetic Algorithm, Ant Colony, Particle Swarm, Immune Algorithm). The implementations are provided in multiple languages (Java, Python, MATLAB). The repository’s aim is to offer reference implementations of “smart” algorithms for tasks like route planning, optimization, or algorithm learning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Consistent Depth

    Consistent Depth

    We estimate dense, flicker-free, geometrically consistent depth

    Consistent Depth is a research project developed by Facebook Research that presents an algorithm for reconstructing dense and geometrically consistent depth information for all pixels in a monocular video. The system builds upon traditional structure-from-motion (SfM) techniques to provide geometric constraints while integrating a convolutional neural network trained for single-image depth estimation. During inference, the model fine-tunes itself to align with the geometric constraints of a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    SAR Synthetic Aperture Radar

    SAR Synthetic Aperture Radar

    Collection of MATLAB codes, simulations, and summaries

    This repository is a collection of MATLAB codes, simulations, and summaries related to Synthetic Aperture Radar (SAR), InSAR, and PolSAR. The author compiled implementations of classic SAR imaging algorithms (e.g. Range-Doppler (RD) algorithm, Chirp Scaling (CS) algorithm), synthetic scenes, InSAR (interferometric) simulation including multiple terrain types (flat, conical), and polarization calibration techniques (Whitt, PARC, Quegan, Ainsworth). The readme states that these were research/learning codes, with experiments and reports, and no further maintenance is expected. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    CNN Explainer

    CNN Explainer

    Learning Convolutional Neural Networks with Interactive Visualization

    In machine learning, a classifier assigns a class label to a data point. For example, an image classifier produces a class label (e.g, bird, plane) for what objects exist within an image. A convolutional neural network, or CNN for short, is a type of classifier, which excels at solving this problem! A CNN is a neural network: an algorithm used to recognize patterns in data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Pest Control Management Software Icon
    Pest Control Management Software

    Pocomos is a cloud-based field service solution that caters to businesses

    Built for the pest control industry, but also works great for Mosquito Control, Bin Cleaning, Window Washing, Solar Panel Cleaning, and other Home Service Businesses in need of an easy-to-use software that helps you simplify routing, scheduling, communications, payment processing, truck tracking, time tracking, and reporting.
    Learn More
  • 10
    Computer Vision Pretrained Models

    Computer Vision Pretrained Models

    A collection of computer vision pre-trained models

    ...Instead of building a model from scratch to solve a similar problem, we can use the model trained on other problem as a starting point. A pre-trained model may not be 100% accurate in your application. For example, if you want to build a self-learning car. You can spend years building a decent image recognition algorithm from scratch or you can take the inception model (a pre-trained model) from Google which was built on ImageNet data to identify images in those pictures. The model generates bounding boxes and segmentation masks for each instance of an object in the image. It's based on Feature Pyramid Network (FPN) and a ResNet101 backbone. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Forecasting Best Practices

    Forecasting Best Practices

    Time Series Forecasting Best Practices & Examples

    Time series forecasting is one of the most important topics in data science. Almost every business needs to predict the future in order to make better decisions and allocate resources more effectively. This repository provides examples and best practice guidelines for building forecasting solutions. The goal of this repository is to build a comprehensive set of tools and examples that leverage recent advances in forecasting algorithms to build solutions and operationalize them. Rather than...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    java-string-similarity

    java-string-similarity

    Implementation of various string similarity and distance algorithms

    Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity. A library implementing different string similarity and distance measures. A dozen of algorithms (including Levenshtein edit distance and sibblings, Jaro-Winkler, Longest Common Subsequence, cosine similarity etc.) are currently implemented. The main characteristics of each implemented algorithm are presented below. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    MADDPG

    MADDPG

    Code for the MADDPG algorithm from a paper

    MADDPG (Multi-Agent Deep Deterministic Policy Gradient) is the official code release from OpenAI’s paper Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments. The repository implements a multi-agent reinforcement learning algorithm that extends DDPG to scenarios where multiple agents interact in shared environments. Each agent has its own policy, but training uses centralized critics conditioned on the observations and actions of all agents, enabling learning in cooperative, competitive, and mixed settings. The code is built on top of TensorFlow and integrates with the Multiagent Particle Environments (MPE) for benchmarking. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    ViralPlaque

    ViralPlaque

    A fast, open-source and versatile ImageJ macro for the automated dete

    ...We present ViralPlaque, a fast, open-source and versatile ImageJ macro for the automated determination of viral plaque dimensions from digital images. Also, a machine learning plugin is integrated in the analysis algorithm for adaptation of ViralPlaque to the user’s needs and experimental conditions. A high correlation between manual and automated measurements of plaque dimensions was demonstrated. This macro will facilitate reliable and reproducible characterization of cytolytic viruses with an increased processing speed. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    karatasi - flip cards on iPhone
    Flip card learning program for iPhone with a spaced learning algorithm. Create your own databases and edit the cards directly on the iPhone. Import Palm databases or csv-formatted files and backup your data with our Java application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Python Machine Learning

    Python Machine Learning

    The "Python Machine Learning (2nd edition)" book code repository

    This repository accompanies the well-known textbook “Python Machine Learning, 2nd Edition” by Sebastian Raschka and Vahid Mirjalili, serving as a complete codebase of examples, notebooks, scripts and supporting materials for the book. It covers a wide range of topics including supervised learning, unsupervised learning, dimensionality reduction, model evaluation, deep learning with TensorFlow, and embedding models into web apps. Each chapter has Jupyter notebooks and Python scripts that...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    jieba

    jieba

    Stuttering Chinese word segmentation

    "Jaba" Chinese word segmentation, do the best Python Chinese word segmentation component. Four word segmentation modes are supported. Precise mode, which tries to cut the sentence most precisely, suitable for text analysis. Full mode, scans all the words that can be formed into words in the sentence, the speed is very fast, but the ambiguity cannot be resolved. The search engine mode, on the basis of the precise mode, divides the long words again to improve the recall rate, which is suitable...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    RecNN

    RecNN

    Reinforced Recommendation toolkit built around pytorch 1.7

    This is my school project. It focuses on Reinforcement Learning for personalized news recommendation. The main distinction is that it tries to solve online off-policy learning with dynamically generated item embeddings. I want to create a library with SOTA algorithms for reinforcement learning recommendation, providing the level of abstraction you like.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    CCZero (中国象棋Zero)

    CCZero (中国象棋Zero)

    Implement AlphaZero/AlphaGo Zero methods on Chinese chess

    ChineseChess-AlphaZero is a project that implements the AlphaZero algorithm for the game of Chinese Chess (Xiangqi). It adapts DeepMind’s AlphaZero method—combining neural networks and Monte Carlo Tree Search (MCTS)—to learn and play Chinese Chess without prior human data. The system includes self-play, training, and evaluation pipelines tailored to Xiangqi's unique game mechanics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    benchm-ml

    benchm-ml

    A benchmark of commonly used open source implementations

    ...The benchmarks cover algorithms like logistic regression, random forest, gradient boosting, and deep neural networks, and they compare across toolkits such as scikit-learn, R packages, xgboost, H2O, Spark MLlib, etc. The repository is structured in logical folders, each corresponding to algorithm categories.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Coach

    Coach

    Enables easy experimentation with state of the art algorithms

    Coach is a python framework that models the interaction between an agent and an environment in a modular way. With Coach, it is possible to model an agent by combining various building blocks, and training the agent on multiple environments. The available environments allow testing the agent in different fields such as robotics, autonomous driving, games and more. It exposes a set of easy-to-use APIs for experimenting with new RL algorithms and allows simple integration of new environments...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Active Learning

    Active Learning

    Framework and examples for active learning with machine learning model

    ...The main experiment runner (run_experiment.py) supports a wide range of configurations, including batch sizes, dataset subsets, model selection, and data preprocessing options. It includes several established active learning strategies such as uncertainty sampling, k-center greedy selection, and bandit-based methods, while also allowing for custom algorithm implementations. The framework integrates with both classical machine learning models (SVM, logistic regression) and neural networks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Lori's Help

    Lori's Help

    An Android app to help people with Down Syndrome in their literacy

    Lori Help is an Android application that provides support for the literacy of people with Down syndrome. The application has 4 activities to aid in learning, 3 of them with emphasis on the literacy process and 1 focused on sensory stimuli. Application activities are monitored by a biofeedback algorithm (known as Attention Meter). The algorithm observes the variations of the user's micro facial expressions with the intention of measuring the level of attention during the accomplishment of the activities. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    easy12306

    easy12306

    Automatic recognition of 12306 verification code

    Automatic recognition of 12306 verification code using machine learning algorithm. Identify never-before-seen pictures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Objective Function Analysis

    An alternative to neural nets for machine learning.

    Objective Function Analysis models knowledge as a multi-dimensional probability density function (MD-PDF) of the perceptions and responses (which are themselves perceptions) of an entity and an objective function (OF). The learning algorithm is the action of choosing a response, given the perceptions, which maximizes the objective function. The MD-PDF is initially seeded by a uniform random number generator. The response is used to evaluate the OF and the OF is either reinforced or diminished in the probability subspace formed by the perceptions and responses. ...
    Downloads: 0 This Week
    Last Update:
    See Project