Job Title | Budget | ||||
---|---|---|---|---|---|
Computer Vision Engineer for Multi-Camera Calibration and 3D Vision Pipeline
|
500 USD | 2 hours ago |
Client Rank
- Good
$1'386 total spent
41 hires
, 1 active
63 jobs posted
65% hire rate,
3 open job
4.72
of 7 reviews
Registered at: 07/11/2021
United States
|
||
Required Connects: 14
Project Overview
I need a highly experienced Computer Vision/Imaging Engineer to develop an end-to-end multi-camera calibration pipeline. You will guide me through at-home camera calibration (using a GoPro Hero 10 and standard checkerboard techniques), then proceed to extrinsic calibration with real-world references. The final goal is to achieve consistent and accurate 3D reconstruction from multiple camera feeds, under real-world conditions. What You’ll Do ● Advise and assist on intrinsic calibration of my GoPro Hero 10 (I have the hardware; you’ll provide remote guidance). ● Develop and validate a robust extrinsic calibration process using known real-world distances or markers. ● Implement 2D-3D correspondences and pose estimation (PnP, bundle adjustment, etc.) to unify multiple camera views into a single coordinate system. ● Ensure outlier rejection and temporal stability across frames. Required Skills & Experience 1. Deep knowledge of OpenCV (calibration, feature detection, pose estimation). 2. Proven track record with multi-camera calibration in real-world scenarios. 3. Strong background in 3D geometry, transformations, and coordinate system alignment. 4. Coding proficiency (Python or C++), with an emphasis on optimized, real-time or near-real-time solutions. 5. Familiarity with bundle adjustment frameworks (e.g., Ceres Solver) is a significant plus. 6. Excellent communication skills to provide guidance and troubleshoot remotely. If you have built or deployed such calibration/vision pipelines and can demonstrate real-world success, please apply with examples of your work or a portfolio. We’ll discuss next steps upon review.
Skills: Computer Vision, Python, Deep Learning, OpenCV, Machine Learning, Neural Network, Artificial Intelligence
Fixed budget:
500 USD
2 hours ago
|
|||||
Data labeling for computer vision project
|
16 - 20 USD
/ hr
|
6 hours ago |
Client Rank
- Excellent
$248'291 total spent
21 hires
, 8 active
26 jobs posted
81% hire rate,
1 open job
18.59 /hr avg hourly rate paid
12836 hours
4.95
of 17 reviews
Registered at: 23/01/2017
United States
|
||
Required Connects: 17
I am looking for experts in American Football to help with a computer vision project. I have already collected all the images and you will be using a custom tool to review and annotate. You will just need to label or verify existing labels.
Here are the details: * You need to have advanced knowledge of American football - preferably playing at a competitive level * You will be using a custom tool to label * You will be provided labeling instructions * You choose your hours * You must have a PC or Mac * You need to be responsive and deliver high quality work Candidates will perform a paid trial session and then be hired for the project.
Skills: Data Entry, Computer Vision, Data Annotation
Hourly rate:
16 - 20 USD
6 hours ago
|
|||||
OCR Document Scanning and Data Extraction Specialist
|
not specified | 6 hours ago |
Client Rank
- Medium
1 jobs posted
1 open job
Registered at: 21/12/2024
United Arab Emirates
|
||
Required Connects: 11
We are seeking a skilled professional to develop an OCR solution that can accurately scan images and PDFs, extracting data into JSON format in both English and Arabic. The ideal candidate will have experience with optical character recognition technology, specifically focusing on multilingual text processing. This project requires attention to detail, as accuracy is paramount. If you have a passion for data extraction and the technical know-how to deliver high-quality results, we'd love to hear from you!
Skills: Python, Python Script, FastAPI, OCR Software, OCR Algorithm, Tesseract OCR, Computer Vision, Image Annotation, Roboflow, CVAT, GPT-4, YOLO, Machine Learning, Database, OpenCV
Budget:
not specified
6 hours ago
|
|||||
Enhance MasterThesis Model for Helmet Detection in Underground Mining(YOLOv8+Advanced Architectures)
|
50 USD | 8 hours ago |
Client Rank
- Medium
1 open job
China
|
||
Required Connects: 7
I am looking for an experienced computer vision and deep learning expert to assist in enhancing my thesis research on underground coal mine helmet detection. My thesis involves detecting whether workers(people) are wearing helmets or not in underground environments.
Here’s a summary of the current work: I have utilized YOLOv8 and its attention mechanisms (Efficient Channel Attention, Shuffle Attention, and Resblock CBAM) for detecting three classes: people, helmet, and no_helmet. The dataset includes 6152 manually labeled images for the above three classes, created with Roboflow. What I Want: I aim to improve my thesis by exploring additional models and approaches to make the research more innovative and effective. Scope of Work: Experiment with other deep learning architectures like Faster R-CNN, SSD, detectron2 and EfficientDet for performance comparison with YOLOv8. Explore hybrid models such as YOLOv8 combined with Vision Transformers or other attention mechanisms. Optimize model performance for improved accuracy in low-light and occluded scenarios typical of underground environments. (Optional) Introduce multi-task learning approaches for simultaneous detection of helmets in underground coalmine Required Skills: Proficiency in Deep Learning and Object Detection. Experience with YOLOv8, Attention Mechanisms, and other object detection frameworks (e.g., Faster R-CNN, SSD,Detectron2, EfficientDet). Familiarity with tools like Roboflow, PyTorch, or TensorFlow. Deliverables: Implementation of additional models for comparison. Detailed Thesis writing after implimentaction And comparing the performance of all models results ,graphs,evaluation metrics like percesion,revall,F1curve etc Budget: Open for discussion to increase budget based on expertise
Skills: Deep Learning, Computer Vision, Artificial Intelligence, Machine Learning, Deep Neural Network, Image Processing, Convolutional Neural Network
Fixed budget:
50 USD
8 hours ago
|
|||||
Multimodal AI Developer Needed for Medical Device Project
|
4,000 USD | 8 hours ago |
Client Rank
- Excellent
$26'738 total spent
22 hires
, 10 active
19 jobs posted
100% hire rate,
2 open job
37.06 /hr avg hourly rate paid
407 hours
4.77
of 13 reviews
Registered at: 19/05/2020
United States
|
||
Required Connects: 21
We are seeking an experienced Multimodal AI Developer to create a robust speech, image, and video processing system for a cutting-edge medical device. The ideal candidate will have a strong background in AI/ML, specifically in processing and integrating multimodal data with a strong preference for those with experience in medical devices
The system will transcribe speech, analyze visual content, and provide intelligent responses using local processing. Required System Capabilities: Audio Processing Real-time speech diarization (speaker identification/separation) High-accuracy speech-to-text conversion Handle multiple audio formats and sources Process conversation overlaps and multiple speakers Visual Processing Extract and analyze content from images and video Process multiple video formats Handle frames and temporal information OCR capabilities for text in images/video AI Integration Local deployment of multimodal models (like Llama 3.2) Process and analyze combined audio/visual data Generate contextual responses based on multimodal input Handle batch and real-time processing Technical Requirements: Experience with speech processing libraries (Whisper, Pyannote) Knowledge of video/image processing frameworks Expertise in local LLM deployment and optimization Background in multimodal AI systems Experience with GPU acceleration and optimization Deliverables: Complete processing pipeline Local model deployment configuration API for system interaction Processing dashboard Documentation and maintenance guide Required Skills: Advanced Python development Audio/video processing expertise Local LLM deployment experience GPU optimization knowledge Strong system architecture background Budget: $4,000-6,000 Timeline: 4-6 weeks Please Share: Similar multimodal systems you've built Experience with speech diarization Examples of local LLM deployments Your approach to system optimization Looking for candidates who can demonstrate experience with complex multimodal AI systems and local model deployment.
Skills: Artificial Intelligence, Machine Learning, Computer Vision, TensorFlow, Healthcare Software
Fixed budget:
4,000 USD
8 hours ago
|
|||||
Nature Image Classification Model Creation
|
~7 - 21 USD | 13 hours ago |
Client Rank
- Medium
6 open job
Registered at: 20/10/2024
Bangladesh
|
||
I'm looking for a skilled data scientist or machine learning engineer who can develop a precise image classification model for nature images. The model will specifically need to classify images into three categories: animals, plants, and landscapes.
Key Requirements: - The model should deliver high accuracy, ideally within the 95-100% range. - Proficiency in deep learning and computer vision is a must. - Prior experience in developing image classification models is highly desirable. Your expertise will play a crucial role in achieving the desired accuracy and performance of the model. Please provide examples of similar projects you've worked on in your proposal. Skills: Java, Python, Machine Learning (ML), Image Recognition, AI-Enhanced Classification
Fixed budget:
10 - 30 CAD
13 hours ago
|
|||||
Detect and handle schematics in PDFs
|
20 - 44 USD
/ hr
|
14 hours ago |
Client Rank
- Excellent
$121'917 total spent
177 hires
, 34 active
221 jobs posted
80% hire rate,
8 open job
19.78 /hr avg hourly rate paid
4365 hours
4.86
of 154 reviews
Registered at: 22/06/2018
United States
|
||
Featured
Required Connects: 14
Schematic Callout Extractor Project
Background Engineering schematics and technical drawings often contain callouts - letters or numbers that reference specific components. Currently, engineers manually identify and catalog these references when building documentation or maintenance systems. An automated solution would significantly reduce the time spent on this task and improve accuracy. Brief Summary A computer vision system that processes technical drawings to automatically detect, extract, and catalog callout references. The system identifies text labels, their locations, and associates them with their descriptions from the drawing's legend. Detailed Requirements 1. Input Processing Accept common image formats (PNG, JPG, PDF) Support both scanned and digital schematics Handle varying image qualities and resolutions Process single or multiple page documents 2. Callout Detection Identify single-letter or number callouts (A, B, C, 1, 2, 3, etc.) Detect callout bounding boxes Extract callout text content Record coordinates for later highlighting Handle multiple instances of the same callout 3. Legend Processing Identify the legend section of the drawing Extract callout-description pairs Match detected callouts with their legend descriptions Handle multi-line descriptions 4. Data Output Generate structured JSON output containing: Callout letter/number Coordinates (x, y, width, height) Associated description Reference count (if multiple instances) Support export to common database formats 5. Visualization Highlight detected callouts on the original image Generate preview with bounding boxes Create interactive web view showing callouts Support zooming and panning 6. Integration Features REST API for processing requests Batch processing capability Error handling and validation Processing status updates Configurable confidence thresholds
Skills: Python, Computer Vision
Hourly rate:
20 - 44 USD
14 hours ago
|
|||||
Consultation for Automating Image Segmentation Process
|
20 - 40 USD
/ hr
|
14 hours ago |
Client Rank
- Risky
1 open job
Registered at: 23/02/2021
United States
|
||
Required Connects: 8
I am seeking an expert consultant to guide me through possible automation processes for image segmentation, organization, and more. This includes understanding current methodologies, tools, and best practices in the field. The goal is to enhance efficiency and accuracy in processing images for my project. Ideal candidates will have practical experience in automation, vision models, and image processing. Please provide insights on potential frameworks and technologies that could be leveraged for this task.
Skills: Python, Automation, Computer Vision, C++, TensorFlow
Hourly rate:
20 - 40 USD
14 hours ago
|
|||||
Image recognition of currency serial number, date, and denomination
|
75 USD | 14 hours ago |
Client Rank
- Excellent
$6'874 total spent
48 hires
, 5 active
55 jobs posted
87% hire rate,
1 open job
9.06 /hr avg hourly rate paid
355 hours
4.95
of 44 reviews
Registered at: 12/03/2018
United States
|
||
Required Connects: 14
I need a program that can 1) recognize if an image is the front of back of a U.S bill and 2) recognize currency denominations, serial numbers, and dates. I have a folder of several hundred images that you can test on.
Skills: Computer Vision, Python
Fixed budget:
75 USD
14 hours ago
|
|||||
Data Scientist Needed: LLM, NLP, and Computer Vision Expertise
|
10 - 15 USD
/ hr
|
15 hours ago |
Client Rank
- Risky
2 jobs posted
1 open job
Registered at: 20/08/2024
India
|
||
Required Connects: 9
We are seeking a skilled Data Scientist with expertise in Large Language Models (LLM), Natural Language Processing (NLP), and Computer Vision (CV). Your role will involve developing and implementing innovative algorithms, analyzing complex datasets, and providing actionable insights. The ideal candidate will have experience in building predictive models and working with cutting-edge technologies. If you're passionate about leveraging data to drive business decisions, we want to hear from you!
Skills: Machine Learning, Python, TensorFlow, Deep Learning, Artificial Intelligence
Hourly rate:
10 - 15 USD
15 hours ago
|
|||||
Autonomous AI Agent for Rocket League
|
3,000 - 5,000 USD | 18 hours ago |
Client Rank
- Risky
1 open job
Registered at: 06/08/2023
Switzerland
|
||
We are seeking an expert in artificial intelligence, reinforcement learning (RL), and machine learning to develop an autonomous agent for the game Rocket League. This agent should leverage the Proximal Policy Optimization (PPO) algorithm and be capable of learning to compete with the best players in the world by emulating their playstyle.
The ultimate goal is to create a high-performing and realistic agent capable of playing at a professional level while adopting strategies and behaviors similar to elite players. Technical Project Details: Platform and Environment: Use the Rocket League environment for simulations (via frameworks like RLBot or similar APIs that allow game manipulation). Integration with a simulation engine to train the agent effectively. Algorithms and Techniques: Implementation of PPO as the core reinforcement learning algorithm. Use of advanced imitation learning and behavior cloning techniques to replicate the movements and decisions of top players. Integration of a hybrid approach combining imitation learning (from professional replays) with self-play to optimize performance. Training Data: Extraction and preprocessing of data from replays (Rocket League replay files) to analyze the playstyles of professional players. Construction of a dataset including actions, trajectories, and decisions based on these replays. Data preprocessing and augmentation to enhance training robustness. Expected Features: A model capable of playing 1v1, 2v2, and 3v3 modes. A fluid and natural playstyle that imitates the strategies of top players. Real-time adaptability to different opponent strategies. Performance comparable to professional players in test matches. Technologies to Be Used: RL/ML libraries: TensorFlow, PyTorch, or equivalent frameworks. RL algorithms: PPO, imitation learning, and fine-tuning methods based on in-game performance evaluation. Tools for Rocket League: RLBot, GameState integration, or similar APIs. Data processing: Python (NumPy, Pandas), and computer vision tools if necessary for replay analysis. Expected Outcomes: A fully trained and functional model ready to be tested in the Rocket League environment. Clear documentation of the code and training process. A guide explaining how to continue training or fine-tune the agent if needed. Deliverables: Fully documented source code. Trained model. Scripts for data preprocessing and model training. Videos or demonstrations showcasing the agent’s performance in test matches. Required Qualifications: Proven experience in developing RL agents and implementing algorithms such as PPO. Familiarity with tools specific to Rocket League (e.g., RLBot, replay file manipulation). Expertise in data extraction and analysis from game files. Deep understanding of imitation learning and reinforcement learning approaches. Strong programming skills (Python, ML/RL frameworks). Skills: Python, Algorithm, Machine Learning (ML), Pytorch, Reinforcement Learning
Fixed budget:
3,000 - 5,000 USD
18 hours ago
|
|||||
Face detection in live streams
|
200 USD | 19 hours ago |
Client Rank
- Excellent
$50'974 total spent
25 hires
, 4 active
38 jobs posted
66% hire rate,
2 open job
14.26 /hr avg hourly rate paid
128 hours
4.99
of 17 reviews
Registered at: 01/09/2013
Romania
|
||
Required Connects: 14
The goal of this project is to save screenshots from online live streams of TV stations, and detect faces in the images.
I can offer a detailed description. Please add 'harry potter' to your application so I make sure you have read it.
Skills: Machine Learning, Python, Web Development, Computer Vision
Fixed budget:
200 USD
19 hours ago
|
|||||
Computer vision tutor
|
100 USD | 21 hours ago |
Client Rank
- Medium
$290 total spent
5 hires
7 jobs posted
71% hire rate,
2 open job
32.72 /hr avg hourly rate paid
4 hours
5.00
of 4 reviews
Registered at: 02/12/2021
Saudi Arabia
|
||
Required Connects: 13
I'm looking for a tutor for computer vision , it'll be on demand topics , Will ask you 1st what's on my mind to know & yu can suggest me the right approach, mainly would learn via Pytorch tool.
It'll be learning by doing, i'm a beginner but with experience into AI business. Familiarization of real life scenarios & cloud modules is a bonus.
Skills: Computer Vision, TensorFlow, bytorch, Machine Learning, Artificial Intelligence, Deep Learning, Python
Fixed budget:
100 USD
21 hours ago
|
|||||
Personal Computer Engineer and Programming Tutor
|
35 - 60 USD
/ hr
|
1 day ago |
Client Rank
- Medium
2 open job
Registered at: 30/04/2022
United States
|
||
Required Connects: 11
Only freelancers located in the U.S. may apply.
I am looking for a skilled and patient Computer Engineer and Programming Tutor to provide hands-on guidance in running Python code and developing software programs step-by-step. This role requires expertise in both macOS and Windows operating systems, along with the ability to explain technical concepts clearly and practically.
The ideal candidate will assist with installing and configuring Python environments, including setting up IDEs such as PyCharm, VS Code, and Jupyter Notebook. They will teach programming concepts like variables, loops, functions, classes, and object-oriented programming (OOP) while demonstrating how to write, run, and debug Python code. Lessons should also cover virtual environments, package managers like pip and conda, and version control systems such as Git and GitHub for code management. Candidates should be prepared to troubleshoot compatibility issues, optimize system performance, and recommend productivity tools and debugging techniques. Experience with machine learning and computer vision models is a plus and may lead to opportunities for additional projects related to AI-driven applications. This role is ideal for someone passionate about programming, AI, and mentoring others to build and deploy software efficiently.
Skills: Python, Java, Node.js, JavaScript, macOS, Machine Learning
Hourly rate:
35 - 60 USD
1 day ago
|
|||||
AI, Automation and Software Development Fundamentals Tutor
|
50 - 75 USD
/ hr
|
1 day ago |
Client Rank
- Medium
2 open job
Registered at: 30/04/2022
United States
|
||
Required Connects: 11
Only freelancers located in the U.S. may apply.
I am seeking a knowledgeable and patient AI, Automation, and Software Development Fundamentals Tutor to teach the core concepts and tools needed to build software that leverages AI, automation, computer vision, deep learning models, and AI agents. This role focuses on providing clear, step-by-step instruction to help develop a strong foundation for creating AI-driven applications.
The ideal candidate should have experience with machine learning, NLP, computer vision, deep learning frameworks, AI agents, and automation tools. They should also have a strong background in evaluating and selecting tools for document processing, language understanding, data analytics, data extraction, and system integrations. Lessons should cover topics such as data preprocessing, model training and evaluation, hyperparameter tuning, and AI deployment, using tools like TensorFlow, PyTorch, Scikit-learn, OpenCV, Hugging Face Transformers, and LangChain. Instruction will also include automation workflows with tools like Zapier, UiPath, and Selenium, as well as integrating AI models into scalable systems. Applicants must have hands-on experience building software applications and integrating AI technologies into real-world systems. They should be able to simplify complex topics, provide practical examples, and demonstrate how to create AI agents for automation and decision-making, along with strategies for extracting and analyzing data from documents and integrating AI solutions into broader workflows. This role is ideal for someone passionate about helping others build AI-powered software while leveraging advanced tools for language understanding, data processing, and automation.
Skills: Machine Learning, TensorFlow, Artificial Neural Network, TensorFlow Stack
Hourly rate:
50 - 75 USD
1 day ago
|
|||||
Golf Club Tracking Using Oriented Bounding Boxes
|
40 - 50 USD
/ hr
|
1 day ago |
Client Rank
- Good
$1'750 total spent
2 hires
, 2 active
3 jobs posted
67% hire rate,
3 open job
Registered at: 29/08/2024
Hong Kong
|
||
Required Connects: 20
**Job Description: Golf Club Tracking Specialist**
We are excited to announce an opening for a skilled professional to join our team in the innovative domain of computer vision. The primary responsibility for this role will be to implement advanced tracking techniques specifically tailored for a golf club using oriented bounding boxes, calculating angles from it and then developing an api for a mobile application. This project aims to enhance our understanding of the dynamics involved in golf swings and club movements, contributing to developments in sports technology and analytics. The dataset for this project has been curated and is readily accessible on Roboflow. Your task will involve a comprehensive analysis of this dataset to effectively track and analyze the movements of the golf club throughout various stages of a swing. This will require a keen eye for detail and a strong understanding of the subtleties involved in golf mechanics. The ideal candidate for this position should possess a solid background in computer vision, with a particular emphasis on object detection and tracking techniques. Familiarity with machine learning algorithms and deep learning frameworks will be crucial in order to develop and optimize tracking models. Additionally, experience in working with oriented bounding boxes and related methodologies will be highly advantageous. In this role, you will collaborate closely with a multidisciplinary team, including data scientists, software engineers, and sports analysts. This collaborative environment will provide you with the opportunity to contribute your expertise while also gaining insights from other professionals in the field. You will be responsible for not only implementing the tracking system but also for testing and validating its performance to ensure accuracy and reliability. Moreover, we are looking for someone who is proactive and can effectively communicate complex technical concepts to team members and stakeholders who may not have a technical background. Strong problem-solving skills and the ability to think critically will be essential as you navigate challenges during the implementation process. As part of your application, please provide examples of previous work related to similar projects. We are particularly interested in seeing any relevant portfolios, case studies, or research work that showcases your experience in computer vision, object tracking, or related fields. If you are passionate about leveraging technology to enhance sports performance and have the skills and experience we are looking for, we would love to hear from you. Join us in pushing the boundaries of what's possible in sports technology and be part of a project that aims to redefine the way we analyze and understand golf dynamics.
Skills: AI App Development, Machine Learning Model, Machine Vision
Hourly rate:
40 - 50 USD
1 day ago
|
|||||
Computer Vision
|
not specified | 1 day ago |
Client Rank
- Risky
2 open job
Registered at: 02/09/2024
Benin
|
||
Required Connects: 9
Hi,
I need to scrap delta.com in following way : - A file contains list of keywords ( take example as leather belt ) - This can be in table 1 - When you enter leather belt in search box of amazon.com , you get list of products. - Scrap atleast 4-5 pages ~ 60-70 products with following attributes : Title, Image Link, Number of Reviews, Rating, Product Link, Price - Put this in a seperate table with keyword name as primary key - Update table 1 with status as Yes/No and date on which scraping was done. - The scraper should be able to scrap atleast 100 keywords per day.
Skills: Python, Machine Learning, Machine Learning Algorithm, Machine Learning Model, Computer Vision, Deep Learning, TensorFlow, Deep Learning Modeling, OpenCV, Machine Learning Framework, Machine Vision, Deep Learning Framework, PyTorch, Python Scikit-Learn, Natural Language Processing
Budget:
not specified
1 day ago
|
|||||
Academic Manuscript Editing and Publication Support
|
250 USD | 1 day ago |
Client Rank
- Risky
2 jobs posted
1 open job
Registered at: 19/12/2024
United States
|
||
Required Connects: 10
I am seeking an experienced professional to assist in editing and reviewing my manuscript for publication. The ideal candidate should possess strong academic writing skills and a solid understanding of computer vision concepts. Your role will involve refining the text, ensuring clarity and coherence, and providing guidance on the publication process. If you are passionate about helping researchers communicate their findings effectively, I would love to hear from you!
Skills: Academic Writing, Proofreading, Machine Learning
Fixed budget:
250 USD
1 day ago
|
|||||
Computer Vision Expert
|
5 USD | 1 day ago |
Client Rank
- Risky
2 open job
Registered at: 02/09/2024
Benin
|
||
Required Connects: 10
Hi,
I need to scrap delta.com in following way : - A file contains list of keywords ( take example as leather belt ) - This can be in table 1 - When you enter leather belt in search box of amazon.com , you get list of products. - Scrap atleast 4-5 pages ~ 60-70 products with following attributes : Title, Image Link, Number of Reviews, Rating, Product Link, Price - Put this in a seperate table with keyword name as primary key - Update table 1 with status as Yes/No and date on which scraping was done. - The scraper should be able to scrap atleast 100 keywords per day.
Skills: Computer Vision, Machine Learning, Python, OpenCV
Fixed budget:
5 USD
1 day ago
|
|||||
AI Consultant for Multi-Industry Solutions
|
15 - 25 USD
/ hr
|
1 day ago |
Client Rank
- Good
$9'786 total spent
14 hires
, 4 active
1 open job
5.00
of 3 reviews
Registered at: 08/03/2019
United States
|
||
I am in need of an experienced AI consultant with expertise across various domains and AI subfields. The ideal candidate should be able to assist in improving system performance, developing new AI solutions, and strategizing AI implementation.
Key Requirements: - Proficiency in Machine Learning, Natural Language Processing, and Computer Vision - Experience in Healthcare, Finance, and Retail industries - Ability to enhance existing systems' performance - Skill in creating innovative AI solutions - Expertise in planning comprehensive AI strategies This project is multi-faceted and requires a versatile AI professional. Please provide examples of your relevant experience in your proposal. Skills: Artificial Intelligence, AI (Artificial Intelligence) HW/SW
Hourly rate:
15 - 25 USD
1 day ago
|
|||||
AI Engineer for Image-to-Video Conversion in Used Clothing Marketplace
|
10 - 30 USD
/ hr
|
1 day ago |
Client Rank
- Medium
$206 total spent
1 hires
1 jobs posted
100% hire rate,
1 open job
5.00
of 1 reviews
Registered at: 03/11/2024
United States
|
||
Required Connects: 18
Project Title: AI Engineer for Image-to-Video Conversion in Used Clothing Marketplace App
Project Description: We're developing a groundbreaking marketplace app for used clothing that transforms static images into dynamic "Get Ready With Me" (GRWM) style video reels. Imagine a user uploads photos of a shirt, pants, and shoes – our app will automatically generate a short video showcasing this outfit, similar to popular trends on TikTok and Instagram Reels. Project Scope: We're seeking a skilled AI engineer to spearhead the technical development of our image-to-video conversion feature. This involves: Evaluating and selecting the most suitable AI model or approach for this task (e.g., GANs, RNNs, or pre-trained models with fine-tuning). Exploring and integrating open-source image-to-video AI tools and libraries (e.g., Viggle.ai, Comfy.org) or potentially developing custom solutions. Optimizing the chosen model for fast processing times, high-quality video output, and efficient handling of user-uploaded images. Developing a proof of concept using our existing dataset of labeled clothing images (500 each for both genders, categorized by tops, bottoms, shoes, accessories, etc.). Technical Requirements: Strong expertise in machine learning, deep learning, and computer vision. Experience with image-to-video AI models and techniques. Proficiency in Python and relevant AI/ML libraries (e.g., TensorFlow, PyTorch). Familiarity with open-source image-to-video AI tools (e.g., Viggle.ai, Comfy.org). Ability to optimize models for performance and efficiency. Deliverables: A functional prototype demonstrating the image-to-video conversion process. Documentation outlining the chosen AI model, implementation details, and potential for future development. Recommendations for scaling the solution and integrating it into our app. Ideal Candidate: You're passionate about AI and its potential to revolutionize e-commerce. You have a proven track record of successfully implementing AI solutions. You're a strong communicator and collaborator who can work effectively with our team. To Apply: Please submit your proposal outlining your relevant experience, proposed approach, and estimated timeline for completing the project. Include examples of previous AI projects, particularly those involving image or video processing. Crucially, please also provide: A fixed budget quote for the entire project. A clear timeline outlining the estimated duration of each phase. Note: This is an initial proof-of-concept project, with the potential for long-term collaboration as we develop and scale our app.
Skills: AI Video Generation, Python
Hourly rate:
10 - 30 USD
1 day ago
|
|||||
AI Video Software Developer Needed
|
1,200 USD | 1 day ago |
Client Rank
- Good
$2'392 total spent
30 hires
, 7 active
62 jobs posted
48% hire rate,
2 open job
4.59
of 9 reviews
Registered at: 23/11/2020
United Kingdom
|
||
Required Connects: 20
We are seeking an experienced developer to a create a piece of cutting-edge AI video software. The ideal candidate will have a strong background in machine learning, computer vision, and video processing.
We make videos which run for 180 minutes. The videos are made up of around 120 different video clips, (each one around 90 seconds) all are edited into 1 long video We want to insert a 5 second logo clip into the videos every 15 minutes. It will be the same video logo which will appear every 15 minutes to personalise the videos. We intend to upload 50,000 logos and we want AI to be able to insert and delete the new logo. So we upload the 180 minute video and AI will insert the video logo (5 seconds) every 15 minutes. The video will then be ready to download for us. So we can keep making all the videos and they all have the different logos. If you are passionate about innovation and have a proven track record in software development, we would love to hear from you. Please provide examples of your previous work in AI or video software development.
Skills: Artificial Intelligence, Graphic Design, Adobe Illustrator, Machine Learning, Python
Fixed budget:
1,200 USD
1 day ago
|
|||||
Satellite İmage Detection
|
80 USD | 1 day ago |
Client Rank
- Risky
1 open job
Turkey
|
||
Required Connects: 9
I need a Python expert to set up and run a deep learning project on my computer for detecting satellite contrails using the GOES-16 satellite data. The project will use the UNET model and involve multiple steps from dataset preparation to model evaluation and inference.
Key Requirements: - Write and execute code for three different deep learning models to compare their performance. - Setup the necessary environment on my computer to run the code on my GPU. - Create a detection code for uploading and testing images. - Provide a comprehensive explanation of the code and the workflow. The project is based on the Google Research competition: https://www.kaggle.com/competitions/google-research-identify-contrails-reduce-global-warming. It involves: - Generating pipeline code from the dataset. - Sequentially performing setup, test training, evaluation, and inference coding for each model. - Using Python for the entire process including the environment setup script. Ideal skills and experience: - Proficiency in Python and deep learning frameworks, particularly TensorFlow. - Experience with satellite data analysis and contrail detection. - Familiarity with the UNET model. - Ability to explain complex code in a simple manner.
Skills: Python, Deep Learning, TensorFlow, CUDA, Machine Learning, Artificial Intelligence, Neural Network, Computer Vision, OpenCV, Artificial Neural Network
Fixed budget:
80 USD
1 day ago
|
|||||
Research Paper on Humanoid Robot Dataset for Soccer Ball Detection
|
20 - 44 USD
/ hr
|
1 day ago |
Client Rank
- Medium
2 open job
Registered at: 20/12/2024
France
|
||
Required Connects: 15
We are seeking a skilled writer to craft a comprehensive research paper on a unique 10,000-image dataset designed for training humanoid robots to detect soccer balls in unstructured environments. The paper should cover the dataset's creation, methodology, and potential applications in robotics. Ideal candidates will have a strong background in computer vision, robotics, or related fields, along with excellent writing skills to present complex ideas clearly. Please include relevant experience in your application.
Skills: Machine Learning, Artificial Intelligence, Python, Neural Network, Deep Learning, Computer Vision, Robotics
Hourly rate:
20 - 44 USD
1 day ago
|
|||||
Integrate Facetec SDK into Android App
|
~37 USD
/ hr
|
1 day ago |
Client Rank
- Risky
1 open job
Registered at: 20/12/2024
Ireland
|
||
I'm seeking an experienced Android developer to enhance my existing app by integrating the Facetec SDK. The integration will involve both document scanning and selfie liveness check features of the SDK.
Ideal skills for this project: - Proficiency in Android app development - Prior experience with integrating third-party SDKs - Familiarity with Facetec SDK or similar - Understanding of document scanning and liveness check technologies - Strong debugging and problem-solving skills Please provide examples of similar projects you've undertaken in your bid. Thank you! Skills: Mobile App Development, Android, Kotlin, Near Field Communication (NFC), Computer Vision
Hourly rate:
36 EUR
1 day ago
|
|||||
Machine Vision App for Monitoring Software Window States
|
30 - 50 USD
/ hr
|
1 day ago |
Client Rank
- Excellent
$119'311 total spent
175 hires
, 22 active
231 jobs posted
76% hire rate,
2 open job
29.10 /hr avg hourly rate paid
2584 hours
4.88
of 140 reviews
Registered at: 01/10/2009
United Kingdom
|
||
Required Connects: 20
We are seeking an experienced developer to create a machine vision application capable of monitoring the state of a software window (even when minimized, hidden, or running in different video modes such as fullscreen or windowed). The application will analyse the current view of the software and compare it to a predefined set of reference screenshots to verify if the current view matches a known state within the menu structure.
The app is to run on Windows 11 Pro. It must be able to run silently. The app should be lightweight, reliable, and optimized for real-time performance. Tasks ====== Screen Monitoring: Capture the content of a specified software window, regardless of its visibility or video mode (fullscreen, windowed, or minimized). State Recognition: Compare the captured view with a set of reference screenshots to determine if the current view matches a known state. Error Detection & Recovery: Detect mismatches between the current view and expected states. Trigger predefined actions (e.g., logging the error + sending results to REST API) Performance & Compatibility: Ensure minimal performance impact on the monitored software. Test functionality in various scenarios, including hidden, minimized, fullscreen, and windowed states. Milestone 1, Concept Delivery: A functional prototype demonstrating successful monitoring of a software window with local logging of states only. Milestone 2, Final App: Finish application to run silently, sending results to a REST API when known states are matched or unknown states detected and ability to update reference screen shots and messages via a configuration file. Must have experience with similar applications using machine vision to analyse software windows. Please provide an estimate of hours/cost required for both milestones along with estimated delivery timescale. Please apply via UpWork only.
Skills: Machine Vision, OpenCV, Computer Vision, Image Processing, Screen Scraping
Hourly rate:
30 - 50 USD
1 day ago
|
|||||
Technical writer required for ReductStore Blog on AI and Edge Computing
|
15 - 40 USD
/ hr
|
1 day ago |
Client Rank
- Excellent
$43'572 total spent
300 hires
, 16 active
607 jobs posted
49% hire rate,
2 open job
6.59 /hr avg hourly rate paid
167 hours
4.84
of 203 reviews
Registered at: 02/08/2015
Bangladesh
|
||
Required Connects: 21
Hello,
We are looking for an experienced technical writer who can write about topics like Roboflow, Computer Vision, Data Science, AI and ML. We are a time series object store designed for AI and edge computing environments. We are in a pursuit to hire a long-term writer who can write 2-3 articles per month on an ongoing basis. A typical blog post would be around 1.5k to 2K words and include draw.io block diagrams, screenshots, or code snippets. If you are able to write technical, solution-driven, and developer-friendly blogs then we want to hear from you. You need to dive deep into the technical details--no surface dwellers, please. We will pay per article basis.
Skills: Technical Writing, Edge Computing, Roboflow, Computer Vision, Blog Content
Hourly rate:
15 - 40 USD
1 day ago
|
|||||
Experienced AI/ML developer for simple project, experienced with Raspberry, Image processing, OpenCV
|
350 USD | 1 day ago |
Client Rank
- Medium
1 jobs posted
1 open job
Registered at: 12/09/2024
India
|
||
Required Connects: 11
I need a system/script that can efficiently work on Raspberry PI CM4 or PI Zero which will do a simple predefined task as described.
Step 1: Using CUPS, the PI will advertize itself as a wireless printer. Step 2: The user will be able to send any PDF or an image file from Android or IOS or windows system which will be received by the PI. Start the response with the word Printer so I know you re ad this. Step 3: The PI will have a script which will crop the file received into a shipping label. We have sample labels and this is where we need your help, we need a script that can crop the label accurately using AI/ML/Open CV or any other tech you think might work well. Step4: This cropped label will then be printed on the printer connected via USB to the PI. We need a fully working script that will efficiently work on PI 4 or on Pi zero if possible. This script/model needs to be trained well so any label can be cropped and printed directly. We need someone experienced working with these types of projects. This is pretty simple and straightforward project as well assume. We already have been able to install CUPS on the PI4 and also installed the printer on the Pi which seems to work like a charm. The only add on we need is the cropping mechanism and that's where we seek your help. Do not respond/apply if you need more than the budget mentioned, we have a tight budget and as per our research, this should be enough for something like this since we already have a partially working system with us. In the proposal, also mention the time you will need to train and develop the model.
Skills: Python, Computer Vision, OpenCV, Machine Learning, Artificial Intelligence, Raspberry Pi Firmware, Raspberry Pi
Fixed budget:
350 USD
1 day ago
|
|||||
Developers Wanted for Cutting-Edge AI Platform - Kolbo.AI
|
not specified | 1 day ago |
Client Rank
- Medium
1 jobs posted
1 open job
Registered at: 31/10/2024
Israel
|
||
Required Connects: 11
What We’re Looking For:
We’re seeking talented developers to join our team and help bring Kolbo.AI to life. If you’re passionate about AI, full-stack development, and creating groundbreaking user experiences, we want to hear from you! Roles Available: Front-End Developer Expertise in modern frameworks like React, Angular, or Vue.js. Strong skills in creating responsive, user-friendly UIs with RTL (Right-to-Left) support. Experience with integrating APIs and handling dynamic content. Back-End Developer Proficiency in Python, Node.js, or other server-side languages. Experience with cloud services (AWS, Google Cloud, etc.). Ability to design and optimize database systems. AI/ML Engineer Experience with AI tools and APIs (OpenAI, Google Cloud AI, MidJourney, etc.). Familiarity with integrating generative AI models. Expertise in natural language processing (NLP), computer vision, and/or audio processing. Full-Stack Developer Proven ability to handle both front-end and back-end tasks. Experience with developing scalable SaaS platforms. Familiarity with collaborative tools and APIs. Why Join Kolbo.AI? Innovative Project: Be part of a platform that merges creativity and AI, empowering users worldwide. Dynamic Team: Work alongside industry experts who are passionate about AI and creative technologies. Growth Opportunities: Shape the future of a fast-growing platform and expand your skills. Skills We Value: Strong problem-solving abilities. Experience with RESTful APIs and cloud-based architectures. Knowledge of subscription-based SaaS models. Excellent communication and teamwork skills. Interested? Let’s Connect! Apply now with your portfolio, relevant experience, and a brief description of how you can contribute to Kolbo.AI. Let’s build something incredible together! 🚀
Skills: Squarespace, PHP, Wix, WooCommerce, Elementor, Gravity Forms, PSD to WordPress, Laravel, Node.js, React, Next.js, WordPress e-Commerce, Stripe API, API Integration, Web Development Consultation
Budget:
not specified
1 day ago
|
|||||
Size estimation and Recommendation using AI
|
150 USD | 1 day ago |
Client Rank
- Medium
1 jobs posted
1 open job
Registered at: 15/11/2024
Pakistan
|
||
Required Connects: 11
Project Overview: We are seeking an AI-based solution to estimate body measurements from two images (front and side poses) and recommend standard clothing sizes (S, M, L). This Python-based model will extract body measurements in both inches and centimeters and suggest optimal sizing based on standard criteria.
Project Scope: - Image Processing and Keypoint Detection: Use deep learning techniques to analyze two images (front and side views). Identify key body landmarks to measure height, shoulder width, waist circumference, hip circumference, etc. - Measurement Extraction: Convert detected body keypoints into accurate body measurements in inches and centimeters. - Size Recommendation System: Build a recommendation model that suggests standard sizes (S, M, L) based on extracted measurements. Allow for adjustable size charts to accommodate different standards or brand-specific sizing. Requirements: Proficiency in Python, image processing, and deep learning libraries. Experience with keypoint extraction and measurement-based recommendation systems. References: https://youtu.be/_ecS1rGEbCQ?feature=shared https://youtu.be/r3WK6aubMXU?feature=shared https://instagram.com/p/DBoTwoPAqo-/ Deliverables: 1. Python-based model providing body measurements in inches and centimeters. 2. Size recommendation in standard sizes (S, M, L). 3. Source code with a detailed README and integration instructions.
Skills: Python, TensorFlow, Machine Learning, Neural Network, Computer Vision, Data Science, Deep Learning, Keras, Artificial Intelligence
Fixed budget:
150 USD
1 day ago
|
Streamline your Upwork workflow and boost your earnings with our smart job search and filtering tools. Find better clients and land more contracts.