Ali Jamali’s Post

Postdoctoral Fellow (Remote Sensing) || Deep Learning || Python || Earth Observation Expert || GeoAI || GIS || Geoinformatics

3w Edited

I spent two days working on and fine-tuning the #SegmentAnything #Model (SAM) for #road #extraction from #satellite #images. SAM is the first #foundational model developed by Meta #AI #Research, #FAIR. I was impressed by the results achieved by SAM with one epoch model training (precision: 0.64, recall: 0.62, f-score: 0.62, dice: 0.62). We got precision (0.95) , recall (0.53), F1-score (0.65), and dice value of 0.63 by our previously developed model (ResUNetFormer) (https://lnkd.in/g6q-VW2Z)! #deeplearning #remotesensing #roadextraction #founadionalmodels #imageprocessing #datascience #dataengineering

44 Comments

Gijs van den Dool

Senior Geospatial Data Scientist / Independent Researcher

I shouldn't do this, but I posted the message below for awareness, so doing this here as well - please have a look at the third use case (in case you missed the call): https://www.linkedin.com/feed/update/urn:li:activity:7215256099061383168? ==> https://aiforgood.itu.int/event/launch-of-2024-itu-geoai-challenge/ Having two developed models could give you a nice advantage, and do some good at the same time (happy to chat more)

2 Reactions

Gabriel Durkin , DPhil.

Data Science and Quantum Physics

I saw work that SAM wasn’t great for geospatial. Either way, this is already a great challenge. Learning objective functions in segmentation that have some “per pixel” component are biased towards learning “blobby” objects with large area to perimeter ratio - and poorly on vascular threadlike features. It would be interesting to develop a différentiable cost function that focuses more on boundary correctness than bulk. We could also use the gradients of the segment masks as a mask. I feel like this must have been tried. Obviously your bigggest challenge is to improve recall. 📈

2 Reactions

Aninda Ghosh

Passionate Machine Learning Engineer and Data Scientist focused on AI & Computer Vision, with a track record of taking tech startups from 0 to 1 and designing scalable ML solutions and data pipelines.

Was it full model (which encoder version?) finetune or just the classifier and the regressor? Did you try to nudge the Encoder parameters?

1 Reaction

Vincent Markiet

Senior data scientist | Machine learning | hyperspectral | SAR | GeoAI

Have you tried extracting roads with SAMGEO, a finetuned SAM model for for remote sensing data? It might save you a lot of work. https://samgeo.gishub.org/

4 Reactions

Puneeth Shankar

Senior Remote Sensing Data Scientist

You may know that the extracted outcomes are best used when they are treated as a network. In this context the accuracy metric used would have less correlation with the GT. A good representation of model accuracy is defined in one of the space net challenges.

1 Reaction

Ezoa DJANGORAN

Machine learning researcher|Computer vision Engineer |Aircraft Maintenance Engineer B1.1 B2 | Predictive Maintenance ||QT Beech 200 |Store Aircraft Manager|all Module B1.1 B2 obtained

can i have the tutorial of sam finetuning

Simiao Ren, Ph.D.

Hungry new grad looking for growth :p

That is awesome! In one of the previous work the out-of-the-box performance for overhead imagery was very limited for SAM. Glad to see it actually performs much better upon further fine-tuning. cc Saad Lahrichi Link the post for previous work here: https://www.linkedin.com/posts/saadlahrichi_you-can-now-read-our-paper-segment-anything-activity-7191848006810271745-LNZh?utm_source=share&utm_medium=member_desktop

1 Reaction

Josh B.

ML Engineer

That is wild for one epoch. Do you have the fine tuning code public ?

1 Reaction

Ezoa DJANGORAN

Machine learning researcher|Computer vision Engineer |Aircraft Maintenance Engineer B1.1 B2 | Predictive Maintenance ||QT Beech 200 |Store Aircraft Manager|all Module B1.1 B2 obtained

please

See more comments

To view or add a comment, sign in

More Relevant Posts

Tyler Ponte

Senior Recruitment Consultant | AI & Machine Learning | Co-organizer of the AI in Action Podcast
7mo
Report this post
Serious questions for my #GenAI connections: 1. I'm sure you've seen it on X or here on linkedin before, when you ask GPT/DALLE to make you an image and then keep it asking it to make it more of a characteristic, you eventually lead to space drawings. Why does this happen? I kept asking to make a drawing more and more boring. It was extremely boring for about 8-10 pictures then eventually turned galactic. This happens with almost anything I've tried. 2. Why do I have to argue with GPT to make it draw me this. after about 7 tries it says its as boring as it'll ever be and I have to reason with it or trick it into keep drawing things? Very curious why these two things happen. Who can give me some insight? #LLM #GenAI #GPT #DALLE
1 Comment
Like Comment
To view or add a comment, sign in
Awais Mubashar

Artificial Intelligence | Machine Learning | Python | Data Science
4mo
Report this post
Meta introducing SceneScript, a novel method for reconstructing environments and representing the layout of physical spaces. SceneScript was trained in simulation using the Aria Synthetic Environments dataset, which is available for academic use. AI at Meta #meta #ai #datascience #emergingtechnologies #trend #latestnews #artificialintelligence
Like Comment
To view or add a comment, sign in
Sandeep Kumar Kushwaha

Data Scientist & GenAI Specialist | Driving Impactful Solutions with Expertise in LLMs, GenAI, Agents, NLP, & CV
5mo
Report this post
🌟 Exciting News! 🌟 I am thrilled to share another latest breakthrough in AI research! 🚀 The OpenAI team has been exploring the frontier of generative models and video data, and their newest achievement is the development of text-conditional diffusion models trained on both videos and images. 🎥🖼️ Through extensive experimentation, they've harnessed the power of transformer architectures to process spacetime patches of video and image latent codes. The culmination of their efforts has resulted in the creation of the most advanced model yet, Sora. 🌌 Sora is not just any model - it can generate a minute of high-fidelity video, paving the way for unprecedented possibilities in AI-driven content creation. I have compiled some of their premium-quality work in the video below. 🎬✨ But the significance of their work goes beyond just video generation. Imagine making a whole movie just by writing the scripts all by yourself. 🌍💡 Our journey to unlock the potential of AI in understanding and simulating the complexities of our world is just beginning. Every day we continue to push the boundaries of innovation and shape the future of technology! 👩💻🚀 #AI #GenerativeModels #VideoGeneration #Innovation #TechBreakthrough #ArtificialIntelligence #FutureTech #ResearchDevelopment #sora #openai #gemini #geminiai
Like Comment
To view or add a comment, sign in
Mohammad Alnobani

Co-Founder and CEO - The Middle Frame | One Young World Ambassador
3mo
Report this post
Crucial insight: #GenerativeAI for #images isn't accurately representing the #Arab world. It's a real problem... but we (The Middle Frame) are solving it Google DeepMind, OpenAI, and NVIDIA have made strides, but Arab imagery often falls short. Our platform is changing that. We curate authentic Arab imagery, showcasing the richness of our culture Join us in redefining the future of Arab #imagery. Let's ensure our stories are told accurately and our culture celebrated authentically, not only within the stock image industry, but also within the generative AI industry #TheMiddleFrame #GenAI #ArabImagery #Authenticity #Diversity #Innovation
8 Comments
Like Comment
To view or add a comment, sign in
Abhijeet Pujara

Data Engineer || ETL || Pipeline || Data Lake || Databricks || Data Warehouse || Azure Synapse || Data Factory || Data Analytics || Machine Learning Engineer || CI-CD
7mo Edited
Report this post
Hey LinkedIn fam! 👋 I'm thrilled to share my latest article, diving deep into the fascinating world of Mask R-CNN, a groundbreaking technique in computer vision. Read the full article here: https://lnkd.in/dYWeGqcJ Whether you're a seasoned computer vision enthusiast or just getting started, I guarantee there's something in it for everyone. 🎓🚀 Let's keep pushing the boundaries of AI and computer vision together! Feel free to share your thoughts and insights. I'd love to hear your perspective on the exciting advancements in this field. #ComputerVision #MaskRCNN #AI #DeepLearning
Like Comment
To view or add a comment, sign in
Ekhlaque Bari

Founder & CEO XdotO | Generative AI | Keynote Speaker | Masterclasses | Storyteller | AI Coach | ISB | IIM | Boards and CXOs | Enterprise Strategist@MINFY | Ex-CIO GE, Max Life, Jubilant, SMFG, SBI Card
4mo
Report this post
Introducing Claude - Anthropic's Intelligent and Ethical AI Assistant It is now available in India. Anthropic claims Claude is the most accurate, secure and intelligent conversational AI yet. They also claim that their AI development is most ethical. Claude offers a compelling and trustworthy assistant experience. Explore Claude's capabilities today and share whether its claims are true. https://claude.ai/ #generativeai #ai #chatbots #chatGPT #Gemini #bing

XdotO Consulting & Coaching

184 followers
4mo

Yesterday, Anthropic dropped some major news in the world of AI! They unveiled a super exciting update to the Claude 3 model family. One that’s like a big leap forward in the evolution of artificial intelligence! 🚀 So, what is the update? Well, Anthropic introduced two new members to the Claude 3 family: Opus and Sonnet. And get this, they're now accessible worldwide through claude.ai and the Claude API. Yep, that's right, they're available in 159 countries, including India! 🌍 And there's more to come with Haiku joining the crew soon! But wait, there's a standout star in this update — Claude 3 Opus! It's outperforming big players like OpenAI's GPT-4 and Google's Gemini 1.0 Ultra on benchmark exams. Anthropic is pretty proud of Opus, claiming it's almost like having a human brain with its crazy comprehension skills and top-notch vision capabilities. Oh, and if you want to dive deeper into the nitty-gritty, Anthropic's blog spills all the details. Opus can handle all sorts of visual stuff like photos, charts, and even technical diagrams like a pro. It's like having a visual genius right at your fingertips! At XdotO, we're buzzing with excitement about this game-changing development! We can't wait to see where Claude 3 takes us next in the world of AI. So, come join us on this thrilling journey as we explore the endless possibilities of this groundbreaking advancement! #XdotO #Claude3 #AI #TechNews #LearnWithXdotO Ekhlaque Bari Gopika Misra

5 Comments
Like Comment
To view or add a comment, sign in
Jyoti Dwivedi

Data Science Enthusiast | Data Analytics | Python Programming | MySQL | Data Visualization | Power BI | Tableau | MS Excel
5mo
Report this post
Hii Connection... SVM(Support Vector Machine) is a supervised learning algorithm used in machine learning to solve the classification and regression problems. For solving classification problems we use SVC (Support Vector Classifier). You can check my GitHub for more detailed explanations:https://lnkd.in/g3A5N9HQ #datascience #machinelearning #supportvetermachine #svc #svr #dataanalysis #datamodeling #dataprocessing #artificialintelligence #datamining #patternrecognition #modeltraining #ai
Like Comment
To view or add a comment, sign in
XdotO Consulting & Coaching

184 followers
4mo
Report this post
Yesterday, Anthropic dropped some major news in the world of AI! They unveiled a super exciting update to the Claude 3 model family. One that’s like a big leap forward in the evolution of artificial intelligence! 🚀 So, what is the update? Well, Anthropic introduced two new members to the Claude 3 family: Opus and Sonnet. And get this, they're now accessible worldwide through claude.ai and the Claude API. Yep, that's right, they're available in 159 countries, including India! 🌍 And there's more to come with Haiku joining the crew soon! But wait, there's a standout star in this update — Claude 3 Opus! It's outperforming big players like OpenAI's GPT-4 and Google's Gemini 1.0 Ultra on benchmark exams. Anthropic is pretty proud of Opus, claiming it's almost like having a human brain with its crazy comprehension skills and top-notch vision capabilities. Oh, and if you want to dive deeper into the nitty-gritty, Anthropic's blog spills all the details. Opus can handle all sorts of visual stuff like photos, charts, and even technical diagrams like a pro. It's like having a visual genius right at your fingertips! At XdotO, we're buzzing with excitement about this game-changing development! We can't wait to see where Claude 3 takes us next in the world of AI. So, come join us on this thrilling journey as we explore the endless possibilities of this groundbreaking advancement! #XdotO #Claude3 #AI #TechNews #LearnWithXdotO Ekhlaque Bari Gopika Misra

2 Comments
Like Comment
To view or add a comment, sign in
Mohsen Falahat

✔️️Data Engineer by Day, Data Scientist by Night From Pipelines to Predictions Follow for Latest AI News
5mo
Report this post
OpenAI company unveiled its first text-to-video model called Sora. This model has the ability to produce videos up to 60 seconds that show very detailed scenes, complex camera movements and multiple characters with lively emotions. Currently, very few people have access to this model, which will soon increase. The video you see was produced with Sora and Prompt below: "Beautiful, snowy Tokyo city is bustling. The camera moves through the bustling city street, following several people enjoying the beautiful snowy weather and shopping at nearby stalls. Gorgeous sakura petals are flying through the wind along with snowflakes." #machinelearning #ai #datasciences #openai #texttovideo
Like Comment
To view or add a comment, sign in
Anas Khan

Co-founder @ Modularity AI
3mo
Report this post
In the recent wave of open-source breakthroughs huggingface dropped a revolutionary 𝗔𝗻𝗶𝗺𝗮𝘁𝗲𝗗𝗶𝗳𝗳 pipeline that allows a diffusion model to generate subsequent frames with help of pre-trained motion adapter weights. 🚀 To demonstrate the working of this modular approach towards AI, 𝙈𝙤𝙙𝙪𝙡𝙖𝙧𝙞𝙩𝙮 𝘼𝙄 has developed an open source hugging face space. The space is capable of generating 2 second long animations (16 frames). Following are customizations available: 1. 𝗡𝗲𝗴𝗮𝘁𝗶𝘃𝗲 𝗣𝗿𝗼𝗺𝗽𝘁: Guides the diffusion process away from undesired features and styles. 2. 𝗚𝘂𝗶𝗱𝗮𝗻𝗰𝗲 𝗦𝗰𝗮𝗹𝗲: Determines how strongly the model adheres to the provided guidance versus exploring randomness (7.5 is a good balance for the diffusion model selected in the space) 3. 𝗜𝗻𝗳𝗲𝗿𝗲𝗻𝗰𝗲 𝗦𝘁𝗲𝗽𝘀: It is the number of iterations used during the diffusion process to gradually denoise a sample. It plays a crucial role in the quality of generated frames. (Our space provides a maximum of 24 steps) 4. 𝗔𝗱𝗮𝗽𝘁𝗲𝗿 𝗖𝗵𝗼𝗶𝗰𝗲: These are LoRA weights that help define motion of perspective for the Animate Diff pipeline. Link to HF Space: https://lnkd.in/dgav4Ftg #huggingface #stablediffusion #generativeAI #GenAi #AI #texttovideo
Like Comment
To view or add a comment, sign in

4,260 followers

142 Posts

View Profile Follow

Ali Jamali’s Post

More Relevant Posts

Explore topics