Page 66«..1020..65666768..8090..»

Eco-Friendly AI: How to Reduce the Carbon and Water Footprints of Your ML Models – Towards Data Science

11 min read

As we push the boundaries of AI, especially with generative models, we are confronted with a pressing question that is forecasted to only become more urgent: What is the environmental cost of our progress? Training, hosting, and running these models arent just compute-intensive they require substantial natural resources, leading to significant carbon and water footprints that often fly under the radar. This discussion has become even more timely with Googles recent report on July 2, 2024, highlighting the challenges in meeting their ambitious climate goals. The report revealed a 13% increase in emissions in 2023 compared to the previous year and a 48% rise compared to their baseline year of 2019. The demand for AI has significantly strained data centers, a trend reflected in Microsofts environmental sustainability report from May, which noted a 29% increase in emissions above their 2020 baseline due to data center usage. Additionally, the International Energy Agency predicts that global data center and AI electricity demand could double by 2026, underscoring the urgent need for sustainable practices. For everyone

See the original post:

Eco-Friendly AI: How to Reduce the Carbon and Water Footprints of Your ML Models - Towards Data Science

Read More..

Stratos Idreos Appointed Faculty Co-Director of Harvard Data Science Initiative – Harvard School of Engineering and Applied Sciences

Stratos Idreos, the Gordon McKay Professor of Computer Science at the Harvard John A. Paulson School of Engineering and Applied Sciences (SEAS), has been named Faculty Co-Director of the Harvard Data Science Initiative (HDSI). Idreos will join Francesca Dominici, Faculty Director of the HDSI, in leading the Initiative.

Since its launch in 2017, the HDSI has united data science research and education efforts across the university. It has brought together leading computer scientists, statisticians, and experts from a range of disciplines including law, business, education, medicine and public health to advance data science in personalized health, public policy, scientific discovery and more.

David C. Parkes, the John A. Paulson Dean of SEAS, served as Faculty Co-Director of the Initiative from 2017 to 2023.

At SEAS, Idreos directs the Data Systems Laboratory. His lab is developing fast self-designing data engines that accelerate research and improve productivity across several data-intensive areas including data analytics, data science and artificial intelligence. Self-designing data engines shape themselves automatically to the data, hardware and application context to achieve the best possible performance.

Idreos joined SEAS in 2014. He completed his undergraduate studies and masters degree at Technical University of Crete in Greece, and his Ph.D. at the University of Amsterdam. He has received the United States Department of Energy Early Career Award, an NSF CAREER Grant, and multiple recognitions from the Association for Computing Machinery's Special Interest Group on Management of Data. In 2023, he was awarded the Capers W. McDonald and Marion K. McDonald Award for Excellence in Mentoring and Advising.

Continue reading here:

Stratos Idreos Appointed Faculty Co-Director of Harvard Data Science Initiative - Harvard School of Engineering and Applied Sciences

Read More..

Warren Kibbe, Ph.D. – Deputy Director for Data Science and Strategy – National Cancer Institute (.gov)

Warren Kibbe, Ph.D., Deputy Director for Data Science and Strategy

As NCI deputy director for data science and strategy, Warren A. Kibbe, Ph.D., FACMI, serves as the senior advisor to the NCI director for all matters related to data science. He provides strategic direction to the NCI Center for Bioinformatics and Information Technology (CBIIT) and manages and oversees all aspects of data science for the institute.

In this role, Dr. Kibbe provides strategic counsel on the development and implementation of key data science initiatives, including the NCI Childhood Cancer Data Initiative, the Cancer Research Data Commons, and the ARPA-H Biomedical Data Fabric Toolbox. He also serves as senior data science liaison to a variety of NIH and other government committees.

Under Dr. Kibbes leadership, NCI is applying new approaches to enhance NCIs data ecosystem, growing a diverse and talented data science workforce and building strategic partnerships to develop and disseminate advanced technologies and methods.

Until June 2024, Dr. Kibbe served as the chief for Translational Biomedical Informatics and vice chair of the Department of Biostatistics and Bioinformatics in the Duke University School of Medicine and as chief data officer for the Duke Cancer Institute. He also served as director of informatics for the Duke Clinical and Translational Science Institute.

Before joining Duke, Dr. Kibbe served as an acting deputy director of NCI from 2016 to 2017 and director of NCI CBIIT from 2013 to 2017. While at NCI, he enhanced the institutes digital capabilities, including biomedical informatics, scientific management information systems, and computing resources. He also helped establish the Genomic Data Commons and the NCI Cloud Pilots (now Cloud Resources), was instrumental in establishing NCIs partnership with the U.S. Department of Energy to advance precision oncology and scientific computing, and played a pivotal role in precision medicine and Cancer Moonshot activities, engaging both federal and private sector partners in cancer research.

Dr. Kibbes research interests include data representation for clinical trials and improving data interoperability between electronic health records and decision support algorithms. He has been a proponent for open science and open data in biomedical research and helped define the data sharing policy for the Cancer Moonshot. In 2018, Dr. Kibbe was elected a fellow of the American College of Medical Informatics.

Dr. Kibbe received his Ph.D. in Chemistry from the California Institute of Technology and completed his postdoctoral fellowship in molecular genetics at the Max Planck Institute in Gttingen, Germany. He received his B.S. in Chemistry from Michigan Technological University.

Read more:

Warren Kibbe, Ph.D. - Deputy Director for Data Science and Strategy - National Cancer Institute (.gov)

Read More..

Northwestern Mutual Data Science Institute: The promise of corporate-academic partnerships – University of Wisconsin-Milwaukee

Ethan Gu, a PhD student in computer science at Marquette University, talks with Onochie Fan-Osuala, an associate professor at UW-Whitewater, at a Northwestern Mutual Data Science Institute event. (Jaclyn Tyler photo)

In 2018, three influential local organizations Northwestern Mutual, Marquette University and the University of Wisconsin-Milwaukee united to form the Northwestern Mutual Data Science Institute (NMDSI) with a shared focus on the ever-evolving and dynamic domains of data science and artificial intelligence.

Today, the NMDSI remains committed to advancing research innovation, creating education pathways and driving community impact; our mission holds greater relevancy than ever before.

Northwestern Mutual, Marquette and UWM all recognize and contribute to Southeastern Wisconsins trajectory as a national hub for technology. With nearly $75 million in investments committed to date, NMDSI continues to push the boundaries for what is possible.

The institute also provides funding for an endowed professorship at each university, research projects/grants, student scholarships, new faculty recruitment, development of expanded curriculum and new degree programs, K-12 STEM and pre-college programming, includinginternships and certificates. Fostering increased collaboration and generating innovative project ideation across the institute and our community partners, our NMDSI-affiliated faculty have garnered more than $17 million in grants to date.

Last fall, as part of our five-year renewed investment, we developed a new Center of Excellence initiative through which we announced three innovative programs and engagement opportunities for our university partners. As a result, we awarded $500,000 in research dollars as part of the Paving ROADS Seed Fund Program, $100,000 on behalf of the Pioneer Curricula Program and $175,000 for the NMDSI Student Research Scholars Program. Moreover, the NMDSI also organizes numerous events that have attracted thousands of attendees since our inception. This is only the beginning of what is possible when industry collaborates with academia.

I am extremely proud of how far we have come over the past six years and am even more excited about our future. We will soon be rolling out our NMDSI Industry Affiliate Program, an opportunity to add additional partners who share in the mission of the NMDSI. In addition to our ongoing monthly speaker series, we have various upcoming engagement opportunities through the end of the year, including a Faculty AI Summit in partnership with the Higher Education Regional Alliance and a national conference focused on the ethics of AI. I encourage you to visit the NMDSI website to stay up to date on all our news/events, sign up for our newsletter, and follow us on LinkedIn.

Continue reading here:

Northwestern Mutual Data Science Institute: The promise of corporate-academic partnerships - University of Wisconsin-Milwaukee

Read More..

Introduction to Linear Programming Part II | by Robert Lohne | Jul, 2024 – Towards Data Science

Last year, I was approached by a friend who works in a small, family-owned steel and metal business. He wanted to know if it was possible to create something that would help him solve the problem of minimising waste when cutting steel beams. Sounds like a problem for linear programming!

When I started out, there was not a huge amount of beginners articles on how to use linear programming in R that made sense for somebody not that versed in math. Linear programming with R was also an area where ChatGPT did not shine in early 2023, so I found myself wishing for a guide.

This series is my attempt at making such a guide. Hopefully it will be of use to someone.

This is part II of the series, if you are looking for a introduction to linear programming in R, have a look at part I.

If you read the theory behind linear programming, or linear optimisation, you end up with a lot of math. This can be off-putting if you dont have a math background (I dont). For me, its mostly because I never took enough math classes to understand a lot of the symbols. Initially, this made understanding the tutorials surrounding linear programming harder than it should have been. However, you dont need to understand the math behind the theory to apply the principles of code in this article.

See the rest here:

Introduction to Linear Programming Part II | by Robert Lohne | Jul, 2024 - Towards Data Science

Read More..

Continual Learning A Deep Dive Into Elastic Weight Consolidation Loss – Towards Data Science

One of the most significant challenges in training artificial neural networks is catastrophic forgetting. This problem arises when a neural network trained on one task (Task A) subsequently learns a new task (Task B) and, in the process, forgets how to perform the original task. In this article, we will explore a method to address this issue known as Elastic Weight Consolidation (EWC). EWC offers a promising approach to mitigate catastrophic forgetting enabling neural networks to retain knowledge of previously learned tasks while acquiring new skills.

All figures in this article are by author unless otherwise specified

It has been shown that there exist many configurations of optimal parameters with a desired low error on a task gray and yellow regions for tasks A and B respectively in the above figure. Assuming we found one such configuration * for task A, when continuing to train the model from such configuration to a new task B we have three different scenarios:

Read more:

Continual Learning A Deep Dive Into Elastic Weight Consolidation Loss - Towards Data Science

Read More..

Managing ML Projects: The CRISP-DM Process | by Mohsen Nabil | Jul, 2024 – DataDrivenInvestor

In this article, we will talk about solving problems using data science and machine learning. Before we go into the technical details, lets answer an important question: why should we follow a process?

As engineers, we love solving problems and often want to jump straight into finding solutions. However, if we dont properly define the problems were working on, we can waste a lot of time and money on the wrong issues. Additionally, if we dont follow a structured approach and perform the right tasks in the right order, we risk inefficiency and failure. For instance, if we start modeling before cleaning and processing our data, we might end up with a poor-quality model, regardless of our efforts. The old saying garbage in, garbage out is particularly relevant here.

A systematic process is essential for organizing work, distributing responsibilities among team members, and ensuring that each step is completed properly. Various processes are available for data science projects, but in this article, we will focus on the most common one: CRISP-DM (Cross Industry Standard Process for Data Mining).

CRISP-DM was developed in 1996 by a group of European companies from different industries. It is a flexible, industry-neutral approach to data mining and machine learning projects. Even though it is over 25 years old, it remains the most widely used method in data science. Major corporations, including IBM, use CRISP-DM or versions of it.

See the article here:

Managing ML Projects: The CRISP-DM Process | by Mohsen Nabil | Jul, 2024 - DataDrivenInvestor

Read More..

Modeling the Extinction of the Catalan Language | by Pol Marin | Jun, 2024 – Towards Data Science

Applying existing literature to a practical case Photo by Brett Jordan on Unsplash

Can we predict the extinction of a language? It doesn't sound easy, and it indeed shouldnt, but it shouldnt stop us from trying to model it.

I was recently interested in this topic and started reviewing some of the existing literature. I came across one article[1] that I enjoyed and thought of sharing.

So, in this post, Ill be sharing the insights of that paper, translated into (hopefully) a simple read and applied to a practical case so we can see data science and mathematical modeling in action.

I am Catalan and, for those who dont know, Catalan is a co-official language in Catalonia, Valencian Community, and the Balearic Islands (Spain) along with Spanish. Its also the official language in Andorra, found in the south of France and even in Alghero (Italy).

Its often that we see on local TV or media that the Catalan language is at risk of extinction. Focusing only on Catalonia, we can easily dig deeper into the case because the government takes care of studying the use of the language through whats called the survey of linguistic uses of the population (Enquesta dusos lingistics de la poblaci)[2].

See more here:

Modeling the Extinction of the Catalan Language | by Pol Marin | Jun, 2024 - Towards Data Science

Read More..

AI Innovations and Their Lasting Impact on Data Science – TechiExpert.com

This is an era of fast-paced technological landscape. Artificial intelligence (AI) is the transformative force amid such a scenario and particularly in the data science sector. Synergy between the two has revolutionized data analysis. Simultaneously, new horizons have been opened up for innovative applications.

Automated Machine Learning (AutoML) is an innovation result of the combination. It democratizes access to machine learning capabilities. It automates complex tasks such as data transformation, algorithm selection, parameter tuning and results interpretation. It saves time in data analysis and also makes advanced analytical tools accessible to a broader audience.

Machine learning has enhanced predictive analytics too by incorporating deep learning, neural networks and more such techniques. The technologies continuously improve accuracy as it keeps on learning from vast datasets. AI-driven predictive analytics can forecast disease outbreaks. It can also forecast health risks of specific patient.

Natural Language Processing (NLP) has revolutionized how data scientists interact with data. It enables meaningful information extraction from text sources like social media posts, emails and documents. It has led to the development of various applications. It also bridges the gap between human language and computer understanding.

It is true that AI has greatly improved data visualization techniques. It has become more interactive and insightful. It can help in identifying patterns and correlations by data analysis. The resulting visualizations are clearer and more compelling. Hence, it helps business executives and stakeholders to grasp complex information quickly. This further facilitates better decision-making and strategic planning.

One of the most important areas is that AI practice should be ethical. AI systems are unbiased based on the data they are trained on. Hence, the focus should be in developing such algorithms that prevents and eliminates biases.

Go here to see the original:

AI Innovations and Their Lasting Impact on Data Science - TechiExpert.com

Read More..

The best Python classes, certificates and bootcamps in NYC – Time Out

There are many programming languages and classes that you could take, so why should you take a course in Python? For starters, Python is one of the fastest-growing programming languages with around 8.2 million users. This vast user community means youll have plenty of company, and a deep network of online forums, local meetups, and the open-source community where youll be able to learn and share. If you are a developer, theres a good chance youll work on projects where Python skills are needed. Python is also one of the easiest languages to learn and use. It was designed to be concise and easy to read, with an efficient syntax that uses fewer lines of code than other languages.

Python has a large collection of frameworks and libraries that speed up your work, and it is compatible across operating systems. Python is extremely versatile and is used for diverse applications, like web development, data science, machine learning, artificial intelligence, and scientific computing. This versatility makes Python an in-demand skill and it means youll be able to choose from a wide variety of career options. In addition, Python is an open-source language, making it license-free and open to contributions from diverse user groups.

Learning Python helps you advance your career as a developer or newly enter the field. If you already have a quantitative background or analytical skills, adding a specialized data science skill set can lead to a high-paying job in fields like data science, advanced analytics, or business intelligence. Employment for software developers is strong and still growing so many opportunities are waiting for people with Python skills.

Python is such a popular programming language that there are many resources available. You can find step-by-step lessons online that introduce the basics or explain specific tasks. This is a great way to become familiar with the capabilities of this robust language. You can also join communities online and on social media where people share what they know and answer questions.

If you want a comprehensive education in Python and a solid set of skills that will help you launch a programming career, Python courses are a great option. They systematically cover the skills youll need with guidance from an instructor whos an expert in the industry. When youre learning Python, its essential to put your budding skills to use and get plenty of practice. The more you practice the better youll become, and building projects is a great way to improve your understanding of the language. Look for classes that incorporate real-world projects, give you hands-on practice, and help you build a portfolio of your skills.

Choosing the best Python course may seem daunting. Some courses are only a few hours and others extend for weeks. Courses may specialize in certain aspects of the language or focus on a particular application, like data science or machine learning. Choosing the right course for you depends on whether you are interested in learning Python as a hobby or for your career, and which field youre trying to enter.

If youre new to Python, look for a course thats designed for beginners. Youll want a basic orientation to the language and the logic behind it so you can start to think like a programmer. For more advanced skills, many classes will teach you to use libraries like Scikit-Learn, Numpy, and MatPlotLib. Advanced courses may focus on Python skills for particular fields like data science or finance. Look for a course with a live instructor who can answer your questions and provide feedback, and check whether youll be able to work on real-world projects that give you practice with Python and help you begin to build a portfolio. Python private tutoring is a good option if youd like one-on-one sessions where you can learn at your own pace and focus on the topics that meet your goals.

What you will learn in a Python course depends on your skill level and the way that you want to use Python. If youre completely new to Python, youll start by learning about the development environment and Python syntax and structure. Then, youll learn to work with different types of data and variables and write control structures like conditional statements and loops. Youll gain an understanding of blocks of code called functions and how to order data into structures. Once you understand how to use Python, you can explore the way that libraries expand its capabilities.

Object-oriented programming is useful to know because many other programming languages use it in addition to Python. OOP allows you to write concise, legible code and create secure and reliable software. You can interact with databases using libraries like SQLite or SQLAlchemy, and you can manipulate and visualize data using libraries like Pandas, Matplotlib, or Seaborn, making your data more meaningful and shareable. If you are using Python for web development, youll explore web frameworks like Flask or Django and learn to create simple web applications. You can also automate tasks using Python scripts and libraries like selenium. Last but not least, youll learn to test your programs and use debugging techniques and tools.

By the end of a comprehensive Python course, you should be able to write Python scripts, use Pythons powerful libraries and frameworks, and develop simple applications or perform data analysis tasks.

Python is widely used in NYC across various industries and by numerous companies, making it a valuable skill to learn. Financial firms like Goldman Sachs and JP Morgan Chase use Python for developing trading algorithms, risk management systems, and quantitative analysis, and they use Pythons libraries to process and visualize large datasets that guide investment decisions. NYCs thriving ecosystem of AI startups relies on Python libraries like TensorFlow, Keras, and Scikit-Learn to build machine learning models and AI-driven applications. Python frameworks like Django and Flask allow them to develop robust and scalable applications quickly.

At media companies like Spotify and The New York Times, Pythons versatility with big data makes it ideal for analyzing user data and improving content recommendations and ad targeting. E-commerce companies like Etsy and Warby Parker also use Python to develop recommendation systems for consumers. Python is invaluable for analyzing sales data, managing inventory, and predicting consumer trends using data analytics.

Pythons powerful data analysis capability is also widely used in the booming field of life sciences, where it powers bioinformatics and healthcare research by companies like Pfizer and Memorial Sloan Kettering Cancer Center, and it helps to analyze patient data and improve healthcare. Pythons simplicity and readability make it a popular choice for educational software development at platforms like Khan Academy and Coursera. Real estate firms like Zillow and Compass use Python to analyze market trends and property values, and it excels at automating tasks like property listings and data collection.

A Python bootcamp is worthwhile if you want to build your skills in this versatile and widely-used programming language. When you complete a Python bootcamp, youll learn essentials like basic syntax and data structures and get hands-on experience building programs. A bootcamp can help you leap forward from basic programming skills to specialties like data science and machine learning, and it often includes mentoring, career coaching, and networking opportunities. A Python bootcamp will give you the confidence to tackle real-world projects and boost your job prospects in tech-savvy cities like NYC. A Python bootcamp can be a great investment in your tech career.

Enrolling in a career-focused Python bootcamp can help you prepare for a future career in one of several different industries. Python is heavily utilized in the field of data science, so if you are interested in becoming a data scientist or a data analyst, learning the skills youll pick up in a data book camp will greatly improve your chances of finding work. Since data has become such an essential part of virtually every industry, these skills are highly marketable, especially in a city like NYC where so many finance and investing firms are located (FinTech is a huge part of data science after all). In NYC, data scientists can expect to earn over $100,000 a year.

Beyond working in data science, Python is an important part of the emerging technological revolutions in machine learning and artificial intelligence. Python is utilized to write the algorithms that allow for LLMs, Chatbots and other artificial intelligence applications to operate and it is an important part of the learning process, allowing machines to read and interpret large amounts of data with the help of a human operator. It is difficult to tell what the future holds for this emerging technology, but it is certainly having an impact across a range of industries (including finance, commerce and advertising, all of which are key parts of the NYC economy). Learning how to program these algorithms is an important aspect of leveraging this technology and businesses are paying a premium for skilled Python Developers who can help them take advantage of these automated systems.

Even if you arent aiming for a career in Python, learning how to manipulate, collect and query data is useful for any aspiring professional looking to grow their brand, get attention to their start-up or otherwise work with investors to get a project off the ground. Learning how to use basic Python programming skills and techniques will ensure that you arent leaving valuable information on the table, particularly as data analysis becomes increasingly important for anyone to get an edge in the market. You dont want to get left behind and taking a Python bootcamp can help ensure that you are able to leverage data to suit your needs.

See the rest here:

The best Python classes, certificates and bootcamps in NYC - Time Out

Read More..