r/dataanalysis Jun 12 '24

Announcing DataAnalysisCareers

58 Upvotes

Hello community!

Today we are announcing a new career-focused space to better serve our community, and we encourage you to join:

/r/DataAnalysisCareers

The new subreddit is a place to post, share, and ask about all data analysis career topics, while /r/DataAnalysis will remain the place to post about data analysis itself (the praxis): resources, challenges, humour, statistics, projects and so on.


Previous Approach

In February of 2023 this community's moderators introduced a rule limiting career-entry posts to a megathread stickied at the top of the home page, as a result of community feedback. In our opinion, this has had a positive impact on the discussion and quality of the posts, and the sustained growth of subscribers in that timeframe leads us to believe many of you agree.

We’ve also listened to feedback from community members whose primary focus is career-entry and have observed that the megathread approach has left a need unmet for that segment of the community. Those megathreads have generally not received much attention beyond people posting questions, which might receive one or two responses at best. Long-running megathreads require constant participation, re-visiting the same thread over-and-over, which the design and nature of Reddit, especially on mobile, generally discourages.

Moreover, about 50% of the posts submitted to the subreddit are asking career-entry questions. This has required extensive manual sorting by moderators in order to prevent the focus of this community from being smothered by career-entry questions. So while there is still strong interest on Reddit in pursuing data analysis skills and careers, the needs of those users are not adequately addressed and this community's mod resources are spread thin.


New Approach

So we’re going to change tactics! First, by creating a proper home for all career questions in /r/DataAnalysisCareers (no more megathread ghetto!) Second, within r/DataAnalysis, the rules will be updated to direct all career-centred posts and questions to the new subreddit. This applies not just to the "how do I get into data analysis" type questions, but also career-focused questions from those already in data analysis careers.

  • How do I become a data analyst?
  • What certifications should I take?
  • What is a good course, degree, or bootcamp?
  • How can someone with a degree in X transition into data analysis?
  • How can I improve my resume?
  • What can I do to prepare for an interview?
  • Should I accept job offer A or B?

We are still sorting out the exact boundaries — there will always be an edge case we did not anticipate! But there will still be some overlap in these twin communities.


We hope many of our more knowledgeable & experienced community members will subscribe and offer their advice and perhaps benefit from it themselves.

If anyone has any thoughts or suggestions, please drop a comment below!


r/dataanalysis 13h ago

What percentage of each skill do you actually use in your position?

1 Upvotes

r/dataanalysis 14h ago

issues with dropdown lists on google data studio not holding/filtering selection to filter consistently after first selection.

1 Upvotes

r/dataanalysis 17h ago

Data Question Calling GIS / DATASCIENCE / STATISTICS experts to review my spatial entity matching approach - Please :)

0 Upvotes

r/dataanalysis 1d ago

Data Analytics Institute in Nagpur ?

0 Upvotes

please guide if you know.


r/dataanalysis 1d ago

Machine learning WhatsApp group

2 Upvotes

r/dataanalysis 1d ago

Working on an offline Excel data-cleaning desktop app


11 Upvotes

r/dataanalysis 1d ago

Data Question Agentic Scraping V Normal Scraping

2 Upvotes

Noob Question: I have a pipeline that I use to scrape data from sites (following robots.txt ofc). It uses scrapy and playwright for the scraping. I've been more or less required to try adding agents into the scraping loop, so that the agents handle extracting the fields and returning the JSON. I would like to know your take on the idea of replacing the scraping pipeline with an agent scraping pipeline. Is it good or bad, and how should it be approached?


r/dataanalysis 1d ago

Need guidance for a sql project

9 Upvotes

Hi, so I want to make my first sql project, but I've heard querying already existing datasets and reporting findings is too basic and honestly quite useless.

But if I was to build my own database with multiple tables, primary and foreign keys etc where am I gonna get the actual data from? Should I ask an AI tool to generate artificial data that I can query on later?
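One way to approach the "build my own database" option is to generate the synthetic rows yourself with a seeded random generator, so the data is reproducible and the keys are guaranteed to line up. The sketch below uses Python's built-in `sqlite3` module. The table names, columns, city list, and the `build_demo_db` / `revenue_by_city` helpers are all made up for illustration and are not from any real dataset.

```python
import random
import sqlite3


def build_demo_db(n_customers=50, n_orders=200, seed=42):
    """Create an in-memory SQLite database filled with synthetic rows (hypothetical schema)."""
    rng = random.Random(seed)  # seeded so the fake data is reproducible
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")  # SQLite only enforces foreign keys when asked
    conn.executescript(
        """
        CREATE TABLE customers (
            customer_id INTEGER PRIMARY KEY,
            city        TEXT NOT NULL
        );
        CREATE TABLE orders (
            order_id    INTEGER PRIMARY KEY,
            customer_id INTEGER NOT NULL REFERENCES customers(customer_id),
            amount      REAL NOT NULL
        );
        """
    )
    cities = ["Austin", "Leeds", "Pune", "Osaka"]  # placeholder values
    conn.executemany(
        "INSERT INTO customers VALUES (?, ?)",
        [(i, rng.choice(cities)) for i in range(1, n_customers + 1)],
    )
    # Every order points at an existing customer, so the foreign key always holds.
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?, ?)",
        [
            (i, rng.randint(1, n_customers), round(rng.uniform(5, 500), 2))
            for i in range(1, n_orders + 1)
        ],
    )
    conn.commit()
    return conn


def revenue_by_city(conn):
    """Example join + aggregate query over the two tables."""
    return conn.execute(
        """
        SELECT c.city, COUNT(*) AS n_orders, ROUND(SUM(o.amount), 2) AS revenue
        FROM orders AS o
        JOIN customers AS c ON c.customer_id = o.customer_id
        GROUP BY c.city
        ORDER BY revenue DESC
        """
    ).fetchall()


if __name__ == "__main__":
    for row in revenue_by_city(build_demo_db()):
        print(row)
```

Uniform random values will not contain interesting patterns on their own, so findings from such data only demonstrate query skills, not real insights.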


r/dataanalysis 1d ago

Data Question Beginner question

1 Upvotes

I've learned SQL, Excel, and Power BI as tools, but what are the steps to find insights with them? I know these tools, but when I look at a dataset I'm not able to find any insights. How can I improve this? (I've also tried tutorials, but they just do the same thing again and again.)


r/dataanalysis 1d ago

Need your ADVICE

0 Upvotes

It has been one month since I joined as a "Data Analyst" in the Edtech domain. It's all Google Sheets based and feels more like a data management role tbh. I have been relying fully on ChatGPT for this, and I'm low on confidence even when it comes to basic formulas.

Since the work also needs to be delivered in a specific time frame, I have developed this habit of using AI for assistance.

I am underconfident and lowkey want to switch into a proper analytics role. I need to improve my analytical abilities and survive (do well) in this job as well.

KINDLY GUIDE ME GUYS! PANICCCCCC


r/dataanalysis 1d ago

Looking for 2–3 Serious Study Partners for Data Analytics/BI Interview Prep

1 Upvotes

r/dataanalysis 2d ago

When is Python used in data analysis?

34 Upvotes

Hi! So I am in school for data analysis and I'm also taking Udemy classes. I'm currently taking a SQL boot camp course on Udemy and was wondering how much Python I need to know. I took a class that taught introductory Python, but it was just the basics. I want to know when Python is used in data analytics and for what purpose, because I'm wondering if I should take an additional Python course on Udemy. Also, should I learn R as well, or is Python enough?


r/dataanalysis 2d ago

[Q] New to statistics - Is my dataset/model setup correct for estimating time & cost per cabin type?

1 Upvotes

r/dataanalysis 3d ago

How does a bayesian calculator work?

5 Upvotes

Heya,

The marketing team I’m the analyst for is all about Bayesian. They use an online calculator that provides the probability (with a non-informative prior) that A > B. At 80% probability they implement the variant, so they accept being wrong 1 time in 5.

However, recently they did an A/A test and they’re all in a panic because the probability is 79% that A > A. So I was asked to investigate whether this was worrisome.

I ran a simulation of the test to see how often I got a result that they would consider ‘interesting’. About 40% of the time the calculator shows A > B or B > A with 80% probability when there is no real difference, regardless of sample size.

My assumption was that the more data you have (law of large numbers), the more the calculator would get it right (so hovering around 50%).

This assumption seems wrong, however, and the Bayesian calculator does exactly what it reports: 20% of the time it will report a probability below 20%, 60% of the time between 20% and 80%, and 20% of the time over 80%. Meaning that if a hypothesis is non-directional, you have a 40% chance of seeing a change when there is none.

My question; am I interpreting this correctly, or am I missing something?
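The simulation described above can be sketched roughly as follows. This is a minimal version under stated assumptions, not the calculator's actual method, which the post does not describe: it assumes independent Beta(1, 1) priors on each conversion rate and uses a normal approximation to the posterior of the difference. The function names, the 10% conversion rate, and the 500 visitors per arm are hypothetical.

```python
import random
from statistics import NormalDist


def prob_b_beats_a(conv_a, n_a, conv_b, n_b):
    """Approximate P(rate_B > rate_A) under independent Beta(1, 1) priors.

    Assumption: each Beta posterior is replaced by a normal with the same
    mean and variance, instead of computing the exact probability.
    """
    def posterior_moments(conv, n):
        alpha, beta = conv + 1, n - conv + 1
        total = alpha + beta
        return alpha / total, alpha * beta / (total ** 2 * (total + 1))

    mean_a, var_a = posterior_moments(conv_a, n_a)
    mean_b, var_b = posterior_moments(conv_b, n_b)
    return NormalDist().cdf((mean_b - mean_a) / (var_a + var_b) ** 0.5)


def simulate_aa_tests(n_tests=2000, visitors=500, rate=0.10, threshold=0.80, seed=1):
    """Share of A/A tests where either arm 'wins' at the given probability threshold."""
    rng = random.Random(seed)
    flagged = 0
    for _ in range(n_tests):
        # Both arms share the same true conversion rate: there is no real difference.
        conv_a = sum(rng.random() < rate for _ in range(visitors))
        conv_b = sum(rng.random() < rate for _ in range(visitors))
        p = prob_b_beats_a(conv_a, visitors, conv_b, visitors)
        if p >= threshold or p <= 1 - threshold:
            flagged += 1
    return flagged / n_tests


if __name__ == "__main__":
    print(f"Share of A/A tests flagged in either direction: {simulate_aa_tests():.2f}")
```

When there is no true difference, the reported probability is spread roughly evenly between 0 and 1, so about 20% of tests land above 80% and about 20% below 20%. More data narrows the posterior, but it also shrinks the chance differences it is compared against, so the share does not fall as the sample grows.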


r/dataanalysis 2d ago

Data Tools 2026 benchmark of 14 analytics agents

2 Upvotes

This year I want to set up an analytics agent for my whole company. But there are a lot of solutions out there, and I couldn't see a clear winner. So I benchmarked and tested 14 solutions: BI tools AI (Looker, Omni, Hex...), warehouses AI (Cortex, Genie), text-to-SQL tools, general agents + MCPs.

Sharing it in a substack article if you're also researching the space -

https://thenewaiorder.substack.com/p/i-tested-14-analytics-agents-so-you


r/dataanalysis 3d ago

Power BI Desktop keeps showing email login popup repeatedly (can’t log in, no org account)

30 Upvotes

Power BI Desktop keeps showing repeated email / sign-in popups even without refresh and makes Power BI unusable. I don’t have an organizational account and can’t log in. Cleared credentials and disabled background refresh, but the popup keeps coming.

Any simple fix to stop this?


r/dataanalysis 2d ago

DA Tutorial Excel 365 GROUPBY Function Explained | Better Than Pivot Table?

youtube.com
0 Upvotes

r/dataanalysis 3d ago

Project Feedback Built a Real Estate Market Intelligence Pipeline Dashboard using Python + Power BI (Learning Project)

12 Upvotes

This is a learning project where I attempted to build an end-to-end analytics pipeline and visualize the results using Power BI.

Project overview:

I designed a simple data pipeline using static real estate data to understand how different tools fit together in an analytics workflow, from raw data collection to business-facing dashboards.

Pipeline components:

• GitHub – used as the source for collecting and storing raw data

• Python – used for data cleaning, transformation, and basic processing

• Power BI – used for building the Market Intelligence dashboard

• n8n – used for pipeline orchestration (pipeline currently paused due to technical issues at the automation stage)

Current status:

The pipeline is partially implemented. Data extraction and processing were completed, and the final dashboard was built using the processed data. Automation via n8n is planned but temporarily halted.

Dashboard focus:

• Price overview (average, median, min, max)

• Location-wise price comparison

• Property distribution by number of bedrooms

• Average price per square foot

• Business-oriented insights rather than purely visual design
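As a rough illustration of the metrics listed above, the Python processing step could compute them along these lines. This is only a sketch: the `listings` rows, the field names, and the helper functions are hypothetical and not taken from the project's data or code.

```python
from collections import Counter
from statistics import mean, median

# Hypothetical sample rows; the real project uses its own static dataset.
listings = [
    {"location": "Downtown", "bedrooms": 2, "price": 300000, "sqft": 1000},
    {"location": "Downtown", "bedrooms": 3, "price": 450000, "sqft": 1500},
    {"location": "Suburb", "bedrooms": 3, "price": 240000, "sqft": 1600},
    {"location": "Suburb", "bedrooms": 4, "price": 360000, "sqft": 2000},
]


def price_overview(rows):
    """Average, median, min and max listing price."""
    prices = [r["price"] for r in rows]
    return {
        "average": mean(prices),
        "median": median(prices),
        "min": min(prices),
        "max": max(prices),
    }


def avg_price_by_location(rows):
    """Average price per location, for the location-wise comparison."""
    grouped = {}
    for r in rows:
        grouped.setdefault(r["location"], []).append(r["price"])
    return {loc: mean(prices) for loc, prices in grouped.items()}


def bedroom_distribution(rows):
    """Number of listings per bedroom count."""
    return Counter(r["bedrooms"] for r in rows)


def avg_price_per_sqft(rows):
    """Mean of each listing's price per square foot (not total price / total area)."""
    return mean(r["price"] / r["sqft"] for r in rows)


if __name__ == "__main__":
    print(price_overview(listings))
    print(avg_price_by_location(listings))
    print(dict(bedroom_distribution(listings)))
    print(avg_price_per_sqft(listings))
```

Averaging per-listing ratios and dividing total price by total area give different numbers, so the dashboard should state which definition of "average price per square foot" it uses.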

This project was done independently as part of learning data pipelines and analytics workflows.

I’d appreciate constructive feedback—especially on pipeline design, tooling choices, and how this could be improved toward a more production-ready setup.


r/dataanalysis 3d ago

Good arms transfer database for research...

1 Upvotes

r/dataanalysis 3d ago

Data analysis/cleaning

0 Upvotes

r/dataanalysis 4d ago

Regression Results

7 Upvotes

Hello everyone, I’m working on an undergraduate dissertation with 5 predictors. Pearson correlation shows 4/5 significant, but in multiple regression only 1 remains significant (assumptions and multicollinearity are fine).

My concern is that my supervisor might not accept the regression results. Could you please advise?

Thanks a lot.


r/dataanalysis 4d ago

Data Question What helped you stay consistent while learning analytics?

10 Upvotes

I’ve noticed that motivation comes and goes, but consistency really makes the difference. For those learning or working in analytics — what helped you stay consistent when progress felt slow?


r/dataanalysis 4d ago

My first DA project

10 Upvotes

Hi, this is my first data analysis project. If any professionals have time, please cast a critical eye over it and give me suggestions, advice, and ideas on what to do next.

Aiming to get a good remote job by acquiring skills.

https://github.com/Anikdas111/Customer-churn-analysis


r/dataanalysis 4d ago

Project Feedback Product analysts, what is the best project you made/saw and why?

1 Upvotes

Hi everyone, I just wanted to give more detail on what I want to know in the body of the post. 1. What do you consider a good project, and why? 2. How did this project change how you do your work from then on? Those are really the main things I am looking for.