Blog


To make a valid decision about using an intervention, you cannot rely on the outcomes of a single study. The results may vary from one study to another for different reasons, such as confounding factors and the use of distinct study samples. That is when meta-analysis comes into the picture.

Meta-analysis is essentially a statistical analysis of multiple studies within your area of study. The principle of meta-analysis is the identification of a common effect across all conceptually similar studies, while allowing for a specific degree of error within each study. Widely used in the field of medicine, the meta-analysis process synthesises data across the studies, quantifies and analyses inconsistencies between them, and investigates publication bias.
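As a rough illustration of what this synthesis looks like in practice, the snippet below is a minimal sketch in R, assuming the metafor package and a purely hypothetical data frame of per-study effect sizes (yi) and their sampling variances (vi); it is not tied to any particular study.

library(metafor)   # widely used R package for meta-analysis

# Hypothetical per-study effect sizes and sampling variances
dat <- data.frame(
  yi = c(0.32, 0.15, 0.48, 0.21, 0.40),
  vi = c(0.020, 0.015, 0.030, 0.018, 0.025)
)

res <- rma(yi = yi, vi = vi, data = dat)  # random-effects model pooling the studies
summary(res)   # pooled estimate plus heterogeneity statistics (Q, I^2, tau^2)
forest(res)    # forest plot of the individual and pooled effects
funnel(res)    # funnel plot, commonly used to inspect publication bias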

 

Today, many journals encourage research scholars to submit meta-analysis papers that summarise the body of evidence on a certain research question. For example, a research document that reports the outcomes of a primary study may include a meta-analysis in the introduction section to synthesise prior data and place the new study in context. However, writing such a paper is undoubtedly a strenuous task. Essentially, a meta-analysis paper includes sections such as:

  1. Impressive topic - The secret of crafting a perfect meta-analysis paper is identifying a topic that defines the scope of the body of the text. To find a topic, you may explore previous findings that explain the known concepts.
  2. Abstract - Although conciseness needs to be maintained in this section, you cannot afford to miss significant concepts. The abstract must include subsections such as background, selection criteria, data sources & synthesis, and quality criteria. 
  3. Introduction - This section must provide an in-depth background to the current review, including reviews of previous studies and state-of-the-art knowledge in the area of study. The significance of the review must also be stated here.
  4. Methods - This is the most important section of a meta-analysis paper. It must provide information about the search strategies (including the databases searched), the types of studies and participants, the approaches used to assess the methodological quality of the studies, how the heterogeneity of the studies was evaluated, and in-depth information about subgroup analyses. In this section, you must explicitly define the significant outcome measure(s). The approaches used for data collection and analysis must be explained so that an independent reader can interpret the results. Information regarding the handling of missing data and the existence of biases must also be mentioned.
  5. Results - This section must state the total number of studies that were analysed as well as the aggregate number of participants. Summary features of the studies must be included, such as sample sizes, design, interventions, and outcomes. An analysis of the relative distribution of potential variables among the various studies should also be included.
  6. Conclusion & discussion - The conclusion must be based purely on the data, not on your personal point of view, and must restate the crucial findings. The discussion must list the factors that help interpret the crucial findings as well as possible causes of bias. It should also address the quality of the evidence, the completeness of the evidence, and the possibility of bias.

 

Now that you know how to craft a meta-analysis paper, roll up your sleeves and write a paper that is apt and well structured. Remember, if crafted in a flawless manner, it can significantly contribute to the existing knowledge of your study area.

Data, often called the backbone of research, is the base on which the entire research depends. After months of backbreaking work, scholars gather huge amounts of data (relevant as well as irrelevant). The collected data must be integrated and organised in an apt manner, and this can be done through the data analysis process. Data analysis, the process of evaluating data using statistical and analytical tools to obtain meaningful information, is an ordeal in every scholar's life.

It is an integral part of research: it breaks complex problems down into simple ones, provides a theoretical base to the study, and lends credibility to the researched data. Today, there are several tools/software available that let you conduct data analysis, including SPSS, Excel, Tableau Public, and many more. Among these, one of the most popular data analysis tools is the R language, which is widely used in research and academics.

 

R, developed in the 1990s, is a robust language used to evaluate and organise data in research. R offers several statistical techniques such as linear and non-linear modelling, time series analysis, clustering, and more. One of the strengths of R is that it lets you produce quality plots and provides the mathematical symbols and formulae required for statistical tests.
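As a small, hedged illustration of these techniques, the base-R sketch below (using made-up data) fits a linear model, draws a quick plot, inspects a basic time series diagnostic, and runs k-means clustering:

set.seed(1)                             # make the simulated data reproducible
x <- 1:50
y <- 2.5 * x + rnorm(50, sd = 5)        # hypothetical response with noise

fit <- lm(y ~ x)                        # linear modelling
summary(fit)

plot(x, y)                              # quick plot of the raw data
abline(fit, col = "red")                # overlay the fitted line

acf(ts(y, frequency = 12))              # autocorrelation, a basic time series diagnostic

km <- kmeans(cbind(x, y), centers = 3)  # k-means clustering
plot(x, y, col = km$cluster)            # colour the points by cluster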

 

R provides an integrated suite of software facilities for data manipulation, calculation, and graphical display. The environment includes:

 

  • A data handling and storage facility
  • Operators for calculations on arrays
  • A coherent, integrated collection of intermediate tools for data analysis
  • A simple and well-developed programming language with conditionals, loops, and input/output facilities

Of the several benefits of the R programming language for data analysis, a few are:

  1. Missing values - Real data contains missing values, and missing values are a first-class part of the R language. Many R functions take arguments (such as na.rm) that control how missing values are handled (see the short sketch after this list).
  2. Interactive language - Data analysis is an interactive process: what you see at one stage determines what you do next, so interactivity is important, and R provides it.
  3. Functions as first-class objects - Functions such as mean and median are objects that can be passed around like data. R lets you switch your analysis from the mean to the median with ease.
  4. Graphics - Graphics are a crucial feature of data analysis, because it is effortless to explore data by drawing the relevant graphs. R has several graphics functions that let you convey the important features of the data.
  5. Data wrangling - R has several packages that simplify the task of preparing data for analysis. Data cleaning and transformation are also straightforward in R, which reduces the time spent on them.
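A minimal sketch of a few of these points, using base R and a tiny made-up vector:

x <- c(4.2, 5.1, NA, 6.3, 5.8)      # real data often contains missing values

mean(x)                             # returns NA because of the missing value
mean(x, na.rm = TRUE)               # the na.rm argument controls how NAs are handled

# Functions are first-class objects: swap the summary function without rewriting the analysis
summarise_with <- function(values, f) f(values, na.rm = TRUE)
summarise_with(x, mean)
summarise_with(x, median)

hist(x, main = "Distribution of x") # quick exploratory graphic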

R has numerous features that can help you perform your data analysis easily and accurately. Hence, spend some time learning this language at the initial stage, and you will then be able to clean, manipulate, and analyse data efficiently.

Most often, the repeated measures ANOVA is the first choice among researchers for determining differences between three or more related groups. However, if assumptions such as normal distribution are not met, an alternative test known as the Friedman test is used.

The Friedman test, the non-parametric counterpart of the one-way ANOVA with repeated measures, is used to determine differences between three or more matched or paired groups. The basic ANOVA assumes normally distributed data with corresponding variances; the Friedman test eliminates the assumption of normal distribution. The method ranks the values within each block and then analyses the ranks in each column. The Friedman test is essentially a two-way analysis of variance by ranks for non-parametric data.

In the Friedman test, one variable serves as the treatment/group variable and another as the blocking variable. The dependent variable must be continuous or ordinal (but not necessarily normally distributed) and the independent variable must be categorical (time/condition).

Like any other statistical test, this test too has a few assumptions, including:

  • The data need not be normally distributed
  • There is one group that is measured on three or more occasions
  • The dependent variable must be measured at an ordinal or continuous level
  • The group is a random sample from the population

 

Prior to conducting this test, a researcher must set up hypotheses such as:

  • Null hypothesis - It states that the medians of the groups are equal; simply put, the treatments have no effect
  • Alternative hypothesis - The medians of the groups are not all equal, indicating that there is a difference between the treatments

 

Today, although several tools such as SPSS, SAS, etc. can be used to perform the Friedman test, the most popular tool among researchers is the R language.

So, how do you conduct this test in R?

To conduct this test in R, import the file into R so that you can refer to the variables directly within the data set. Create a matrix or table, fill in the data, and run the test using the friedman.test() command.
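For instance, a minimal sketch using friedman.test() with R's formula interface and a hypothetical long-format data set (one row per subject-condition pair) looks like this:

# Hypothetical data: 6 subjects each measured under 3 conditions
df <- data.frame(
  subject   = factor(rep(1:6, each = 3)),           # blocking variable
  condition = factor(rep(c("t1", "t2", "t3"), 6)),  # treatment variable
  score     = c(5, 7, 9, 4, 6, 8, 6, 7, 9,
                5, 8, 9, 4, 6, 7, 5, 7, 8)
)

friedman.test(score ~ condition | subject, data = df)  # response ~ groups | blocks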

Upon completion of the analysis, the next step is to interpret the results, i.e. to check whether the test is statistically significant or not. To accomplish this, compare the p-value with the significance level.

However, before interpreting the results, it should be noted that the Friedman test ranks the values within each row. As a result, the test is not affected by sources of variability that equally affect all values in a row. Typically, a significance level of 0.05 works well, so we check the value at the 5% significance level.

  • P-value ≤ significance level: If the p-value is less than or equal to the significance level, we can reject the idea that the differences between the columns are the result of random sampling and conclude that at least one column differs from another.

  • P-value > significance level: If the p-value is greater than the significance level, the data do not provide significant evidence to conclude that the overall medians differ. However, this is not the same as stating that all medians are equal (a short snippet after these points illustrates this decision rule).
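Programmatically, the decision rule can be sketched as follows, with a made-up matrix purely for illustration; friedman.test() returns an object whose p.value element holds the p-value:

res <- friedman.test(matrix(rnorm(24), nrow = 6, ncol = 4))  # toy blocks x groups matrix
alpha <- 0.05
if (res$p.value <= alpha) {
  message("Reject H0: at least one group differs")
} else {
  message("No significant evidence that the medians differ")
}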

Although the outcome of the Friedman test tells you whether the groups are significantly different from each other, it does not tell you which groups differ from each other. This is when post hoc analysis for the Friedman test comes into the picture.

The primary goal here is to investigate which pairs of groups are significantly different from each other. If you have N groups, checking all of their pairs requires N(N-1)/2 comparisons, and hence the need to correct for multiple comparisons arises.

The initial step in a post hoc analysis in R is to find out which groups are responsible for the rejection of the null hypothesis. For a simple ANOVA, there is a readily available function, TukeyHSD(), that directly calculates the post hoc comparisons.

This is followed by understanding the output from the test run. In the case of a simple ANOVA, a box plot would be sufficient, but in the case of a repeated measures test, a boxplot approach can be misleading. Therefore, you can consider using two plots: (a) one with parallel coordinates and (b) boxplots of the differences between all pairs.
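A rough sketch of these two plots in base R, assuming a blocks-by-groups matrix r like the one in the example below:

matplot(t(r), type = "l", lty = 1,
        xlab = "Group", ylab = "Response")   # parallel coordinates: one line per block

# Boxplots of the differences between all pairs of groups
pairs_idx <- combn(ncol(r), 2)
diffs <- apply(pairs_idx, 2, function(p) r[, p[1]] - r[, p[2]])
colnames(diffs) <- apply(pairs_idx, 2,
                         function(p) paste(colnames(r)[p], collapse = " - "))
boxplot(diffs, las = 2, ylab = "Difference")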

A glimpse at an example of the Friedman test

Consider an experiment in which 6 persons (blocks) received 6 different diuretics (groups). Here the response is the concentration of Na in human urine, and the observations were recorded after each treatment.

> require(PMCMR)   # library used to perform the Friedman test
> r <- matrix(c(3.88, 5.44, 8.96, 8.25, 4.91, 12.33, 28.58, 31.14, 16.92,
+               24.19, 26.84, 10.91, 25.24, 39.52, 25.45, 16.85, 20.45,
+               28.67, 4.44, 7.94, 4.04, 4.4, 4.23, 4.36, 29.41, 37.72,
+               39.92, 28.23, 28.35, 12, 38.87, 35.12, 39.15, 28.06, 38.23,
+               26.65),
+             nrow = 6,
+             ncol = 6,
+             dimnames = list(1:6, c("a", "b", "g", "h", "i", "j")))
> print(r)
      a     b     g     h     i     j
1  3.88 28.58 25.24  4.44 29.41 38.87
2  5.44 31.14 39.52  7.94 37.72 35.12
3  8.96 16.92 25.45  4.04 39.92 39.15
4  8.25 24.19 16.85  4.40 28.23 28.06
5  4.91 26.84 20.45  4.23 28.35 38.23
6 12.33 10.91 28.67  4.36 12.00 26.65
> friedman.test(r)

	Friedman rank sum test

data:  r
Friedman chi-squared = 23.333, df = 5, p-value = 0.000287

Result: using the Friedman test, χ²(5) = 23.33, p < 0.01.

 

Note: 

Different post hoc tests can be performed using the posthoc.friedman.conover.test() command in the PMCMR package.
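A hedged sketch of that post hoc step, reusing the matrix r from the example above (argument names are assumed from the PMCMR package; its successor, PMCMRplus, offers a similar frdAllPairsConoverTest() function):

require(PMCMR)
posthoc.friedman.conover.test(y = r, p.adjust.method = "holm")  # pairwise comparisons, Holm-adjusted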

If you have got the title of Dr and are now wondering whether to stay in academia or not, while your supervisor constantly bugs you and you have become bored of working in labs, you are not alone. You might think that you are irreplaceable because no one knows your research better than you, but unfortunately you are living in an illusion. There is a huge supply of academic PhD scholars. No matter how much your supervisors brag about the benefits of staying in academia, it is not going to be useful for you. Here are the reasons why you should leave academia after completing your PhD.

Huge Supply of PhD Scholars
According to a 2014 survey, the world is producing more PhDs than ever before. The survey found that 67,449 people qualified as PhDs in the US. Next is the United Kingdom, which pipped India into third place by 720 PhDs; India produced 24,300 PhDs that year.

Many PhDs live under the illusion that the longer they stay in academia, the more valuable they are for industry jobs. According to an article published in The Economist, universities are getting cheap and highly motivated PhDs. The overwhelming supply of PhDs means a reduction in your demand, so don't think that you are irreplaceable.

Very Low Income
Gone are the days when there was an inadequate supply of postdocs and they were in high demand for research. Now universities are riding the wave of an avalanche of qualified PhDs, as they don't need to pay high salaries. Due to the surge in postdocs, a significant reduction in salary has been noted. Studies have found that when it comes to applying for industry jobs, PhDs get only a few per cent higher salary than other degree holders. So lingering in academia is not worth it: it is not going to increase your chances of selection in industry. Leave academia, and instead start expanding your network and applying for industry jobs.

This will cost you money, time, and experience
Every minute you spend in academia after completing your PhD costs you money, time, and experience. If you applied to industry, you would be getting a higher salary. According to NIH guidelines, every postdoc gets a handsome salary hike every year, because they are gaining experience and knowledge. There is little benefit in staying in academia; you will feel stuck in a rut. So move to industry to gain experience. In fact, industry PhDs are often more valued than academic PhDs.

You will lose value in academia over time
You have become a PhD and have joined academia. You may be proud of doing so, but don't forget that the glut of PhDs is rising every year. What does that mean? It means that every year your worth in academia goes down, and if the supply of PhDs keeps rising, your value will approach zero. PhDs are being devalued each year because many new PhDs qualify every year, and most of them opt for academia. Therefore, you should focus on developing your transferable skills and apply for industry jobs.

Data occupies a central position in the research process: a research procedure involves the collection, analysis, and interpretation of data. In the current era, one can easily collect plenty of information from various sources. However, not all the data gathered will be useful and relevant to a study. Researchers should thoroughly inspect the information and decide which of it would help in conducting their study. This is when the data mining process comes into the picture.

 

The data mining process involves filtering and analysing the data. It uses various tools to identify patterns and relationships in the information, which are then used to make valid predictions.

Several techniques have been developed for this process, such as clustering, association, classification, prediction, decision trees, and sequential patterns, and they are widely used in various fields.
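As a brief, hedged illustration of two of these techniques in R (assuming the rpart package and the built-in iris data set), before turning to the dedicated tools below:

library(rpart)   # recursive partitioning: decision trees

data(iris)                                 # built-in example data set
tree <- rpart(Species ~ ., data = iris)    # classification with a decision tree
print(tree)

predict(tree, iris[1:5, ], type = "class") # prediction on a few rows

km <- kmeans(iris[, 1:4], centers = 3)     # clustering the numeric measurements
table(km$cluster, iris$Species)            # compare clusters with the known species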

 

To conduct the data mining process without any hassle, plenty of tools are available in the market. Among them, the most popular software trusted by the research community includes:

 

Weka - This machine learning tool, also called the Waikato Environment for Knowledge Analysis, was built at the University of Waikato in New Zealand. Written in the Java programming language, this software supports significant data mining tasks such as data preprocessing, visualisation, regression, and many more. Additionally, Weka is well suited for data analysis as well as predictive modelling. It consists of algorithms and visualisation software that support the machine learning process, and it operates on the assumption that data is available as a flat file. Weka has a GUI that gives easy access to all its features, and it can connect to SQL databases via database connectivity.

 

Orange - This component-based software aids the data mining and visualisation process. Written in the Python programming language, its components are known as 'widgets'. The widgets range from data visualisation and preprocessing to the evaluation of algorithms and predictive modelling, and they offer features such as displaying data tables, selecting features, reading data, and comparing learning algorithms. Data in Orange is quickly formatted to the desired pattern and can be easily moved around by simply flipping/moving the widgets. Orange also allows the user to make smarter decisions by comparing and analysing the data.

 

KNIME - This tool is considered one of the best integration platforms for data analytics. Operating on the theme of the modular data pipeline, KNIME uses an assembly of nodes to preprocess data for analytics and visualisation. It consists of different data mining and machine learning components embedded together. This tool is popularly used by researchers in the pharmaceutical field. KNIME includes some excellent characteristics, such as quick deployment and scaling efficiency, and it makes predictive analysis accessible even to naive users.

 

Sisense - Considered a well-suited BI tool, Sisense has the potential to manage and process small as well as large amounts of data. Designed especially for non-technical users, this software provides widgets as well as drag-and-drop features. Sisense produces reports that are highly visual and lets you combine data from different sources into a common repository. Further, various widgets can be selected to develop reports in the form of line charts, pie charts, bar graphs, etc., based on the purpose of the study. Reports can be drilled down into simply by clicking to investigate details and comprehensive data.

 

DataMelt - DataMelt, also called DMelt, is a visualisation and computation environment offering an interactive framework for data mining and visualisation. Written in the Java programming language, DMelt is designed mainly for technical users and the science community. It is a multi-platform utility and can work on any operating system that is compatible with the Java Virtual Machine (JVM). DMelt consists of scientific libraries to produce 2D/3D plots and mathematical libraries for curve fitting, random numbers, algorithms, etc. This software can also be utilised for the analysis of large data volumes and for statistical analysis.

 

SAS data mining - SAS, or Statistical Analysis System, is developed by the SAS Institute for the purpose of analytics and data management. This tool can mine data, modify it, handle data from various sources, and conduct statistical analysis. It allows the user to analyse big data and derive precise insights for making timely decisions. SAS offers a graphical UI for non-technical users and is well suited for text mining, data mining, and optimisation. An added advantage of this tool is its highly scalable distributed memory processing architecture.

 

IBM SPSS Modeler - Owned by IBM, this software suite is used for data mining and text analytics to develop predictive models. IBM SPSS Modeler provides a visual interface that lets the user work with data mining algorithms without the need for programming. It offers additional features such as text analytics, entity analytics, etc., and removes unnecessary hardships faced during the data transformation process. It also allows the user to access structured as well as unstructured data and makes it easy to use predictive models.

Data mining tools are important for leveraging existing data. Adopt trusted and relevant tools, use them to their fullest potential, uncover hidden patterns and relationships in the data, and make an impact with your research.