Question about WGCNA soft thresholding value
3
0
Entering edit mode
@jessewilson13-12171
Last seen 7.9 years ago

I am new to WGCNA. The selection of my soft thresholding value is confusing me. My code is identical to the online tutorials except that I chose to do a signed network. These are the resulting plots

I am not sure why the y axis in the first is negative. My understanding is that a soft thresholding value should be one at which the index curve reaches a relative high value (>0.8). Any advice would be greatly appreciated.

wgcna • 2.9k views
ADD COMMENT
1
Entering edit mode
@jessewilson13-12171
Last seen 7.9 years ago

I am not sure why the image isn't showing up. The scale free fit is essentially all negative and plateaus close to 0 after a soft thresholding value of about 18 or 19.

ADD COMMENT
0
Entering edit mode

Do you mean the SFT.R.sq column of the result returned by pickSoftThreshold isn't it? I can't think a reason how the scale free topology fit could be negative. Could you upload the image?

ADD REPLY
1
Entering edit mode
@peter-langfelder-4469
Last seen 29 days ago
United States

The negative "signed R^2" is negative when your network has more genes with high connectivity than ones with low connectivity (i.e., the regression line for the fit log(n(k))~log(k) has a positive slope). It means your network shows a topology in some ways opposite (more high connectivity than low connectivity genes) to what is normally expected (a lot of low-connetivity gens and fewer high connectivity genes).

Usually, a lot of high-connectivity genes means there is a strong global driver (e.g., you have samples from different tissues or a strong batch effect). Make sure your sample tree doesn't show very strong branches. Also, see WGCNA FAQ (https://labs.genetics.ucla.edu/horvath/CoexpressionNetwork/Rpackages/WGCNA/faq.html) for some comments about heterogeneous data sets and lack of SFT.

ADD COMMENT
1
Entering edit mode
@jessewilson13-12171
Last seen 7.9 years ago

Thanks. My samples were from a 16S library and it was abundance data. The section on RNA-seq data helped a lot. I basically had too many OTUs with zero reads in a majority of the samples. After I cut down on the number of OTUs and took the log of the abundance it worked just like in the tutorial.

ADD COMMENT

Login before adding your answer.

Traffic: 626 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6