亚洲免费av电影一区二区三区,日韩爱爱视频,51精品视频一区二区三区,91视频爱爱,日韩欧美在线播放视频,中文字幕少妇AV,亚洲电影中文字幕,久久久久亚洲av成人网址,久久综合视频网站,国产在线不卡免费播放

        ?

        Deep Learning Deciphers Protein—RNA Interaction

        2019-03-07 07:27:24MingLi
        Genomics,Proteomics & Bioinformatics 2019年5期

        Ming Li

        1 David R.Cheriton School of Computer Science,University of Waterloo,Waterloo,Ontario N2L 3G1,Canada

        2 Ningbo Institute of Information Technology Application,Chinese Academy of Sciences,Ningbo 315000,China

        Background

        Protein—RNA interaction is ubiquitous in cells and serves as the main mechanism for post-transcriptional regulation.RNA binding proteins(RBPs)not only control which transcripts are translated,but also determine the speed,location,and concentration of mRNA translation,through controlling multiple layers of gene regulation.Base-dominant interaction and backbone-dominant interaction categorize the two main modes of the way RBPs interact with RNA.

        There are mainly two approaches to understand protein—RNA interaction:experimental techniques and computational methods(Table 1).The former includes high-throughput assays,such as in vitro(e.g.,RNAcompete)and in vivo(e.g.,CLIP-HITS)assays,and structural biology approach.However,both technologies have clear limitations:the assay experiments can reveal statistical patterns(e.g.,sequence logos)of the binding RNAs to an RBP,but cannot elucidate where and how the RNA interacts with the RBP,whereas the structural biology approach can only capture a snapshot of a specified RNA binding to the RBP,without revealing any statistical property.The computational approach,on the other hand,is still at the early development stage.All the existing machine learning-based methods try to make binary predictions.That is,the state-of-the-art resolution is on predicting if a residue of an RBPis a binding residue or not.Such methods often have high false positive rates,even for known RBPs.For a previously unknown RBP,the predictions from the existing machine learning-based methods and docking-based methods are even lessreliable,which hamper their applications in guiding the downstream experimental design.

        NucleicNet—RNA-constituent level predictor of protein—RNA interaction through deep learning

        The new tool,NucleicNet[1],developed by Gao Lab(http://sfb.kaust.edu.sa)at the King Abdullah University of Science and Technology(KAUST),in collaboration with groups in China and USA,is the first-of-its-kind to predict protein—RNA interactions at the RNA-constituent resolution.They formulated the problem as a seven-class classification problem,where thelabel space includesnon-site,ribose,phosphate,and four different bases.

        For any deep learning approach,data are the most critical component.Lam et al.composed a dataset that contains all the solved protein—RNA complex structures in the Protein Data Bank(PDB),and carefully removed the redundant structuresand redundant chains,which resulted in a stringent dataset of 175 RNA-binding protein chains.The surface grid points are extracted and the labels are assigned to the grid points by considering the nearest RNA constituents or assigned 0 if the grid point is outside the bound RNA.The FEATURE framework is applied on a contour-manner to extract spatial,physicochemical properties of the local proteinsurface environment.A 16-level residual network is trained to learn the mapping between the input features and the seven classes.To optimize the problem in a more efficient way,they applied a number of techniques,such as down-sampling of the negative dataset,hierarchical classification,batch normalization,and weight decay.

        Table 1 Summary of the properties of different experimental and computational techniques for protein—RNA interaction studies

        The authors tested NucleicNet on various tasks,starting from the traditional binary classification,i.e.,to predict if a surface residue is an RNA-binding residue or not.Although NucleicNet was trained on the 7-class classification task,when rounding the prediction results to binding/nonbinding,it can still outperform all the sequence-based predictors by a large margin.They then evaluated 7-class classification performance,to which there is no precedent method to compare,through both micro-and macro-performance measures,where‘micro’is sample-averaged and‘macro’is class-averaged.

        Figure 1 Three case studies demonstrating the ability of NucleicNet to predict protein—RNA interactions

        In addition to statistical evaluation,they showed three case studies to demonstrate NucleicNet’s power in revealing complex spatial patterns:Fem-3-binding-factor 2(FBF2),which binds to RNA through base contacts,human Argonaute 2(hAgo2),which binds to RNA through backbone,and Aquifex aeolicus ribonuclease III(Aa-RNase III),which binds to double-stranded RNA(Figure 1).For FBF2(Figure 1A),Nucleic-Net successfully recovered the strong UGUR motif.Interestingly,NucleicNet captures the modest preference for A or U at base 9,which is consistent with recent reports[2,3],while the complex structure solved in PDB has C at that base.This indicates that NucleicNet can really capture the physicochemical mechanisms in protein—RNA interaction through mining from big structural biology data.For both hAgo2(Figure 1B)and Aa-RNase III(Figure1C),NucleicNet was ableto capture well-known patterns,as well as recently-reported patterns.In all three cases,NucleicNet correctly predicted the binding pockets on the protein surface in an unbiased way,which demonstrates its ability to predict novel RBPs and their binding pockets.

        They further validated NucleicNet on in vitro and in vivo assay data.On the in vitro RNACompete datasets,NucleicNet scores on different RNA-binding sequences showed a remarkable level of agreement with the RNACompete position weight matrix scores.NucleicNet was also able to differentiate top scoring sequences from the bottom scoring ones.On the in vivo Ago2 immunoprecipitation and siRNA knockdown datasets from different cell lines of both humans and mice,NucleicNet correctly predicted asymmetry in guide strand loading for majority of the cases.These results become even more significant when considering the fact that NucleicNet was never trained on any assay data,and yet it reached a remarkable level of consistency with such high-throughput experiments.

        In the case of known RBPs,NucleicNet can be applied to score any given RNA-binding sequence,design the most preferred binding sequence,and draw sequence logos.In the case of proteinswith unknown RNA-binding functions,NucleicNet can be applied to check if the protein has a proper RNA-binding site,and if so,what the preferred RNA-binding sequences are.They further provided a webserver for the community to use NucleicNet.

        Discussion

        The aforementioned experiments demonstrated the ability of NucleicNet to capture both the statistical and physicochemical properties underlying protein—RNA interactions.The deep learning model clearly does much more than‘memorizing’the training data.NucleicNet can be potentially applied to understand thebinding mechanism and design RNAsfor some important RBPs,such as Ago2,m6A-responsive RBPs,and RBPs for single guide RNA(sgRNA)in the CRISPR-Cas9 system.

        Despite the success of NucleicNet,there are two future directions.First,NucleicNet does not consider the conformational change caused by protein—RNA interaction.Thus the input apo structure may undergo a large-scale conformational change to accommodate thebinding of the RNA,which would cause the extracted physicochemical features to be imprecise.Second,the idea of NucleicNet can be naturally transferred to model other interactions,such as protein—DNA interaction,protein—drug interaction,and protein—ligand binding.Finally,some ablation studies might help simplify the network.

        Competing interests

        The author has declared no competing interests.

        Acknowledgments

        This work was partially supported by the National Key R&D Program of China(Grant No.2016YFB1000902)and the National Natural Science Foundation of China(Grant No.61832019).

        在线不卡精品免费视频| 欧美国产日韩a在线视频| 亚洲综合久久久| 国产麻豆精品久久一二三| 最新中文字幕亚洲一区| 久久亚洲欧美国产精品| 亚洲一区av无码少妇电影| 911国产在线观看精品| 在线视频自拍视频激情| 国产一区二区三区久久精品| 大肉大捧一进一出好爽视色大师| 美国黄色片一区二区三区| 国产在线看不卡一区二区| 国产精品一区二区三区自拍| 和外国人做人爱视频| 人妻熟妇乱系列| 视频在线播放观看免费| 免费观看国产短视频的方法| 好日子在线观看视频大全免费动漫 | 国产激情自拍在线视频| 亚洲av久久久噜噜噜噜| 精品国产一区二区三区av麻| 情人伊人久久综合亚洲 | 亚洲av无码国产精品色软件下戴 | 国产一区,二区,三区免费视频| 久久精品国产在热亚洲不卡| 日本顶级metart裸体全部| 久久久久久人妻一区二区三区| 久久狠色噜噜狠狠狠狠97| 亚洲精品综合久久中文字幕 | 久久成人成狠狠爱综合网| 欧美两根一起进3p做受视频| 亚洲av成人在线网站| 日韩一本之道一区中文字幕| 精品久久久久久久无码人妻热| 欧美黑人又粗又大久久久| 蜜桃网站在线免费观看视频| 肥老熟女性强欲五十路| 欧美极品少妇无套实战| 亚洲女同成av人片在线观看| 一个人午夜观看在线中文字幕|