Amino acid dipeptide frequency for Barley yellow mosaic virus (isolate China/Yancheng/1998) (BaYMV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
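The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual pipeline: the function names are hypothetical, dipeptides are counted as overlapping windows that do not span protein boundaries, any non-standard letter is mapped to `X` (Xaa), and the "expected" value is taken to be the uniform 1/400.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
UNIFORM_PER_MILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, kept within one protein.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (assumed baseline)."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"    # would be marked red
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"   # would be marked blue
    return "expected"


# Toy input, not real BaYMV sequences.
toy = ["ACDA", "AAC"]
freqs = dipeptide_per_mille(toy)
```

With the values from the table, `classify(6.366)` (AlaAla) gives `"over"` and `classify(0.606)` (AlaCys) gives `"under"`.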

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
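As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and exist only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert per mille to percent (1 per mille = 0.1%)."""
    return value * 0.1


# AlaAla is listed as 6.366 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(6.366)   # about 0.006366
ala_ala_percent = per_mille_to_percent(6.366)     # about 0.6366 %
```

The uniform expectation of 2.5‰ converts to 0.25% in the same way.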

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.366AlaAla: 6.366 ± 0.969
0.606AlaCys: 0.606 ± 0.314
3.941AlaAsp: 3.941 ± 1.46
4.85AlaGlu: 4.85 ± 0.983
3.031AlaPhe: 3.031 ± 0.76
4.244AlaGly: 4.244 ± 0.131
2.425AlaHis: 2.425 ± 0.674
6.366AlaIle: 6.366 ± 0.78
3.334AlaLys: 3.334 ± 0.02
8.791AlaLeu: 8.791 ± 1.271
2.425AlaMet: 2.425 ± 0.092
5.153AlaAsn: 5.153 ± 1.991
3.031AlaPro: 3.031 ± 1.925
3.031AlaGln: 3.031 ± 0.177
5.456AlaArg: 5.456 ± 0.086
8.184AlaSer: 8.184 ± 1.585
5.759AlaThr: 5.759 ± 0.511
4.547AlaVal: 4.547 ± 0.026
1.819AlaTrp: 1.819 ± 0.36
2.728AlaTyr: 2.728 ± 1.5
0.0AlaXaa: 0.0 ± 0.0
Cys
1.516CysAla: 1.516 ± 0.38
0.303CysCys: 0.303 ± 0.157
0.606CysAsp: 0.606 ± 0.314
1.516CysGlu: 1.516 ± 0.203
0.606CysPhe: 0.606 ± 0.269
1.819CysGly: 1.819 ± 0.223
0.303CysHis: 0.303 ± 0.157
0.909CysIle: 0.909 ± 0.471
0.303CysLys: 0.303 ± 0.157
1.819CysLeu: 1.819 ± 0.943
0.0CysMet: 0.0 ± 0.0
0.606CysAsn: 0.606 ± 0.314
0.606CysPro: 0.606 ± 0.314
0.303CysGln: 0.303 ± 0.157
0.303CysArg: 0.303 ± 0.157
0.0CysSer: 0.0 ± 0.0
0.303CysThr: 0.303 ± 0.157
0.606CysVal: 0.606 ± 0.314
0.303CysTrp: 0.303 ± 0.157
0.909CysTyr: 0.909 ± 0.694
0.0CysXaa: 0.0 ± 0.0
Asp
7.578AspAla: 7.578 ± 0.151
1.212AspCys: 1.212 ± 0.046
4.85AspAsp: 4.85 ± 0.4
3.941AspGlu: 3.941 ± 1.46
3.031AspPhe: 3.031 ± 0.989
3.637AspGly: 3.637 ± 0.72
0.909AspHis: 0.909 ± 0.111
3.031AspIle: 3.031 ± 0.406
2.728AspLys: 2.728 ± 0.831
6.669AspLeu: 6.669 ± 1.788
0.606AspMet: 0.606 ± 0.314
1.819AspAsn: 1.819 ± 0.36
1.819AspPro: 1.819 ± 0.36
0.606AspGln: 0.606 ± 0.269
3.637AspArg: 3.637 ± 0.72
3.031AspSer: 3.031 ± 0.76
3.031AspThr: 3.031 ± 0.989
3.941AspVal: 3.941 ± 0.294
1.212AspTrp: 1.212 ± 0.046
0.909AspTyr: 0.909 ± 0.111
0.0AspXaa: 0.0 ± 0.0
Glu
5.456GluAla: 5.456 ± 0.668
0.303GluCys: 0.303 ± 0.157
2.425GluAsp: 2.425 ± 1.257
2.728GluGlu: 2.728 ± 0.831
2.425GluPhe: 2.425 ± 0.674
3.031GluGly: 3.031 ± 0.406
3.637GluHis: 3.637 ± 0.446
2.425GluIle: 2.425 ± 0.674
3.334GluLys: 3.334 ± 1.146
6.062GluLeu: 6.062 ± 0.937
1.212GluMet: 1.212 ± 0.537
3.031GluAsn: 3.031 ± 0.76
2.728GluPro: 2.728 ± 0.334
0.909GluGln: 0.909 ± 0.694
3.941GluArg: 3.941 ± 0.288
3.941GluSer: 3.941 ± 0.288
2.728GluThr: 2.728 ± 0.831
2.728GluVal: 2.728 ± 0.917
0.0GluTrp: 0.0 ± 0.0
0.606GluTyr: 0.606 ± 0.314
0.0GluXaa: 0.0 ± 0.0
Phe
2.728PheAla: 2.728 ± 0.249
0.303PheCys: 0.303 ± 0.157
3.941PheAsp: 3.941 ± 0.877
2.728PheGlu: 2.728 ± 0.334
1.516PhePhe: 1.516 ± 0.203
3.941PheGly: 3.941 ± 0.877
1.212PheHis: 1.212 ± 0.537
3.031PheIle: 3.031 ± 0.76
2.425PheLys: 2.425 ± 1.257
5.759PheLeu: 5.759 ± 0.654
1.212PheMet: 1.212 ± 0.629
2.122PheAsn: 2.122 ± 0.648
2.425PhePro: 2.425 ± 1.074
1.212PheGln: 1.212 ± 1.12
2.425PheArg: 2.425 ± 1.074
4.244PheSer: 4.244 ± 0.131
3.031PheThr: 3.031 ± 0.989
2.425PheVal: 2.425 ± 0.092
0.303PheTrp: 0.303 ± 0.157
3.334PheTyr: 3.334 ± 0.563
0.0PheXaa: 0.0 ± 0.0
Gly
3.637GlyAla: 3.637 ± 2.194
0.606GlyCys: 0.606 ± 0.314
4.547GlyAsp: 4.547 ± 0.557
2.728GlyGlu: 2.728 ± 0.917
2.122GlyPhe: 2.122 ± 0.066
2.728GlyGly: 2.728 ± 0.917
2.425GlyHis: 2.425 ± 0.092
3.334GlyIle: 3.334 ± 0.563
2.425GlyLys: 2.425 ± 1.257
5.759GlyLeu: 5.759 ± 0.511
1.212GlyMet: 1.212 ± 0.046
2.122GlyAsn: 2.122 ± 0.066
0.909GlyPro: 0.909 ± 0.471
1.212GlyGln: 1.212 ± 0.046
4.244GlyArg: 4.244 ± 1.297
5.759GlySer: 5.759 ± 0.072
3.637GlyThr: 3.637 ± 1.303
4.85GlyVal: 4.85 ± 0.766
1.516GlyTrp: 1.516 ± 0.786
1.516GlyTyr: 1.516 ± 0.203
0.0GlyXaa: 0.0 ± 0.0
His
1.819HisAla: 1.819 ± 0.36
0.303HisCys: 0.303 ± 0.157
2.122HisAsp: 2.122 ± 0.066
1.212HisGlu: 1.212 ± 0.046
0.909HisPhe: 0.909 ± 0.471
0.909HisGly: 0.909 ± 0.111
0.606HisHis: 0.606 ± 0.314
0.606HisIle: 0.606 ± 0.269
1.819HisLys: 1.819 ± 0.943
3.031HisLeu: 3.031 ± 0.177
1.212HisMet: 1.212 ± 0.046
1.212HisAsn: 1.212 ± 0.046
0.606HisPro: 0.606 ± 0.314
1.212HisGln: 1.212 ± 0.629
1.516HisArg: 1.516 ± 0.203
2.728HisSer: 2.728 ± 0.917
1.516HisThr: 1.516 ± 0.786
2.122HisVal: 2.122 ± 0.517
0.606HisTrp: 0.606 ± 0.269
1.212HisTyr: 1.212 ± 0.046
0.0HisXaa: 0.0 ± 0.0
Ile
3.637IleAla: 3.637 ± 1.611
1.819IleCys: 1.819 ± 0.36
2.728IleAsp: 2.728 ± 0.249
3.031IleGlu: 3.031 ± 0.406
3.334IlePhe: 3.334 ± 0.563
2.425IleGly: 2.425 ± 0.674
0.909IleHis: 0.909 ± 0.471
2.425IleIle: 2.425 ± 0.491
3.031IleLys: 3.031 ± 0.989
6.062IleLeu: 6.062 ± 0.354
0.606IleMet: 0.606 ± 0.314
1.819IleAsn: 1.819 ± 0.223
3.334IlePro: 3.334 ± 0.02
2.122IleGln: 2.122 ± 0.648
1.212IleArg: 1.212 ± 0.537
4.547IleSer: 4.547 ± 1.192
5.456IleThr: 5.456 ± 1.251
2.728IleVal: 2.728 ± 0.249
0.303IleTrp: 0.303 ± 0.157
2.122IleTyr: 2.122 ± 0.517
0.0IleXaa: 0.0 ± 0.0
Lys
2.728LysAla: 2.728 ± 1.414
0.303LysCys: 0.303 ± 0.426
2.728LysAsp: 2.728 ± 0.249
2.728LysGlu: 2.728 ± 0.249
2.728LysPhe: 2.728 ± 0.831
3.334LysGly: 3.334 ± 1.729
3.941LysHis: 3.941 ± 2.043
2.425LysIle: 2.425 ± 1.257
3.031LysLys: 3.031 ± 1.571
4.547LysLeu: 4.547 ± 1.774
1.212LysMet: 1.212 ± 0.629
1.819LysAsn: 1.819 ± 0.36
1.212LysPro: 1.212 ± 0.046
1.516LysGln: 1.516 ± 0.786
3.637LysArg: 3.637 ± 0.137
3.637LysSer: 3.637 ± 0.72
4.244LysThr: 4.244 ± 1.617
1.516LysVal: 1.516 ± 0.786
0.606LysTrp: 0.606 ± 0.314
1.516LysTyr: 1.516 ± 0.786
0.0LysXaa: 0.0 ± 0.0
Leu
8.487LeuAla: 8.487 ± 0.845
1.212LeuCys: 1.212 ± 0.046
6.062LeuAsp: 6.062 ± 0.354
5.153LeuGlu: 5.153 ± 0.825
5.456LeuPhe: 5.456 ± 2.417
4.244LeuGly: 4.244 ± 1.297
4.244LeuHis: 4.244 ± 1.034
4.547LeuIle: 4.547 ± 0.557
4.85LeuLys: 4.85 ± 1.931
7.881LeuLeu: 7.881 ± 2.325
1.516LeuMet: 1.516 ± 0.786
4.244LeuAsn: 4.244 ± 0.714
3.637LeuPro: 3.637 ± 1.611
6.062LeuGln: 6.062 ± 1.52
7.578LeuArg: 7.578 ± 1.9
6.669LeuSer: 6.669 ± 0.623
7.275LeuThr: 7.275 ± 0.857
5.759LeuVal: 5.759 ± 0.072
1.212LeuTrp: 1.212 ± 0.046
1.212LeuTyr: 1.212 ± 0.537
0.0LeuXaa: 0.0 ± 0.0
Met
1.819MetAla: 1.819 ± 1.388
0.303MetCys: 0.303 ± 0.157
0.606MetAsp: 0.606 ± 0.314
1.212MetGlu: 1.212 ± 0.629
0.606MetPhe: 0.606 ± 0.314
0.606MetGly: 0.606 ± 0.314
0.303MetHis: 0.303 ± 0.157
0.606MetIle: 0.606 ± 0.269
1.212MetLys: 1.212 ± 0.046
2.425MetLeu: 2.425 ± 0.491
0.909MetMet: 0.909 ± 0.471
2.728MetAsn: 2.728 ± 0.831
1.516MetPro: 1.516 ± 0.963
1.516MetGln: 1.516 ± 0.786
2.122MetArg: 2.122 ± 1.1
2.122MetSer: 2.122 ± 0.517
2.122MetThr: 2.122 ± 0.066
1.516MetVal: 1.516 ± 0.203
0.0MetTrp: 0.0 ± 0.0
0.606MetTyr: 0.606 ± 0.314
0.0MetXaa: 0.0 ± 0.0
Asn
4.547AsnAla: 4.547 ± 2.305
0.909AsnCys: 0.909 ± 0.471
1.819AsnAsp: 1.819 ± 0.806
3.334AsnGlu: 3.334 ± 1.186
2.425AsnPhe: 2.425 ± 0.491
3.637AsnGly: 3.637 ± 0.137
0.606AsnHis: 0.606 ± 0.269
3.031AsnIle: 3.031 ± 0.177
3.334AsnLys: 3.334 ± 1.146
5.759AsnLeu: 5.759 ± 2.26
0.606AsnMet: 0.606 ± 0.314
2.425AsnAsn: 2.425 ± 0.092
1.516AsnPro: 1.516 ± 0.203
0.909AsnGln: 0.909 ± 0.471
1.212AsnArg: 1.212 ± 0.046
1.819AsnSer: 1.819 ± 1.388
3.031AsnThr: 3.031 ± 0.406
2.728AsnVal: 2.728 ± 1.414
0.606AsnTrp: 0.606 ± 0.269
1.516AsnTyr: 1.516 ± 0.203
0.0AsnXaa: 0.0 ± 0.0
Pro
3.637ProAla: 3.637 ± 0.137
0.303ProCys: 0.303 ± 0.157
2.122ProAsp: 2.122 ± 0.648
3.334ProGlu: 3.334 ± 0.563
2.728ProPhe: 2.728 ± 0.831
2.122ProGly: 2.122 ± 1.231
1.212ProHis: 1.212 ± 0.046
3.031ProIle: 3.031 ± 0.177
1.819ProLys: 1.819 ± 0.943
2.122ProLeu: 2.122 ± 0.648
1.212ProMet: 1.212 ± 0.537
1.212ProAsn: 1.212 ± 1.12
1.819ProPro: 1.819 ± 1.971
1.212ProGln: 1.212 ± 0.629
2.122ProArg: 2.122 ± 0.648
4.547ProSer: 4.547 ± 1.14
3.334ProThr: 3.334 ± 0.603
2.728ProVal: 2.728 ± 0.917
0.909ProTrp: 0.909 ± 0.694
0.909ProTyr: 0.909 ± 0.471
0.0ProXaa: 0.0 ± 0.0
Gln
5.153GlnAla: 5.153 ± 0.34
0.303GlnCys: 0.303 ± 0.157
0.909GlnAsp: 0.909 ± 0.111
1.516GlnGlu: 1.516 ± 0.203
2.728GlnPhe: 2.728 ± 0.249
2.122GlnGly: 2.122 ± 0.517
0.606GlnHis: 0.606 ± 0.314
1.819GlnIle: 1.819 ± 0.223
1.212GlnLys: 1.212 ± 0.629
1.516GlnLeu: 1.516 ± 0.38
0.606GlnMet: 0.606 ± 0.314
0.606GlnAsn: 0.606 ± 0.314
1.516GlnPro: 1.516 ± 0.963
1.212GlnGln: 1.212 ± 0.629
2.425GlnArg: 2.425 ± 1.074
1.819GlnSer: 1.819 ± 0.223
2.425GlnThr: 2.425 ± 0.092
2.728GlnVal: 2.728 ± 0.249
0.606GlnTrp: 0.606 ± 0.314
1.212GlnTyr: 1.212 ± 0.537
0.0GlnXaa: 0.0 ± 0.0
Arg
3.941ArgAla: 3.941 ± 0.871
0.606ArgCys: 0.606 ± 0.269
2.728ArgAsp: 2.728 ± 0.917
2.728ArgGlu: 2.728 ± 0.917
3.941ArgPhe: 3.941 ± 0.871
3.031ArgGly: 3.031 ± 2.508
1.516ArgHis: 1.516 ± 0.963
3.334ArgIle: 3.334 ± 0.603
4.547ArgLys: 4.547 ± 2.357
4.244ArgLeu: 4.244 ± 0.714
1.819ArgMet: 1.819 ± 0.223
2.425ArgAsn: 2.425 ± 0.092
3.941ArgPro: 3.941 ± 0.288
3.334ArgGln: 3.334 ± 1.729
4.85ArgArg: 4.85 ± 0.4
3.334ArgSer: 3.334 ± 1.186
3.031ArgThr: 3.031 ± 0.177
5.456ArgVal: 5.456 ± 1.834
0.303ArgTrp: 0.303 ± 0.157
0.606ArgTyr: 0.606 ± 0.314
0.0ArgXaa: 0.0 ± 0.0
Ser
6.972SerAla: 6.972 ± 2.797
1.212SerCys: 1.212 ± 0.046
6.972SerAsp: 6.972 ± 0.7
2.122SerGlu: 2.122 ± 0.066
3.941SerPhe: 3.941 ± 0.871
4.85SerGly: 4.85 ± 1.565
0.909SerHis: 0.909 ± 0.111
2.728SerIle: 2.728 ± 0.249
1.819SerLys: 1.819 ± 0.943
7.275SerLeu: 7.275 ± 1.474
2.728SerMet: 2.728 ± 0.249
2.425SerAsn: 2.425 ± 1.074
3.637SerPro: 3.637 ± 0.446
1.516SerGln: 1.516 ± 0.203
3.334SerArg: 3.334 ± 1.186
6.669SerSer: 6.669 ± 1.788
5.153SerThr: 5.153 ± 0.923
6.669SerVal: 6.669 ± 0.543
1.212SerTrp: 1.212 ± 0.046
2.122SerTyr: 2.122 ± 1.1
0.0SerXaa: 0.0 ± 0.0
Thr
5.456ThrAla: 5.456 ± 0.497
0.0ThrCys: 0.0 ± 0.0
3.941ThrAsp: 3.941 ± 1.46
3.334ThrGlu: 3.334 ± 0.02
3.637ThrPhe: 3.637 ± 0.72
2.728ThrGly: 2.728 ± 0.831
0.909ThrHis: 0.909 ± 0.111
3.637ThrIle: 3.637 ± 0.137
4.244ThrLys: 4.244 ± 1.034
6.669ThrLeu: 6.669 ± 0.04
2.122ThrMet: 2.122 ± 0.101
3.637ThrAsn: 3.637 ± 0.137
3.941ThrPro: 3.941 ± 0.294
2.425ThrGln: 2.425 ± 0.674
4.547ThrArg: 4.547 ± 0.557
4.244ThrSer: 4.244 ± 0.452
6.972ThrThr: 6.972 ± 1.048
3.637ThrVal: 3.637 ± 0.137
0.909ThrTrp: 0.909 ± 0.111
2.425ThrTyr: 2.425 ± 1.257
0.0ThrXaa: 0.0 ± 0.0
Val
6.669ValAla: 6.669 ± 0.623
2.122ValCys: 2.122 ± 0.517
3.334ValAsp: 3.334 ± 0.563
3.941ValGlu: 3.941 ± 0.877
2.122ValPhe: 2.122 ± 1.1
4.85ValGly: 4.85 ± 0.183
0.606ValHis: 0.606 ± 0.314
3.941ValIle: 3.941 ± 0.877
2.728ValLys: 2.728 ± 0.334
6.062ValLeu: 6.062 ± 0.937
2.122ValMet: 2.122 ± 0.18
2.728ValAsn: 2.728 ± 0.249
2.728ValPro: 2.728 ± 1.414
2.122ValGln: 2.122 ± 1.231
2.728ValArg: 2.728 ± 0.334
2.728ValSer: 2.728 ± 0.917
4.244ValThr: 4.244 ± 1.034
3.941ValVal: 3.941 ± 0.294
0.606ValTrp: 0.606 ± 0.269
2.122ValTyr: 2.122 ± 0.066
0.0ValXaa: 0.0 ± 0.0
Trp
0.909TrpAla: 0.909 ± 0.471
0.303TrpCys: 0.303 ± 0.157
0.0TrpAsp: 0.0 ± 0.0
0.303TrpGlu: 0.303 ± 0.157
0.909TrpPhe: 0.909 ± 0.694
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.516TrpIle: 1.516 ± 0.203
0.0TrpLys: 0.0 ± 0.0
1.819TrpLeu: 1.819 ± 0.36
0.303TrpMet: 0.303 ± 0.157
2.122TrpAsn: 2.122 ± 0.648
0.606TrpPro: 0.606 ± 0.269
0.303TrpGln: 0.303 ± 0.426
0.606TrpArg: 0.606 ± 0.269
2.122TrpSer: 2.122 ± 1.1
0.606TrpThr: 0.606 ± 0.314
0.606TrpVal: 0.606 ± 0.314
0.303TrpTrp: 0.303 ± 0.426
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.031TyrAla: 3.031 ± 1.571
0.606TyrCys: 0.606 ± 0.269
1.819TyrAsp: 1.819 ± 0.36
1.212TyrGlu: 1.212 ± 0.046
2.122TyrPhe: 2.122 ± 0.517
3.031TyrGly: 3.031 ± 0.989
0.0TyrHis: 0.0 ± 0.0
0.909TyrIle: 0.909 ± 0.471
1.212TyrLys: 1.212 ± 0.046
3.031TyrLeu: 3.031 ± 0.177
0.909TyrMet: 0.909 ± 0.111
1.516TyrAsn: 1.516 ± 0.203
0.909TyrPro: 0.909 ± 0.111
0.303TyrGln: 0.303 ± 0.157
1.819TyrArg: 1.819 ± 0.806
2.122TyrSer: 2.122 ± 0.517
1.819TyrThr: 1.819 ± 0.806
1.212TyrVal: 1.212 ± 0.046
0.0TyrTrp: 0.0 ± 0.0
0.303TyrTyr: 0.303 ± 0.157
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3300 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
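A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's code: the function names are hypothetical, whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 resamples, and the sample standard deviation of those estimates is reported as the error (the exact error statistic used by the database is an assumption here).

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping windows."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not real BaYMV proteins.
toy = ["ACDACA", "AACCAA"]
error_ac = bootstrap_error(toy, "AC")
```

When a proteome has only two proteins, each resample can take only a few distinct compositions, so the estimated errors are coarse.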

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski