Amino acid dipeptide frequency for Picobirnavirus sp.

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
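As a rough illustration of the calculation described above, the following Python sketch counts dipeptides and compares them with the uniform expectation. It is not the Proteome-pI implementation: the function names, the use of overlapping windows, the rule that pairs never span two proteins, and the mapping of every non-standard letter to X (Xaa) are all assumptions made for this example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
UNIFORM_PER_MILLE = 1000.0 / 400   # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping windows of length 2, counted within each
    protein separately, with any non-standard letter mapped to 'X'.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be shown in red
    if freq_per_mille < expected:
        return "under"  # would be shown in blue
    return "expected"


# Toy input (hypothetical sequences, not the Picobirnavirus proteome).
toy = ["MKAAL", "AAXK"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With the values from the table, `classify(8.059)` (AlaAla) returns "over" and `classify(0.733)` (AlaPro) returns "under".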

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
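The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the example value is the AlaAla entry from the table, and 2.5‰ is the uniform expectation of 0.25% expressed in per mille.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 % equals 10 per mille)."""
    return value_per_mille / 10.0


# AlaAla from the table: 8.059 per mille.
ala_ala = 8.059
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)

# Ratio to the uniform expectation of 2.5 per mille (0.25 %).
fold = ala_ala / 2.5
print(f"{fraction:.6f} {percent:.4f}% {fold:.2f}x")
```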

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.059 ± 1.704
AlaCys: 2.198 ± 0.787
AlaAsp: 7.326 ± 3.69
AlaGlu: 2.93 ± 2.738
AlaPhe: 1.465 ± 1.115
AlaGly: 2.198 ± 0.787
AlaHis: 0.0 ± 0.0
AlaIle: 6.593 ± 1.632
AlaLys: 8.791 ± 4.797
AlaLeu: 10.256 ± 2.24
AlaMet: 3.663 ± 1.157
AlaAsn: 5.128 ± 1.106
AlaPro: 0.733 ± 0.558
AlaGln: 4.396 ± 1.689
AlaArg: 2.93 ± 2.666
AlaSer: 2.198 ± 0.787
AlaThr: 5.128 ± 2.945
AlaVal: 4.396 ± 1.626
AlaTrp: 0.733 ± 0.57
AlaTyr: 1.465 ± 1.115
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.465 ± 1.14
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.733 ± 1.464
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 2.198 ± 1.673
CysLys: 0.0 ± 0.0
CysLeu: 1.465 ± 0.401
CysMet: 0.733 ± 0.57
CysAsn: 0.733 ± 0.558
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.733 ± 0.558
CysThr: 1.465 ± 1.14
CysVal: 0.733 ± 0.57
CysTrp: 0.0 ± 0.0
CysTyr: 0.733 ± 0.57
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.663 ± 0.386
AspCys: 1.465 ± 0.401
AspAsp: 5.861 ± 0.909
AspGlu: 2.93 ± 1.304
AspPhe: 3.663 ± 1.118
AspGly: 2.198 ± 0.813
AspHis: 0.733 ± 0.57
AspIle: 5.861 ± 1.892
AspLys: 1.465 ± 1.14
AspLeu: 4.396 ± 1.575
AspMet: 2.198 ± 1.274
AspAsn: 2.198 ± 1.147
AspPro: 4.396 ± 1.575
AspGln: 2.198 ± 0.813
AspArg: 3.663 ± 1.148
AspSer: 2.93 ± 2.231
AspThr: 4.396 ± 0.735
AspVal: 5.861 ± 1.606
AspTrp: 1.465 ± 0.401
AspTyr: 2.198 ± 1.605
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.663 ± 1.039
GluCys: 0.0 ± 0.0
GluAsp: 3.663 ± 1.148
GluGlu: 0.733 ± 0.57
GluPhe: 2.198 ± 1.274
GluGly: 1.465 ± 1.14
GluHis: 0.733 ± 0.558
GluIle: 3.663 ± 1.564
GluLys: 1.465 ± 0.89
GluLeu: 2.93 ± 0.803
GluMet: 1.465 ± 0.401
GluAsn: 2.93 ± 0.758
GluPro: 0.733 ± 0.57
GluGln: 2.198 ± 1.823
GluArg: 3.663 ± 2.565
GluSer: 4.396 ± 1.689
GluThr: 9.524 ± 8.873
GluVal: 2.198 ± 0.813
GluTrp: 0.733 ± 0.57
GluTyr: 2.93 ± 1.897
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.663 ± 1.2
PheCys: 0.733 ± 0.57
PheAsp: 1.465 ± 1.14
PheGlu: 2.93 ± 0.758
PhePhe: 0.0 ± 0.0
PheGly: 2.93 ± 1.304
PheHis: 1.465 ± 1.46
PheIle: 1.465 ± 1.115
PheLys: 1.465 ± 0.89
PheLeu: 1.465 ± 1.14
PheMet: 2.198 ± 1.843
PheAsn: 1.465 ± 0.401
PhePro: 2.198 ± 1.605
PheGln: 0.733 ± 0.558
PheArg: 0.733 ± 0.57
PheSer: 0.733 ± 0.57
PheThr: 2.198 ± 0.813
PheVal: 3.663 ± 1.564
PheTrp: 0.0 ± 0.0
PheTyr: 0.733 ± 0.558
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.198 ± 1.673
GlyCys: 0.0 ± 0.0
GlyAsp: 4.396 ± 1.575
GlyGlu: 1.465 ± 0.401
GlyPhe: 1.465 ± 1.115
GlyGly: 2.198 ± 1.274
GlyHis: 2.198 ± 1.147
GlyIle: 5.128 ± 1.516
GlyLys: 4.396 ± 1.924
GlyLeu: 4.396 ± 1.719
GlyMet: 2.198 ± 0.787
GlyAsn: 2.198 ± 0.787
GlyPro: 0.733 ± 0.57
GlyGln: 1.465 ± 1.14
GlyArg: 3.663 ± 1.118
GlySer: 4.396 ± 2.461
GlyThr: 2.198 ± 0.813
GlyVal: 2.198 ± 0.813
GlyTrp: 0.733 ± 0.57
GlyTyr: 4.396 ± 2.148
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.733 ± 0.558
HisCys: 0.733 ± 0.57
HisAsp: 0.733 ± 1.464
HisGlu: 1.465 ± 1.46
HisPhe: 1.465 ± 1.14
HisGly: 1.465 ± 1.14
HisHis: 0.0 ± 0.0
HisIle: 2.198 ± 1.673
HisLys: 0.733 ± 0.558
HisLeu: 1.465 ± 1.46
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.465 ± 0.401
HisGln: 0.733 ± 0.558
HisArg: 1.465 ± 0.966
HisSer: 1.465 ± 0.89
HisThr: 0.0 ± 0.0
HisVal: 1.465 ± 1.46
HisTrp: 0.733 ± 0.57
HisTyr: 2.93 ± 1.304
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.93 ± 0.639
IleCys: 0.733 ± 0.558
IleAsp: 2.93 ± 0.803
IleGlu: 3.663 ± 1.407
IlePhe: 1.465 ± 0.401
IleGly: 5.128 ± 1.079
IleHis: 4.396 ± 3.312
IleIle: 2.198 ± 1.245
IleLys: 2.198 ± 0.813
IleLeu: 4.396 ± 2.461
IleMet: 0.733 ± 0.558
IleAsn: 3.663 ± 1.845
IlePro: 7.326 ± 3.69
IleGln: 0.0 ± 0.0
IleArg: 3.663 ± 0.386
IleSer: 5.861 ± 2.878
IleThr: 3.663 ± 1.118
IleVal: 8.059 ± 2.773
IleTrp: 1.465 ± 0.401
IleTyr: 2.93 ± 1.298
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.663 ± 1.407
LysCys: 0.0 ± 0.0
LysAsp: 0.0 ± 0.0
LysGlu: 6.593 ± 5.186
LysPhe: 3.663 ± 1.039
LysGly: 5.128 ± 2.262
LysHis: 0.733 ± 0.57
LysIle: 0.0 ± 0.0
LysLys: 2.198 ± 0.813
LysLeu: 2.198 ± 1.823
LysMet: 0.733 ± 0.797
LysAsn: 2.93 ± 0.803
LysPro: 1.465 ± 1.427
LysGln: 1.465 ± 1.14
LysArg: 4.396 ± 1.689
LysSer: 8.059 ± 1.205
LysThr: 5.861 ± 1.614
LysVal: 2.198 ± 0.787
LysTrp: 0.733 ± 0.558
LysTyr: 3.663 ± 2.849
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.326 ± 4.01
LeuCys: 0.0 ± 0.0
LeuAsp: 5.128 ± 2.277
LeuGlu: 4.396 ± 2.461
LeuPhe: 0.733 ± 0.57
LeuGly: 4.396 ± 2.461
LeuHis: 1.465 ± 1.115
LeuIle: 5.861 ± 2.344
LeuLys: 3.663 ± 2.057
LeuLeu: 5.128 ± 1.236
LeuMet: 2.198 ± 0.599
LeuAsn: 5.128 ± 1.485
LeuPro: 2.93 ± 1.304
LeuGln: 4.396 ± 1.204
LeuArg: 8.791 ± 5.398
LeuSer: 7.326 ± 3.499
LeuThr: 2.93 ± 1.304
LeuVal: 1.465 ± 0.89
LeuTrp: 0.733 ± 0.57
LeuTyr: 2.198 ± 1.147
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.465 ± 1.115
MetCys: 1.465 ± 1.115
MetAsp: 2.198 ± 0.813
MetGlu: 0.733 ± 1.464
MetPhe: 0.733 ± 0.57
MetGly: 2.198 ± 1.852
MetHis: 0.0 ± 0.0
MetIle: 0.733 ± 0.57
MetLys: 3.663 ± 2.225
MetLeu: 1.465 ± 1.954
MetMet: 0.0 ± 0.0
MetAsn: 1.465 ± 0.401
MetPro: 1.465 ± 1.14
MetGln: 1.465 ± 0.401
MetArg: 0.733 ± 0.558
MetSer: 2.198 ± 0.787
MetThr: 2.198 ± 1.147
MetVal: 0.733 ± 0.558
MetTrp: 0.0 ± 0.0
MetTyr: 0.733 ± 0.57
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 7.326 ± 1.516
AsnCys: 0.0 ± 0.0
AsnAsp: 5.861 ± 1.606
AsnGlu: 0.733 ± 0.57
AsnPhe: 1.465 ± 1.115
AsnGly: 2.198 ± 0.787
AsnHis: 1.465 ± 0.401
AsnIle: 3.663 ± 2.76
AsnLys: 5.128 ± 2.692
AsnLeu: 6.593 ± 4.367
AsnMet: 0.0 ± 0.0
AsnAsn: 2.198 ± 0.813
AsnPro: 4.396 ± 0.735
AsnGln: 2.198 ± 1.147
AsnArg: 2.198 ± 1.673
AsnSer: 6.593 ± 2.362
AsnThr: 1.465 ± 1.954
AsnVal: 5.128 ± 1.908
AsnTrp: 2.198 ± 0.787
AsnTyr: 2.93 ± 1.304
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.93 ± 1.569
ProCys: 0.0 ± 0.0
ProAsp: 4.396 ± 1.204
ProGlu: 2.198 ± 0.787
ProPhe: 3.663 ± 1.118
ProGly: 1.465 ± 0.401
ProHis: 0.733 ± 0.57
ProIle: 3.663 ± 1.118
ProLys: 2.93 ± 0.758
ProLeu: 5.861 ± 1.989
ProMet: 0.0 ± 0.0
ProAsn: 2.198 ± 1.245
ProPro: 0.0 ± 0.0
ProGln: 1.465 ± 1.427
ProArg: 1.465 ± 1.115
ProSer: 1.465 ± 0.401
ProThr: 1.465 ± 0.401
ProVal: 2.198 ± 1.673
ProTrp: 0.0 ± 0.0
ProTyr: 2.198 ± 0.813
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.663 ± 2.471
GlnCys: 0.733 ± 0.57
GlnAsp: 2.198 ± 0.787
GlnGlu: 3.663 ± 3.668
GlnPhe: 2.198 ± 1.605
GlnGly: 0.733 ± 0.558
GlnHis: 1.465 ± 0.401
GlnIle: 2.93 ± 1.298
GlnLys: 0.733 ± 0.57
GlnLeu: 1.465 ± 0.401
GlnMet: 0.0 ± 0.0
GlnAsn: 2.93 ± 1.304
GlnPro: 2.198 ± 0.813
GlnGln: 1.465 ± 1.46
GlnArg: 3.663 ± 0.386
GlnSer: 3.663 ± 1.899
GlnThr: 1.465 ± 1.46
GlnVal: 1.465 ± 0.401
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.465 ± 1.14
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.128 ± 0.511
ArgCys: 0.733 ± 0.57
ArgAsp: 3.663 ± 1.899
ArgGlu: 3.663 ± 3.597
ArgPhe: 0.0 ± 0.0
ArgGly: 2.93 ± 0.639
ArgHis: 0.0 ± 0.0
ArgIle: 2.93 ± 0.803
ArgLys: 0.733 ± 0.558
ArgLeu: 2.93 ± 1.345
ArgMet: 0.733 ± 0.57
ArgAsn: 4.396 ± 2.882
ArgPro: 0.0 ± 0.0
ArgGln: 2.198 ± 1.709
ArgArg: 5.128 ± 1.554
ArgSer: 9.524 ± 6.628
ArgThr: 2.198 ± 1.245
ArgVal: 2.93 ± 2.279
ArgTrp: 2.93 ± 2.279
ArgTyr: 4.396 ± 0.735
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.593 ± 1.865
SerCys: 0.0 ± 0.0
SerAsp: 3.663 ± 1.148
SerGlu: 2.198 ± 1.709
SerPhe: 1.465 ± 1.888
SerGly: 4.396 ± 1.204
SerHis: 2.198 ± 0.813
SerIle: 6.593 ± 2.424
SerLys: 5.128 ± 0.511
SerLeu: 4.396 ± 1.497
SerMet: 0.733 ± 1.464
SerAsn: 9.524 ± 7.765
SerPro: 0.733 ± 0.57
SerGln: 3.663 ± 1.503
SerArg: 2.198 ± 1.709
SerSer: 4.396 ± 0.588
SerThr: 6.593 ± 2.368
SerVal: 5.128 ± 0.511
SerTrp: 0.733 ± 0.558
SerTyr: 3.663 ± 1.845
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.593 ± 3.16
ThrCys: 0.0 ± 0.0
ThrAsp: 2.93 ± 1.304
ThrGlu: 3.663 ± 0.386
ThrPhe: 1.465 ± 0.401
ThrGly: 5.128 ± 3.378
ThrHis: 0.733 ± 0.944
ThrIle: 3.663 ± 2.057
ThrLys: 5.128 ± 2.974
ThrLeu: 7.326 ± 1.358
ThrMet: 0.0 ± 0.0
ThrAsn: 3.663 ± 1.2
ThrPro: 2.198 ± 0.787
ThrGln: 3.663 ± 2.498
ThrArg: 2.198 ± 1.749
ThrSer: 2.198 ± 1.843
ThrThr: 5.128 ± 1.767
ThrVal: 2.93 ± 1.345
ThrTrp: 0.0 ± 0.0
ThrTyr: 5.128 ± 2.021
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.593 ± 0.889
ValCys: 2.198 ± 1.605
ValAsp: 3.663 ± 1.118
ValGlu: 5.128 ± 1.154
ValPhe: 1.465 ± 1.427
ValGly: 2.198 ± 0.813
ValHis: 0.0 ± 0.0
ValIle: 5.128 ± 2.081
ValLys: 5.128 ± 2.148
ValLeu: 2.93 ± 0.803
ValMet: 0.733 ± 0.57
ValAsn: 5.128 ± 1.154
ValPro: 3.663 ± 1.503
ValGln: 1.465 ± 1.115
ValArg: 2.198 ± 1.673
ValSer: 2.93 ± 1.721
ValThr: 1.465 ± 1.115
ValVal: 3.663 ± 1.148
ValTrp: 0.733 ± 0.57
ValTyr: 3.663 ± 1.118
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.733 ± 0.558
TrpCys: 0.0 ± 0.0
TrpAsp: 0.733 ± 0.57
TrpGlu: 0.733 ± 0.57
TrpPhe: 0.0 ± 0.0
TrpGly: 0.733 ± 0.57
TrpHis: 0.733 ± 0.57
TrpIle: 0.733 ± 0.57
TrpLys: 0.733 ± 0.558
TrpLeu: 1.465 ± 0.401
TrpMet: 1.465 ± 1.14
TrpAsn: 2.93 ± 1.304
TrpPro: 0.733 ± 0.57
TrpGln: 0.0 ± 0.0
TrpArg: 0.733 ± 0.57
TrpSer: 0.733 ± 0.57
TrpThr: 0.0 ± 0.0
TrpVal: 0.733 ± 0.57
TrpTrp: 1.465 ± 0.401
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.198 ± 0.787
TyrCys: 0.0 ± 0.0
TyrAsp: 2.93 ± 0.758
TyrGlu: 1.465 ± 1.115
TyrPhe: 2.93 ± 1.941
TyrGly: 2.93 ± 1.345
TyrHis: 2.198 ± 1.245
TyrIle: 2.198 ± 1.147
TyrLys: 0.733 ± 0.944
TyrLeu: 3.663 ± 1.2
TyrMet: 5.128 ± 1.049
TyrAsn: 2.93 ± 0.639
TyrPro: 2.93 ± 1.304
TyrGln: 2.93 ± 1.345
TyrArg: 3.663 ± 1.507
TyrSer: 2.198 ± 0.787
TyrThr: 4.396 ± 1.25
TyrVal: 2.93 ± 1.304
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.198 ± 0.787
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1366 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
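A minimal sketch of what a protein-level bootstrap could look like is given below. The source states only that bootstrapping (×100) at the protein level was used; taking the standard deviation of the replicate frequencies as the error, as well as the function names, the fixed seed and the toy sequences, are assumptions for this illustration and may differ from the actual Proteome-pI procedure.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the reported error is the (population) standard
    deviation of the dipeptide frequency across the replicates.
    """
    rng = random.Random(seed)
    values = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_frequency(sample, pair))
    return statistics.pstdev(values)


# Toy proteome of four hypothetical sequences (not the real data).
toy_proteome = ["MKAALAAK", "MAAGT", "MKKLV", "MSTAA"]
err = bootstrap_error(toy_proteome, "AA")
print(round(pair_frequency(toy_proteome, "AA"), 3), "±", round(err, 3))
```

Under this scheme a dipeptide that never occurs in any protein has a frequency of 0 in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table. With only 4 proteins the replicates differ strongly from one another, so large errors relative to the mean are to be expected.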

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski