Amino acid dipepetide frequency for Beihai tombus-like virus 16

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.032AlaAla: 8.032 ± 2.602
2.295AlaCys: 2.295 ± 0.673
3.442AlaAsp: 3.442 ± 0.751
4.59AlaGlu: 4.59 ± 0.716
2.295AlaPhe: 2.295 ± 1.11
1.721AlaGly: 1.721 ± 0.944
1.147AlaHis: 1.147 ± 0.641
4.59AlaIle: 4.59 ± 2.307
2.869AlaLys: 2.869 ± 0.892
9.18AlaLeu: 9.18 ± 1.691
1.721AlaMet: 1.721 ± 0.454
2.869AlaAsn: 2.869 ± 0.612
4.59AlaPro: 4.59 ± 1.489
3.442AlaGln: 3.442 ± 0.995
7.458AlaArg: 7.458 ± 1.076
5.737AlaSer: 5.737 ± 0.695
5.164AlaThr: 5.164 ± 0.72
2.869AlaVal: 2.869 ± 0.951
0.0AlaTrp: 0.0 ± 0.0
2.869AlaTyr: 2.869 ± 1.448
0.0AlaXaa: 0.0 ± 0.0
Cys
0.574CysAla: 0.574 ± 0.575
0.574CysCys: 0.574 ± 0.32
0.574CysAsp: 0.574 ± 0.672
0.574CysGlu: 0.574 ± 0.32
0.574CysPhe: 0.574 ± 0.672
1.147CysGly: 1.147 ± 0.562
0.574CysHis: 0.574 ± 0.32
1.147CysIle: 1.147 ± 0.641
1.147CysLys: 1.147 ± 0.641
1.147CysLeu: 1.147 ± 0.562
0.0CysMet: 0.0 ± 0.0
1.721CysAsn: 1.721 ± 0.498
1.147CysPro: 1.147 ± 0.407
0.574CysGln: 0.574 ± 0.575
1.147CysArg: 1.147 ± 0.641
4.016CysSer: 4.016 ± 0.732
0.574CysThr: 0.574 ± 0.672
0.0CysVal: 0.0 ± 0.0
1.147CysTrp: 1.147 ± 1.15
2.869CysTyr: 2.869 ± 1.139
0.0CysXaa: 0.0 ± 0.0
Asp
6.885AspAla: 6.885 ± 0.677
0.574AspCys: 0.574 ± 0.32
2.295AspAsp: 2.295 ± 0.222
2.295AspGlu: 2.295 ± 0.815
4.016AspPhe: 4.016 ± 0.732
1.721AspGly: 1.721 ± 0.454
1.147AspHis: 1.147 ± 0.641
1.721AspIle: 1.721 ± 0.454
4.59AspLys: 4.59 ± 1.144
5.737AspLeu: 5.737 ± 0.402
1.147AspMet: 1.147 ± 0.562
2.295AspAsn: 2.295 ± 0.812
0.574AspPro: 0.574 ± 0.575
1.147AspGln: 1.147 ± 0.641
0.0AspArg: 0.0 ± 0.0
3.442AspSer: 3.442 ± 1.564
1.147AspThr: 1.147 ± 1.344
2.869AspVal: 2.869 ± 0.612
0.574AspTrp: 0.574 ± 0.32
3.442AspTyr: 3.442 ± 0.518
0.0AspXaa: 0.0 ± 0.0
Glu
4.016GluAla: 4.016 ± 0.914
1.147GluCys: 1.147 ± 0.807
1.721GluAsp: 1.721 ± 0.498
2.869GluGlu: 2.869 ± 0.237
3.442GluPhe: 3.442 ± 0.909
2.295GluGly: 2.295 ± 0.673
1.721GluHis: 1.721 ± 0.62
2.869GluIle: 2.869 ± 1.302
3.442GluLys: 3.442 ± 0.995
4.016GluLeu: 4.016 ± 0.732
1.721GluMet: 1.721 ± 0.789
1.721GluAsn: 1.721 ± 0.62
1.721GluPro: 1.721 ± 0.454
2.295GluGln: 2.295 ± 1.282
1.721GluArg: 1.721 ± 0.961
5.164GluSer: 5.164 ± 2.88
5.164GluThr: 5.164 ± 0.751
0.574GluVal: 0.574 ± 0.575
0.0GluTrp: 0.0 ± 0.0
1.721GluTyr: 1.721 ± 0.944
0.0GluXaa: 0.0 ± 0.0
Phe
5.164PheAla: 5.164 ± 1.112
0.574PheCys: 0.574 ± 0.672
3.442PheAsp: 3.442 ± 1.564
1.721PheGlu: 1.721 ± 1.23
1.147PhePhe: 1.147 ± 0.407
3.442PheGly: 3.442 ± 1.685
1.721PheHis: 1.721 ± 0.944
1.721PheIle: 1.721 ± 0.454
1.147PheLys: 1.147 ± 0.641
3.442PheLeu: 3.442 ± 0.909
1.147PheMet: 1.147 ± 0.422
1.147PheAsn: 1.147 ± 0.407
0.574PhePro: 0.574 ± 0.32
2.295PheGln: 2.295 ± 1.282
1.721PheArg: 1.721 ± 0.62
2.295PheSer: 2.295 ± 0.917
4.016PheThr: 4.016 ± 2.473
1.147PheVal: 1.147 ± 0.562
1.147PheTrp: 1.147 ± 1.15
1.147PheTyr: 1.147 ± 0.562
0.0PheXaa: 0.0 ± 0.0
Gly
1.721GlyAla: 1.721 ± 0.454
1.147GlyCys: 1.147 ± 0.562
2.869GlyAsp: 2.869 ± 1.139
2.295GlyGlu: 2.295 ± 0.815
2.869GlyPhe: 2.869 ± 1.767
2.869GlyGly: 2.869 ± 0.612
1.721GlyHis: 1.721 ± 0.62
2.869GlyIle: 2.869 ± 1.139
3.442GlyLys: 3.442 ± 0.751
4.016GlyLeu: 4.016 ± 1.102
0.0GlyMet: 0.0 ± 0.299
3.442GlyAsn: 3.442 ± 2.003
2.295GlyPro: 2.295 ± 0.673
1.147GlyGln: 1.147 ± 0.562
4.016GlyArg: 4.016 ± 1.405
3.442GlySer: 3.442 ± 0.909
3.442GlyThr: 3.442 ± 1.249
3.442GlyVal: 3.442 ± 1.222
0.574GlyTrp: 0.574 ± 0.32
0.574GlyTyr: 0.574 ± 0.32
0.0GlyXaa: 0.0 ± 0.0
His
1.721HisAla: 1.721 ± 0.62
0.574HisCys: 0.574 ± 0.672
1.147HisAsp: 1.147 ± 0.641
0.574HisGlu: 0.574 ± 0.32
2.869HisPhe: 2.869 ± 0.612
1.721HisGly: 1.721 ± 0.498
0.0HisHis: 0.0 ± 0.0
1.147HisIle: 1.147 ± 0.562
1.147HisLys: 1.147 ± 0.407
2.869HisLeu: 2.869 ± 0.801
0.0HisMet: 0.0 ± 0.0
1.721HisAsn: 1.721 ± 0.454
2.295HisPro: 2.295 ± 0.812
0.574HisGln: 0.574 ± 0.32
0.574HisArg: 0.574 ± 0.672
2.295HisSer: 2.295 ± 1.282
2.869HisThr: 2.869 ± 0.951
2.295HisVal: 2.295 ± 0.815
0.574HisTrp: 0.574 ± 0.32
1.147HisTyr: 1.147 ± 0.641
0.0HisXaa: 0.0 ± 0.0
Ile
4.016IleAla: 4.016 ± 1.187
2.869IleCys: 2.869 ± 0.951
0.574IleAsp: 0.574 ± 0.575
4.59IleGlu: 4.59 ± 0.846
2.295IlePhe: 2.295 ± 0.673
1.147IleGly: 1.147 ± 0.562
4.016IleHis: 4.016 ± 1.102
2.295IleIle: 2.295 ± 0.812
2.869IleLys: 2.869 ± 0.951
6.885IleLeu: 6.885 ± 1.501
0.0IleMet: 0.0 ± 0.0
2.295IleAsn: 2.295 ± 0.815
2.295IlePro: 2.295 ± 1.123
1.721IleGln: 1.721 ± 0.961
6.311IleArg: 6.311 ± 1.8
4.59IleSer: 4.59 ± 0.458
2.869IleThr: 2.869 ± 1.139
4.016IleVal: 4.016 ± 0.914
1.147IleTrp: 1.147 ± 0.562
2.869IleTyr: 2.869 ± 0.612
0.0IleXaa: 0.0 ± 0.0
Lys
4.59LysAla: 4.59 ± 2.307
1.147LysCys: 1.147 ± 1.344
1.147LysAsp: 1.147 ± 0.407
2.869LysGlu: 2.869 ± 1.448
1.147LysPhe: 1.147 ± 0.807
0.574LysGly: 0.574 ± 0.32
2.295LysHis: 2.295 ± 1.614
5.737LysIle: 5.737 ± 0.402
7.458LysLys: 7.458 ± 4.336
9.18LysLeu: 9.18 ± 2.288
0.574LysMet: 0.574 ± 0.32
3.442LysAsn: 3.442 ± 1.35
4.016LysPro: 4.016 ± 1.556
1.721LysGln: 1.721 ± 0.944
1.721LysArg: 1.721 ± 0.498
4.016LysSer: 4.016 ± 0.732
4.016LysThr: 4.016 ± 0.732
3.442LysVal: 3.442 ± 1.167
1.147LysTrp: 1.147 ± 0.407
1.147LysTyr: 1.147 ± 0.641
0.0LysXaa: 0.0 ± 0.0
Leu
5.164LeuAla: 5.164 ± 1.048
0.0LeuCys: 0.0 ± 0.0
4.59LeuAsp: 4.59 ± 0.458
5.164LeuGlu: 5.164 ± 1.584
4.016LeuPhe: 4.016 ± 1.405
5.737LeuGly: 5.737 ± 1.299
1.721LeuHis: 1.721 ± 0.498
5.737LeuIle: 5.737 ± 0.695
6.885LeuLys: 6.885 ± 1.517
10.327LeuLeu: 10.327 ± 2.923
0.574LeuMet: 0.574 ± 0.32
2.295LeuAsn: 2.295 ± 1.11
8.606LeuPro: 8.606 ± 2.647
8.606LeuGln: 8.606 ± 2.488
3.442LeuArg: 3.442 ± 1.35
5.737LeuSer: 5.737 ± 1.061
6.311LeuThr: 6.311 ± 1.375
5.737LeuVal: 5.737 ± 1.299
0.574LeuTrp: 0.574 ± 0.575
2.295LeuTyr: 2.295 ± 1.858
0.0LeuXaa: 0.0 ± 0.0
Met
1.721MetAla: 1.721 ± 0.961
0.0MetCys: 0.0 ± 0.0
1.147MetAsp: 1.147 ± 0.641
0.574MetGlu: 0.574 ± 0.32
0.0MetPhe: 0.0 ± 0.0
0.574MetGly: 0.574 ± 0.575
0.574MetHis: 0.574 ± 0.575
0.574MetIle: 0.574 ± 0.32
0.574MetLys: 0.574 ± 0.575
2.295MetLeu: 2.295 ± 0.222
0.0MetMet: 0.0 ± 0.0
0.574MetAsn: 0.574 ± 0.32
1.147MetPro: 1.147 ± 0.641
0.574MetGln: 0.574 ± 0.32
0.574MetArg: 0.574 ± 0.32
1.721MetSer: 1.721 ± 1.197
1.147MetThr: 1.147 ± 1.15
0.0MetVal: 0.0 ± 0.0
0.574MetTrp: 0.574 ± 0.575
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
6.311AsnAla: 6.311 ± 0.887
0.574AsnCys: 0.574 ± 0.575
1.721AsnAsp: 1.721 ± 0.454
0.574AsnGlu: 0.574 ± 0.575
2.869AsnPhe: 2.869 ± 1.302
2.869AsnGly: 2.869 ± 0.892
0.574AsnHis: 0.574 ± 0.672
4.59AsnIle: 4.59 ± 2.563
2.295AsnLys: 2.295 ± 1.123
3.442AsnLeu: 3.442 ± 0.909
1.147AsnMet: 1.147 ± 0.407
0.574AsnAsn: 0.574 ± 0.672
3.442AsnPro: 3.442 ± 0.518
0.0AsnGln: 0.0 ± 0.0
1.147AsnArg: 1.147 ± 0.807
2.869AsnSer: 2.869 ± 1.335
2.869AsnThr: 2.869 ± 1.448
1.721AsnVal: 1.721 ± 0.961
0.574AsnTrp: 0.574 ± 0.32
1.721AsnTyr: 1.721 ± 0.62
0.0AsnXaa: 0.0 ± 0.0
Pro
3.442ProAla: 3.442 ± 1.222
1.147ProCys: 1.147 ± 0.641
2.295ProAsp: 2.295 ± 1.282
1.147ProGlu: 1.147 ± 0.562
1.147ProPhe: 1.147 ± 0.407
2.869ProGly: 2.869 ± 0.801
4.016ProHis: 4.016 ± 1.556
2.295ProIle: 2.295 ± 0.917
6.311ProLys: 6.311 ± 1.658
2.869ProLeu: 2.869 ± 0.237
1.147ProMet: 1.147 ± 0.641
1.147ProAsn: 1.147 ± 1.344
3.442ProPro: 3.442 ± 1.922
3.442ProGln: 3.442 ± 1.35
1.721ProArg: 1.721 ± 0.62
8.032ProSer: 8.032 ± 3.112
9.753ProThr: 9.753 ± 1.268
4.016ProVal: 4.016 ± 0.244
1.147ProTrp: 1.147 ± 0.641
1.147ProTyr: 1.147 ± 0.807
0.0ProXaa: 0.0 ± 0.0
Gln
2.295GlnAla: 2.295 ± 0.222
0.574GlnCys: 0.574 ± 0.32
2.869GlnAsp: 2.869 ± 0.612
3.442GlnGlu: 3.442 ± 0.518
1.147GlnPhe: 1.147 ± 0.407
1.147GlnGly: 1.147 ± 0.641
0.574GlnHis: 0.574 ± 0.32
2.869GlnIle: 2.869 ± 0.237
0.574GlnLys: 0.574 ± 0.32
4.016GlnLeu: 4.016 ± 0.244
0.0GlnMet: 0.0 ± 0.0
3.442GlnAsn: 3.442 ± 0.518
2.869GlnPro: 2.869 ± 0.892
5.164GlnGln: 5.164 ± 0.628
2.295GlnArg: 2.295 ± 0.815
4.016GlnSer: 4.016 ± 0.732
2.295GlnThr: 2.295 ± 1.123
1.721GlnVal: 1.721 ± 0.62
0.574GlnTrp: 0.574 ± 0.32
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.869ArgAla: 2.869 ± 0.801
1.721ArgCys: 1.721 ± 2.017
5.737ArgAsp: 5.737 ± 0.695
4.59ArgGlu: 4.59 ± 0.443
1.147ArgPhe: 1.147 ± 0.562
2.869ArgGly: 2.869 ± 0.892
2.295ArgHis: 2.295 ± 0.812
2.295ArgIle: 2.295 ± 1.123
2.295ArgLys: 2.295 ± 0.222
1.721ArgLeu: 1.721 ± 0.62
1.147ArgMet: 1.147 ± 0.407
2.295ArgAsn: 2.295 ± 0.222
3.442ArgPro: 3.442 ± 1.222
1.147ArgGln: 1.147 ± 0.407
2.295ArgArg: 2.295 ± 1.858
3.442ArgSer: 3.442 ± 1.24
1.721ArgThr: 1.721 ± 0.498
1.147ArgVal: 1.147 ± 0.407
0.574ArgTrp: 0.574 ± 0.575
1.721ArgTyr: 1.721 ± 1.23
0.0ArgXaa: 0.0 ± 0.0
Ser
6.311SerAla: 6.311 ± 1.03
2.295SerCys: 2.295 ± 0.673
2.295SerAsp: 2.295 ± 0.673
1.147SerGlu: 1.147 ± 0.407
4.59SerPhe: 4.59 ± 0.458
8.032SerGly: 8.032 ± 1.827
1.147SerHis: 1.147 ± 0.641
5.737SerIle: 5.737 ± 1.061
2.869SerLys: 2.869 ± 0.237
8.032SerLeu: 8.032 ± 0.527
1.147SerMet: 1.147 ± 0.562
2.869SerAsn: 2.869 ± 0.951
8.606SerPro: 8.606 ± 1.128
3.442SerGln: 3.442 ± 1.685
2.869SerArg: 2.869 ± 0.612
16.064SerSer: 16.064 ± 2.522
7.458SerThr: 7.458 ± 2.01
6.885SerVal: 6.885 ± 0.677
0.0SerTrp: 0.0 ± 0.0
4.016SerTyr: 4.016 ± 1.187
0.0SerXaa: 0.0 ± 0.0
Thr
6.311ThrAla: 6.311 ± 2.456
1.721ThrCys: 1.721 ± 0.454
2.295ThrAsp: 2.295 ± 1.123
4.016ThrGlu: 4.016 ± 0.828
1.721ThrPhe: 1.721 ± 0.62
2.869ThrGly: 2.869 ± 1.067
0.574ThrHis: 0.574 ± 0.32
5.164ThrIle: 5.164 ± 1.493
5.164ThrLys: 5.164 ± 1.284
8.032ThrLeu: 8.032 ± 0.487
1.147ThrMet: 1.147 ± 0.407
2.869ThrAsn: 2.869 ± 0.237
5.164ThrPro: 5.164 ± 1.615
2.295ThrGln: 2.295 ± 0.222
2.295ThrArg: 2.295 ± 2.3
9.753ThrSer: 9.753 ± 0.937
6.311ThrThr: 6.311 ± 1.985
3.442ThrVal: 3.442 ± 0.909
0.0ThrTrp: 0.0 ± 0.0
3.442ThrTyr: 3.442 ± 1.24
0.0ThrXaa: 0.0 ± 0.0
Val
2.869ValAla: 2.869 ± 0.892
2.295ValCys: 2.295 ± 0.222
3.442ValAsp: 3.442 ± 1.249
3.442ValGlu: 3.442 ± 1.222
1.147ValPhe: 1.147 ± 0.807
2.869ValGly: 2.869 ± 0.237
1.721ValHis: 1.721 ± 0.454
2.869ValIle: 2.869 ± 1.602
1.147ValLys: 1.147 ± 1.15
2.295ValLeu: 2.295 ± 0.917
1.147ValMet: 1.147 ± 0.407
1.147ValAsn: 1.147 ± 0.641
4.016ValPro: 4.016 ± 1.187
1.147ValGln: 1.147 ± 0.407
4.016ValArg: 4.016 ± 1.735
5.737ValSer: 5.737 ± 1.603
2.869ValThr: 2.869 ± 1.602
1.147ValVal: 1.147 ± 0.807
0.574ValTrp: 0.574 ± 0.672
2.295ValTyr: 2.295 ± 1.11
0.0ValXaa: 0.0 ± 0.0
Trp
0.574TrpAla: 0.574 ± 0.575
0.0TrpCys: 0.0 ± 0.0
0.574TrpAsp: 0.574 ± 0.672
1.147TrpGlu: 1.147 ± 0.562
0.574TrpPhe: 0.574 ± 0.32
1.147TrpGly: 1.147 ± 0.407
0.0TrpHis: 0.0 ± 0.0
1.147TrpIle: 1.147 ± 0.641
0.0TrpLys: 0.0 ± 0.0
1.147TrpLeu: 1.147 ± 0.407
0.0TrpMet: 0.0 ± 0.0
1.147TrpAsn: 1.147 ± 0.407
0.574TrpPro: 0.574 ± 0.32
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.147TrpSer: 1.147 ± 0.407
0.574TrpThr: 0.574 ± 0.575
1.147TrpVal: 1.147 ± 1.15
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.295TyrAla: 2.295 ± 0.222
0.0TyrCys: 0.0 ± 0.0
3.442TyrAsp: 3.442 ± 1.167
1.721TyrGlu: 1.721 ± 1.37
1.147TyrPhe: 1.147 ± 0.562
1.147TyrGly: 1.147 ± 0.641
0.574TyrHis: 0.574 ± 0.32
2.295TyrIle: 2.295 ± 1.11
5.164TyrLys: 5.164 ± 1.526
2.869TyrLeu: 2.869 ± 1.139
0.0TyrMet: 0.0 ± 0.0
2.869TyrAsn: 2.869 ± 0.612
1.721TyrPro: 1.721 ± 0.498
1.147TyrGln: 1.147 ± 0.641
1.147TyrArg: 1.147 ± 0.562
2.295TyrSer: 2.295 ± 0.222
4.016TyrThr: 4.016 ± 0.828
0.574TyrVal: 0.574 ± 0.32
0.0TyrTrp: 0.0 ± 0.0
2.869TyrTyr: 2.869 ± 1.067
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1744 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski