Amino acid dipeptide frequency for Picobirnavirus Equ3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
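As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs in a list of one-letter protein sequences and reports them in per mille. It is not the database's own pipeline: the function name `dipeptide_permille`, the choice to count pairs within each protein separately, and the mapping of every non-standard letter to `X` (Xaa) are assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs.

    Hypothetical helper: pairs are counted within each sequence, and any
    letter outside the 20 standard residues is treated as X (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Slide a window of length 2 along the sequence.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input, not real Picobirnavirus sequences.
toy = ["MKKLAA", "AAGB"]
freqs = dipeptide_permille(toy)

# Uniform expectation for comparison: 1/400 or, with Xaa, 1/441.
expected_400 = 1000.0 / 400
expected_441 = 1000.0 / 441
```

With real data, `sequences` would hold the proteins of the proteome, and each resulting value could be compared against `expected_400`.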

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
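The unit conversion and the comparison with the uniform expectation can be sketched in a few lines. The helper `classify` and its labels are hypothetical, and the exact rule the page uses for its red and blue marking is not stated, so the simple greater-than or less-than test below is only an assumption.

```python
# Uniform expectation over 400 dipeptides: 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Convert a per mille value to a fraction and compare it with the expectation.

    Hypothetical helper; the real colouring rule may use a different threshold.
    """
    fraction = value_permille * 1e-3  # per mille -> fraction
    if value_permille > expected:
        label = "over"
    elif value_permille < expected:
        label = "under"
    else:
        label = "as expected"
    return fraction, label


# Two values taken from the table: AlaAla (8.962) and CysAla (0.747).
ala_ala = classify(8.962)
cys_ala = classify(0.747)
```

For example, 8.962‰ corresponds to a fraction of about 0.008962, which is above the 0.0025 expected under uniformity.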

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.962 ± 5.857
AlaCys: 0.0 ± 0.0
AlaAsp: 2.987 ± 1.688
AlaGlu: 5.975 ± 4.311
AlaPhe: 3.734 ± 0.093
AlaGly: 5.975 ± 3.105
AlaHis: 0.747 ± 0.485
AlaIle: 7.468 ± 3.255
AlaLys: 5.228 ± 0.899
AlaLeu: 8.962 ± 2.229
AlaMet: 2.24 ± 0.969
AlaAsn: 3.734 ± 2.571
AlaPro: 2.987 ± 0.488
AlaGln: 5.228 ± 1.705
AlaArg: 5.228 ± 3.418
AlaSer: 1.494 ± 0.462
AlaThr: 7.468 ± 1.682
AlaVal: 5.975 ± 1.754
AlaTrp: 0.747 ± 0.485
AlaTyr: 4.481 ± 1.938
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.747 ± 0.485
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.747 ± 0.485
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.747 ± 0.485
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 1.494 ± 0.462
CysArg: 0.747 ± 0.514
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.747 ± 0.514
CysTrp: 0.0 ± 0.0
CysTyr: 0.747 ± 0.514
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.734 ± 1.336
AspCys: 0.0 ± 0.0
AspAsp: 2.987 ± 0.924
AspGlu: 2.24 ± 0.796
AspPhe: 2.987 ± 1.942
AspGly: 3.734 ± 0.841
AspHis: 2.24 ± 1.456
AspIle: 2.24 ± 0.848
AspLys: 3.734 ± 0.093
AspLeu: 5.975 ± 2.65
AspMet: 1.494 ± 0.971
AspAsn: 0.0 ± 0.0
AspPro: 3.734 ± 1.726
AspGln: 1.494 ± 0.844
AspArg: 2.24 ± 0.848
AspSer: 2.24 ± 0.848
AspThr: 5.228 ± 0.982
AspVal: 4.481 ± 1.592
AspTrp: 0.747 ± 0.485
AspTyr: 2.24 ± 0.969
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.721 ± 5.42
GluCys: 0.0 ± 0.0
GluAsp: 2.24 ± 1.543
GluGlu: 3.734 ± 2.571
GluPhe: 4.481 ± 1.592
GluGly: 3.734 ± 1.699
GluHis: 0.0 ± 0.0
GluIle: 2.987 ± 1.43
GluLys: 2.987 ± 1.235
GluLeu: 5.975 ± 2.105
GluMet: 0.747 ± 0.514
GluAsn: 5.975 ± 0.976
GluPro: 1.494 ± 0.941
GluGln: 3.734 ± 0.94
GluArg: 2.987 ± 1.883
GluSer: 2.987 ± 0.596
GluThr: 4.481 ± 0.437
GluVal: 5.975 ± 0.757
GluTrp: 0.747 ± 0.514
GluTyr: 5.228 ± 1.455
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.494 ± 0.971
PheCys: 0.0 ± 0.0
PheAsp: 3.734 ± 1.266
PheGlu: 2.24 ± 0.848
PhePhe: 0.747 ± 0.514
PheGly: 3.734 ± 1.266
PheHis: 0.747 ± 0.485
PheIle: 2.24 ± 0.796
PheLys: 0.0 ± 0.0
PheLeu: 5.228 ± 2.165
PheMet: 2.24 ± 0.542
PheAsn: 0.747 ± 0.485
PhePro: 2.24 ± 0.796
PheGln: 0.747 ± 0.485
PheArg: 2.24 ± 0.546
PheSer: 1.494 ± 0.462
PheThr: 3.734 ± 1.208
PheVal: 2.987 ± 1.235
PheTrp: 0.747 ± 0.485
PheTyr: 0.747 ± 0.514
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.721 ± 4.098
GlyCys: 0.747 ± 0.514
GlyAsp: 3.734 ± 1.266
GlyGlu: 5.228 ± 1.831
GlyPhe: 2.987 ± 1.942
GlyGly: 5.228 ± 3.436
GlyHis: 0.747 ± 0.514
GlyIle: 5.975 ± 1.754
GlyLys: 0.747 ± 0.485
GlyLeu: 4.481 ± 1.507
GlyMet: 1.494 ± 1.028
GlyAsn: 3.734 ± 0.841
GlyPro: 0.0 ± 0.0
GlyGln: 0.747 ± 0.514
GlyArg: 2.24 ± 0.796
GlySer: 4.481 ± 1.092
GlyThr: 3.734 ± 1.451
GlyVal: 5.975 ± 1.858
GlyTrp: 3.734 ± 3.771
GlyTyr: 3.734 ± 1.822
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.747 ± 1.007
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.747 ± 0.514
HisPhe: 0.747 ± 0.485
HisGly: 2.24 ± 0.848
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 1.494 ± 0.844
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 2.987 ± 2.773
HisPro: 2.24 ± 0.848
HisGln: 0.0 ± 0.0
HisArg: 2.24 ± 0.848
HisSer: 2.24 ± 0.848
HisThr: 0.747 ± 0.485
HisVal: 2.987 ± 1.235
HisTrp: 0.0 ± 0.0
HisTyr: 0.747 ± 1.007
HisXaa: 0.0 ± 0.0
Ile
IleAla: 8.215 ± 1.598
IleCys: 0.0 ± 0.0
IleAsp: 4.481 ± 1.386
IleGlu: 2.24 ± 0.848
IlePhe: 1.494 ± 1.028
IleGly: 5.975 ± 1.979
IleHis: 0.0 ± 0.0
IleIle: 3.734 ± 1.699
IleLys: 5.228 ± 0.982
IleLeu: 3.734 ± 2.427
IleMet: 0.747 ± 0.514
IleAsn: 2.24 ± 0.969
IlePro: 1.494 ± 0.941
IleGln: 0.747 ± 0.514
IleArg: 2.24 ± 0.796
IleSer: 2.24 ± 0.848
IleThr: 0.747 ± 1.007
IleVal: 2.24 ± 1.456
IleTrp: 1.494 ± 0.462
IleTyr: 2.987 ± 1.235
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.734 ± 2.571
LysCys: 0.0 ± 0.0
LysAsp: 2.987 ± 1.325
LysGlu: 2.987 ± 1.43
LysPhe: 2.987 ± 1.942
LysGly: 1.494 ± 0.844
LysHis: 2.24 ± 0.969
LysIle: 2.24 ± 1.888
LysLys: 3.734 ± 1.208
LysLeu: 2.987 ± 1.942
LysMet: 0.747 ± 0.485
LysAsn: 3.734 ± 1.336
LysPro: 5.975 ± 1.848
LysGln: 0.0 ± 0.0
LysArg: 5.228 ± 0.382
LysSer: 4.481 ± 1.372
LysThr: 1.494 ± 0.971
LysVal: 1.494 ± 0.462
LysTrp: 2.987 ± 1.235
LysTyr: 1.494 ± 0.971
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.721 ± 3.005
LeuCys: 0.747 ± 0.514
LeuAsp: 3.734 ± 1.266
LeuGlu: 6.721 ± 1.213
LeuPhe: 2.24 ± 1.109
LeuGly: 3.734 ± 1.743
LeuHis: 1.494 ± 0.462
LeuIle: 2.987 ± 1.235
LeuLys: 6.721 ± 2.388
LeuLeu: 3.734 ± 1.266
LeuMet: 6.721 ± 1.311
LeuAsn: 4.481 ± 2.173
LeuPro: 6.721 ± 0.508
LeuGln: 2.24 ± 0.796
LeuArg: 1.494 ± 0.971
LeuSer: 6.721 ± 0.843
LeuThr: 6.721 ± 2.156
LeuVal: 2.24 ± 0.796
LeuTrp: 0.747 ± 0.485
LeuTyr: 2.987 ± 1.235
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.987 ± 0.924
MetCys: 0.0 ± 0.0
MetAsp: 1.494 ± 0.462
MetGlu: 2.24 ± 0.546
MetPhe: 0.747 ± 0.485
MetGly: 4.481 ± 2.326
MetHis: 1.494 ± 1.028
MetIle: 1.494 ± 0.462
MetLys: 2.987 ± 0.924
MetLeu: 0.747 ± 0.485
MetMet: 0.747 ± 1.007
MetAsn: 0.747 ± 1.007
MetPro: 0.747 ± 1.007
MetGln: 0.0 ± 0.0
MetArg: 1.494 ± 0.462
MetSer: 0.747 ± 0.514
MetThr: 2.987 ± 1.302
MetVal: 0.0 ± 0.0
MetTrp: 1.494 ± 0.462
MetTyr: 2.24 ± 0.796
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 9.709 ± 3.19
AsnCys: 1.494 ± 0.462
AsnAsp: 3.734 ± 0.093
AsnGlu: 5.228 ± 1.798
AsnPhe: 2.24 ± 1.543
AsnGly: 1.494 ± 2.014
AsnHis: 1.494 ± 1.028
AsnIle: 2.24 ± 0.848
AsnLys: 2.24 ± 1.785
AsnLeu: 3.734 ± 0.841
AsnMet: 0.747 ± 0.442
AsnAsn: 6.721 ± 1.741
AsnPro: 5.228 ± 0.382
AsnGln: 3.734 ± 1.999
AsnArg: 1.494 ± 0.844
AsnSer: 5.975 ± 1.192
AsnThr: 4.481 ± 1.299
AsnVal: 1.494 ± 0.844
AsnTrp: 0.747 ± 0.485
AsnTyr: 2.987 ± 1.325
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.481 ± 1.092
ProCys: 0.747 ± 0.485
ProAsp: 2.24 ± 0.796
ProGlu: 6.721 ± 1.977
ProPhe: 2.24 ± 1.543
ProGly: 2.24 ± 0.546
ProHis: 1.494 ± 0.844
ProIle: 1.494 ± 0.462
ProLys: 2.24 ± 0.796
ProLeu: 5.228 ± 2.513
ProMet: 3.734 ± 0.093
ProAsn: 4.481 ± 1.938
ProPro: 0.0 ± 0.0
ProGln: 0.747 ± 0.514
ProArg: 2.987 ± 0.924
ProSer: 2.987 ± 0.488
ProThr: 2.24 ± 0.546
ProVal: 0.747 ± 0.514
ProTrp: 0.747 ± 0.485
ProTyr: 1.494 ± 0.462
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.24 ± 0.796
GlnCys: 0.0 ± 0.0
GlnAsp: 2.24 ± 0.546
GlnGlu: 1.494 ± 0.971
GlnPhe: 4.481 ± 1.386
GlnGly: 2.987 ± 2.876
GlnHis: 0.747 ± 1.007
GlnIle: 3.734 ± 0.093
GlnLys: 0.747 ± 0.485
GlnLeu: 0.747 ± 0.485
GlnMet: 0.0 ± 0.0
GlnAsn: 2.24 ± 0.969
GlnPro: 1.494 ± 0.462
GlnGln: 1.494 ± 0.971
GlnArg: 2.24 ± 1.888
GlnSer: 3.734 ± 1.451
GlnThr: 0.747 ± 1.007
GlnVal: 1.494 ± 0.462
GlnTrp: 0.747 ± 1.007
GlnTyr: 1.494 ± 0.971
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.734 ± 0.093
ArgCys: 0.0 ± 0.0
ArgAsp: 2.987 ± 1.942
ArgGlu: 4.481 ± 3.571
ArgPhe: 0.0 ± 0.0
ArgGly: 3.734 ± 1.699
ArgHis: 3.734 ± 2.605
ArgIle: 2.24 ± 0.796
ArgLys: 0.747 ± 0.485
ArgLeu: 3.734 ± 1.208
ArgMet: 0.747 ± 0.514
ArgAsn: 2.987 ± 1.552
ArgPro: 0.747 ± 0.485
ArgGln: 1.494 ± 0.941
ArgArg: 2.24 ± 0.796
ArgSer: 4.481 ± 1.372
ArgThr: 3.734 ± 0.841
ArgVal: 2.24 ± 1.543
ArgTrp: 0.747 ± 0.514
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.494 ± 0.971
SerCys: 0.0 ± 0.0
SerAsp: 2.987 ± 1.43
SerGlu: 3.734 ± 1.208
SerPhe: 3.734 ± 1.822
SerGly: 5.975 ± 0.632
SerHis: 2.24 ± 1.456
SerIle: 0.747 ± 1.007
SerLys: 5.228 ± 0.95
SerLeu: 5.228 ± 1.035
SerMet: 2.24 ± 0.796
SerAsn: 5.228 ± 2.098
SerPro: 2.987 ± 0.488
SerGln: 2.24 ± 0.546
SerArg: 0.747 ± 0.485
SerSer: 5.228 ± 2.384
SerThr: 2.987 ± 0.924
SerVal: 2.987 ± 0.488
SerTrp: 2.24 ± 0.796
SerTyr: 3.734 ± 1.822
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.975 ± 2.135
ThrCys: 0.747 ± 0.514
ThrAsp: 3.734 ± 2.559
ThrGlu: 4.481 ± 2.425
ThrPhe: 1.494 ± 1.028
ThrGly: 2.24 ± 0.796
ThrHis: 0.0 ± 0.0
ThrIle: 5.228 ± 0.382
ThrLys: 2.987 ± 0.488
ThrLeu: 2.987 ± 0.924
ThrMet: 1.494 ± 1.028
ThrAsn: 8.215 ± 1.305
ThrPro: 4.481 ± 1.386
ThrGln: 2.24 ± 0.546
ThrArg: 2.24 ± 0.546
ThrSer: 2.24 ± 0.848
ThrThr: 2.987 ± 1.302
ThrVal: 1.494 ± 0.462
ThrTrp: 0.0 ± 0.0
ThrTyr: 4.481 ± 1.592
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.24 ± 1.543
ValCys: 1.494 ± 0.971
ValAsp: 0.747 ± 0.485
ValGlu: 3.734 ± 1.699
ValPhe: 0.0 ± 0.0
ValGly: 1.494 ± 0.844
ValHis: 0.0 ± 0.0
ValIle: 2.24 ± 0.848
ValLys: 4.481 ± 2.913
ValLeu: 6.721 ± 0.843
ValMet: 0.747 ± 0.514
ValAsn: 2.987 ± 1.325
ValPro: 3.734 ± 1.743
ValGln: 2.24 ± 0.546
ValArg: 1.494 ± 0.941
ValSer: 3.734 ± 0.841
ValThr: 3.734 ± 0.093
ValVal: 5.228 ± 1.706
ValTrp: 0.747 ± 0.485
ValTyr: 6.721 ± 1.977
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.987 ± 0.596
TrpCys: 0.0 ± 0.0
TrpAsp: 0.747 ± 0.485
TrpGlu: 0.747 ± 0.485
TrpPhe: 0.0 ± 0.0
TrpGly: 0.747 ± 0.485
TrpHis: 0.0 ± 0.0
TrpIle: 1.494 ± 0.941
TrpLys: 0.0 ± 0.0
TrpLeu: 2.987 ± 0.924
TrpMet: 0.747 ± 0.485
TrpAsn: 0.747 ± 0.485
TrpPro: 0.0 ± 0.0
TrpGln: 1.494 ± 0.941
TrpArg: 1.494 ± 0.462
TrpSer: 0.747 ± 0.485
TrpThr: 1.494 ± 0.462
TrpVal: 2.24 ± 0.546
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.494 ± 0.844
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 5.975 ± 2.65
TyrCys: 0.0 ± 0.0
TyrAsp: 5.975 ± 2.105
TyrGlu: 2.24 ± 0.796
TyrPhe: 1.494 ± 1.028
TyrGly: 5.228 ± 1.455
TyrHis: 0.747 ± 0.514
TyrIle: 0.747 ± 0.514
TyrLys: 1.494 ± 0.462
TyrLeu: 6.721 ± 2.095
TyrMet: 0.747 ± 0.485
TyrAsn: 5.228 ± 0.899
TyrPro: 2.987 ± 0.596
TyrGln: 2.987 ± 1.552
TyrArg: 1.494 ± 1.028
TyrSer: 3.734 ± 0.093
TyrThr: 0.0 ± 0.0
TyrVal: 1.494 ± 1.028
TyrTrp: 0.747 ± 0.485
TyrTyr: 0.747 ± 0.514
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1340 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
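A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the fixed seed, and the use of the population standard deviation as the error measure are assumptions for illustration; the database may compute its error differently.

```python
import random
from collections import Counter
from statistics import pstdev


def frequency_permille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over overlapping pairs in all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Hypothetical helper: spread of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_permille(sample, dipeptide))
    return pstdev(estimates)


# Toy proteome of three short sequences, not the real Picobirnavirus proteins.
toy_proteome = ["AAAA", "ACAC", "CCCC"]
err_aa = bootstrap_error(toy_proteome, "AA")
err_ww = bootstrap_error(toy_proteome, "WW")
```

With only three proteins, as in this proteome, each resample often omits one protein entirely, so the estimated errors can be large relative to the means.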

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski