Amino acid dipeptide frequency for Microviridae Fen418_41

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
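As an illustration of how such a table could be computed, here is a minimal Python sketch. It is not the code used by Proteome-pI: the function names, the one-letter alphabet, the mapping of unknown residues to X (Xaa), and the choice to count overlapping pairs within each protein separately are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# Assumption: sequences use the 20 standard one-letter codes;
# anything else is treated as X (Xaa).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTED_PER_MILLE = 1000.0 / (len(AMINO_ACIDS) ** 2)


def dipeptide_per_mille(sequences):
    """Return the frequency (in per mille) of all 441 dipeptides."""
    counts = Counter()
    for seq in sequences:
        # Overlapping adjacent pairs, counted within each protein only.
        for first, second in zip(seq, seq[1:]):
            a = first if first in AMINO_ACIDS else "X"
            b = second if second in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    symbols = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(symbols, repeat=2)
    }


def classify(per_mille):
    """Label a value relative to the uniform 2.5 per mille expectation."""
    if per_mille > UNIFORM_EXPECTED_PER_MILLE:
        return "over"   # would be shown in red
    if per_mille < UNIFORM_EXPECTED_PER_MILLE:
        return "under"  # would be shown in blue
    return "expected"
```

For example, the AlaAla value of 9.954‰ in the table corresponds to a fraction of 0.009954 and would be classified as "over", while AlaCys at 0.664‰ would be "under".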

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.954AlaAla: 9.954 ± 1.983
0.664AlaCys: 0.664 ± 0.476
5.309AlaAsp: 5.309 ± 0.845
4.645AlaGlu: 4.645 ± 1.614
2.654AlaPhe: 2.654 ± 0.735
6.636AlaGly: 6.636 ± 2.507
1.327AlaHis: 1.327 ± 0.685
4.645AlaIle: 4.645 ± 0.695
1.991AlaLys: 1.991 ± 1.148
9.29AlaLeu: 9.29 ± 1.58
0.664AlaMet: 0.664 ± 0.476
4.645AlaAsn: 4.645 ± 1.615
2.654AlaPro: 2.654 ± 1.327
8.626AlaGln: 8.626 ± 8.337
1.991AlaArg: 1.991 ± 1.009
3.318AlaSer: 3.318 ± 0.966
1.991AlaThr: 1.991 ± 1.038
3.318AlaVal: 3.318 ± 2.719
0.0AlaTrp: 0.0 ± 0.0
4.645AlaTyr: 4.645 ± 1.394
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.327CysAsp: 1.327 ± 1.264
0.0CysGlu: 0.0 ± 0.0
1.327CysPhe: 1.327 ± 0.611
2.654CysGly: 2.654 ± 1.75
0.0CysHis: 0.0 ± 0.0
0.664CysIle: 0.664 ± 0.476
0.0CysLys: 0.0 ± 0.0
1.327CysLeu: 1.327 ± 1.264
0.664CysMet: 0.664 ± 0.476
0.0CysAsn: 0.0 ± 0.0
0.664CysPro: 0.664 ± 0.476
0.664CysGln: 0.664 ± 0.476
1.327CysArg: 1.327 ± 1.136
1.327CysSer: 1.327 ± 0.611
0.0CysThr: 0.0 ± 0.0
0.664CysVal: 0.664 ± 0.476
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.991AspAla: 1.991 ± 1.912
0.0AspCys: 0.0 ± 0.0
2.654AspAsp: 2.654 ± 2.063
1.991AspGlu: 1.991 ± 1.148
1.327AspPhe: 1.327 ± 0.685
2.654AspGly: 2.654 ± 1.75
1.991AspHis: 1.991 ± 1.025
0.664AspIle: 0.664 ± 0.476
0.0AspLys: 0.0 ± 0.0
3.318AspLeu: 3.318 ± 1.093
3.981AspMet: 3.981 ± 2.962
1.327AspAsn: 1.327 ± 1.418
0.664AspPro: 0.664 ± 0.887
4.645AspGln: 4.645 ± 1.934
6.636AspArg: 6.636 ± 3.287
3.318AspSer: 3.318 ± 0.966
3.318AspThr: 3.318 ± 1.801
3.318AspVal: 3.318 ± 1.87
1.991AspTrp: 1.991 ± 1.168
2.654AspTyr: 2.654 ± 1.012
0.0AspXaa: 0.0 ± 0.0
Glu
5.972GluAla: 5.972 ± 1.925
0.664GluCys: 0.664 ± 0.476
1.327GluAsp: 1.327 ± 1.13
2.654GluGlu: 2.654 ± 1.33
3.981GluPhe: 3.981 ± 1.003
0.664GluGly: 0.664 ± 0.632
1.327GluHis: 1.327 ± 1.264
0.664GluIle: 0.664 ± 0.709
1.991GluLys: 1.991 ± 1.558
3.318GluLeu: 3.318 ± 1.992
1.327GluMet: 1.327 ± 0.685
1.991GluAsn: 1.991 ± 0.943
0.664GluPro: 0.664 ± 0.887
1.991GluGln: 1.991 ± 1.879
4.645GluArg: 4.645 ± 2.905
1.991GluSer: 1.991 ± 1.817
1.327GluThr: 1.327 ± 1.264
2.654GluVal: 2.654 ± 1.285
1.991GluTrp: 1.991 ± 1.427
1.991GluTyr: 1.991 ± 0.894
0.0GluXaa: 0.0 ± 0.0
Phe
3.318PheAla: 3.318 ± 1.191
0.0PheCys: 0.0 ± 0.0
2.654PheAsp: 2.654 ± 1.348
1.327PheGlu: 1.327 ± 0.964
1.991PhePhe: 1.991 ± 0.894
3.981PheGly: 3.981 ± 1.833
0.664PheHis: 0.664 ± 0.476
1.991PheIle: 1.991 ± 0.943
1.327PheLys: 1.327 ± 1.031
5.972PheLeu: 5.972 ± 1.791
0.0PheMet: 0.0 ± 0.0
2.654PheAsn: 2.654 ± 0.921
1.327PhePro: 1.327 ± 0.611
4.645PheGln: 4.645 ± 2.036
2.654PheArg: 2.654 ± 1.222
1.991PheSer: 1.991 ± 0.894
3.981PheThr: 3.981 ± 1.488
1.327PheVal: 1.327 ± 0.685
0.664PheTrp: 0.664 ± 0.476
0.664PheTyr: 0.664 ± 0.632
0.0PheXaa: 0.0 ± 0.0
Gly
4.645GlyAla: 4.645 ± 1.452
0.664GlyCys: 0.664 ± 0.632
3.981GlyAsp: 3.981 ± 0.741
3.318GlyGlu: 3.318 ± 1.833
0.0GlyPhe: 0.0 ± 0.0
7.963GlyGly: 7.963 ± 1.81
1.991GlyHis: 1.991 ± 1.427
4.645GlyIle: 4.645 ± 1.757
1.327GlyLys: 1.327 ± 1.264
7.963GlyLeu: 7.963 ± 1.517
1.327GlyMet: 1.327 ± 0.611
3.981GlyAsn: 3.981 ± 0.978
1.991GlyPro: 1.991 ± 1.427
0.0GlyGln: 0.0 ± 0.0
1.991GlyArg: 1.991 ± 1.148
6.636GlySer: 6.636 ± 1.804
6.636GlyThr: 6.636 ± 2.695
3.318GlyVal: 3.318 ± 0.992
1.327GlyTrp: 1.327 ± 0.611
4.645GlyTyr: 4.645 ± 2.175
0.0GlyXaa: 0.0 ± 0.0
His
0.664HisAla: 0.664 ± 0.476
0.664HisCys: 0.664 ± 0.476
1.327HisAsp: 1.327 ± 0.952
0.664HisGlu: 0.664 ± 0.974
0.664HisPhe: 0.664 ± 0.476
1.991HisGly: 1.991 ± 1.311
0.0HisHis: 0.0 ± 0.0
1.991HisIle: 1.991 ± 0.894
2.654HisLys: 2.654 ± 1.75
1.991HisLeu: 1.991 ± 0.968
0.664HisMet: 0.664 ± 0.476
1.991HisAsn: 1.991 ± 1.512
0.664HisPro: 0.664 ± 0.709
0.0HisGln: 0.0 ± 0.0
0.664HisArg: 0.664 ± 0.476
0.0HisSer: 0.0 ± 0.0
0.664HisThr: 0.664 ± 0.632
0.664HisVal: 0.664 ± 0.632
0.664HisTrp: 0.664 ± 0.476
0.664HisTyr: 0.664 ± 0.632
0.0HisXaa: 0.0 ± 0.0
Ile
5.309IleAla: 5.309 ± 1.52
0.664IleCys: 0.664 ± 0.632
3.981IleAsp: 3.981 ± 2.051
1.991IleGlu: 1.991 ± 0.923
3.318IlePhe: 3.318 ± 0.966
1.327IleGly: 1.327 ± 0.84
1.991IleHis: 1.991 ± 0.894
1.991IleIle: 1.991 ± 1.311
1.991IleLys: 1.991 ± 0.444
1.991IleLeu: 1.991 ± 1.168
3.981IleMet: 3.981 ± 1.671
9.29IleAsn: 9.29 ± 2.256
2.654IlePro: 2.654 ± 1.012
1.327IleGln: 1.327 ± 0.952
2.654IleArg: 2.654 ± 1.38
5.309IleSer: 5.309 ± 1.338
4.645IleThr: 4.645 ± 1.224
7.299IleVal: 7.299 ± 0.491
0.0IleTrp: 0.0 ± 0.0
1.991IleTyr: 1.991 ± 1.123
0.0IleXaa: 0.0 ± 0.0
Lys
4.645LysAla: 4.645 ± 2.792
1.991LysCys: 1.991 ± 1.148
1.991LysAsp: 1.991 ± 1.065
1.327LysGlu: 1.327 ± 1.438
0.664LysPhe: 0.664 ± 0.476
0.664LysGly: 0.664 ± 0.476
0.0LysHis: 0.0 ± 0.0
2.654LysIle: 2.654 ± 1.366
2.654LysLys: 2.654 ± 2.528
1.991LysLeu: 1.991 ± 0.943
1.327LysMet: 1.327 ± 0.694
1.327LysAsn: 1.327 ± 1.031
2.654LysPro: 2.654 ± 1.095
1.991LysGln: 1.991 ± 0.923
3.318LysArg: 3.318 ± 1.813
1.327LysSer: 1.327 ± 1.13
1.991LysThr: 1.991 ± 1.025
1.327LysVal: 1.327 ± 0.84
0.0LysTrp: 0.0 ± 0.0
1.991LysTyr: 1.991 ± 1.896
0.0LysXaa: 0.0 ± 0.0
Leu
7.299LeuAla: 7.299 ± 2.292
0.664LeuCys: 0.664 ± 0.476
3.981LeuAsp: 3.981 ± 1.845
2.654LeuGlu: 2.654 ± 1.887
3.318LeuPhe: 3.318 ± 1.801
5.309LeuGly: 5.309 ± 1.103
0.0LeuHis: 0.0 ± 0.0
8.626LeuIle: 8.626 ± 1.991
1.991LeuLys: 1.991 ± 1.496
3.981LeuLeu: 3.981 ± 1.669
2.654LeuMet: 2.654 ± 1.993
7.299LeuAsn: 7.299 ± 2.013
5.972LeuPro: 5.972 ± 2.326
5.309LeuGln: 5.309 ± 2.897
2.654LeuArg: 2.654 ± 1.306
5.972LeuSer: 5.972 ± 2.466
9.29LeuThr: 9.29 ± 3.015
3.981LeuVal: 3.981 ± 2.002
1.991LeuTrp: 1.991 ± 0.894
2.654LeuTyr: 2.654 ± 1.371
0.0LeuXaa: 0.0 ± 0.0
Met
1.327MetAla: 1.327 ± 1.264
0.664MetCys: 0.664 ± 0.476
0.664MetAsp: 0.664 ± 0.974
0.0MetGlu: 0.0 ± 0.0
0.664MetPhe: 0.664 ± 0.476
2.654MetGly: 2.654 ± 1.296
0.0MetHis: 0.0 ± 0.0
1.991MetIle: 1.991 ± 0.444
1.991MetLys: 1.991 ± 1.522
0.0MetLeu: 0.0 ± 0.0
1.327MetMet: 1.327 ± 1.169
3.981MetAsn: 3.981 ± 1.806
3.318MetPro: 3.318 ± 0.664
3.318MetGln: 3.318 ± 1.898
0.664MetArg: 0.664 ± 0.476
3.981MetSer: 3.981 ± 0.903
1.991MetThr: 1.991 ± 1.148
2.654MetVal: 2.654 ± 2.126
1.327MetTrp: 1.327 ± 0.694
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.318AsnAla: 3.318 ± 3.545
0.664AsnCys: 0.664 ± 0.632
2.654AsnAsp: 2.654 ± 0.85
3.318AsnGlu: 3.318 ± 1.191
1.991AsnPhe: 1.991 ± 0.943
2.654AsnGly: 2.654 ± 1.222
2.654AsnHis: 2.654 ± 1.726
6.636AsnIle: 6.636 ± 2.843
1.327AsnLys: 1.327 ± 1.031
8.626AsnLeu: 8.626 ± 4.89
1.327AsnMet: 1.327 ± 0.694
3.981AsnAsn: 3.981 ± 1.584
1.991AsnPro: 1.991 ± 0.943
7.299AsnGln: 7.299 ± 3.288
3.981AsnArg: 3.981 ± 1.318
5.309AsnSer: 5.309 ± 2.221
4.645AsnThr: 4.645 ± 1.864
3.318AsnVal: 3.318 ± 1.269
0.664AsnTrp: 0.664 ± 0.476
1.991AsnTyr: 1.991 ± 0.968
0.0AsnXaa: 0.0 ± 0.0
Pro
2.654ProAla: 2.654 ± 1.327
1.327ProCys: 1.327 ± 1.264
3.318ProAsp: 3.318 ± 1.622
3.318ProGlu: 3.318 ± 0.799
1.991ProPhe: 1.991 ± 1.148
3.318ProGly: 3.318 ± 2.379
1.991ProHis: 1.991 ± 0.894
3.318ProIle: 3.318 ± 1.87
2.654ProLys: 2.654 ± 1.306
4.645ProLeu: 4.645 ± 1.844
1.991ProMet: 1.991 ± 0.943
3.318ProAsn: 3.318 ± 1.968
1.327ProPro: 1.327 ± 0.964
3.318ProGln: 3.318 ± 1.757
1.327ProArg: 1.327 ± 0.694
3.318ProSer: 3.318 ± 1.563
2.654ProThr: 2.654 ± 1.903
2.654ProVal: 2.654 ± 1.296
1.327ProTrp: 1.327 ± 0.964
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
7.299GlnAla: 7.299 ± 3.03
0.0GlnCys: 0.0 ± 0.0
2.654GlnAsp: 2.654 ± 0.721
4.645GlnGlu: 4.645 ± 1.741
1.991GlnPhe: 1.991 ± 0.894
2.654GlnGly: 2.654 ± 1.327
0.664GlnHis: 0.664 ± 0.974
3.981GlnIle: 3.981 ± 1.783
2.654GlnLys: 2.654 ± 1.378
5.972GlnLeu: 5.972 ± 3.831
0.0GlnMet: 0.0 ± 0.0
4.645GlnAsn: 4.645 ± 3.09
3.318GlnPro: 3.318 ± 0.966
5.972GlnGln: 5.972 ± 3.76
3.981GlnArg: 3.981 ± 0.903
2.654GlnSer: 2.654 ± 1.474
6.636GlnThr: 6.636 ± 1.581
0.664GlnVal: 0.664 ± 0.709
0.664GlnTrp: 0.664 ± 0.709
1.991GlnTyr: 1.991 ± 1.009
0.0GlnXaa: 0.0 ± 0.0
Arg
3.318ArgAla: 3.318 ± 1.87
0.664ArgCys: 0.664 ± 0.476
1.327ArgAsp: 1.327 ± 0.964
2.654ArgGlu: 2.654 ± 2.003
3.318ArgPhe: 3.318 ± 2.653
1.327ArgGly: 1.327 ± 0.84
0.664ArgHis: 0.664 ± 0.709
4.645ArgIle: 4.645 ± 2.051
3.318ArgLys: 3.318 ± 1.571
3.981ArgLeu: 3.981 ± 1.789
1.991ArgMet: 1.991 ± 1.148
2.654ArgAsn: 2.654 ± 1.095
3.981ArgPro: 3.981 ± 1.807
3.318ArgGln: 3.318 ± 1.572
7.299ArgArg: 7.299 ± 4.239
3.318ArgSer: 3.318 ± 1.728
1.327ArgThr: 1.327 ± 0.694
4.645ArgVal: 4.645 ± 2.29
0.0ArgTrp: 0.0 ± 0.0
2.654ArgTyr: 2.654 ± 0.603
0.0ArgXaa: 0.0 ± 0.0
Ser
3.318SerAla: 3.318 ± 0.992
1.327SerCys: 1.327 ± 1.264
1.991SerAsp: 1.991 ± 1.168
1.991SerGlu: 1.991 ± 1.168
3.981SerPhe: 3.981 ± 2.297
8.626SerGly: 8.626 ± 2.59
0.664SerHis: 0.664 ± 0.476
5.972SerIle: 5.972 ± 2.08
2.654SerLys: 2.654 ± 1.993
7.299SerLeu: 7.299 ± 2.388
1.991SerMet: 1.991 ± 0.648
3.318SerAsn: 3.318 ± 1.968
2.654SerPro: 2.654 ± 1.371
0.664SerGln: 0.664 ± 0.632
1.991SerArg: 1.991 ± 0.444
1.991SerSer: 1.991 ± 0.943
5.972SerThr: 5.972 ± 1.572
2.654SerVal: 2.654 ± 1.474
0.664SerTrp: 0.664 ± 0.632
0.664SerTyr: 0.664 ± 0.632
0.0SerXaa: 0.0 ± 0.0
Thr
7.299ThrAla: 7.299 ± 3.869
0.664ThrCys: 0.664 ± 0.632
1.327ThrAsp: 1.327 ± 0.611
2.654ThrGlu: 2.654 ± 1.296
3.318ThrPhe: 3.318 ± 1.735
6.636ThrGly: 6.636 ± 1.538
0.664ThrHis: 0.664 ± 0.476
3.318ThrIle: 3.318 ± 1.894
3.318ThrLys: 3.318 ± 1.813
5.309ThrLeu: 5.309 ± 3.5
0.664ThrMet: 0.664 ± 0.709
4.645ThrAsn: 4.645 ± 1.456
4.645ThrPro: 4.645 ± 1.689
5.972ThrGln: 5.972 ± 2.042
2.654ThrArg: 2.654 ± 1.306
2.654ThrSer: 2.654 ± 1.296
7.299ThrThr: 7.299 ± 2.38
4.645ThrVal: 4.645 ± 1.394
0.0ThrTrp: 0.0 ± 0.0
2.654ThrTyr: 2.654 ± 1.095
0.0ThrXaa: 0.0 ± 0.0
Val
3.318ValAla: 3.318 ± 2.689
1.327ValCys: 1.327 ± 0.964
2.654ValAsp: 2.654 ± 1.366
1.991ValGlu: 1.991 ± 1.905
3.318ValPhe: 3.318 ± 2.143
3.318ValGly: 3.318 ± 1.165
0.664ValHis: 0.664 ± 0.632
2.654ValIle: 2.654 ± 1.38
1.327ValLys: 1.327 ± 0.685
3.981ValLeu: 3.981 ± 1.584
3.318ValMet: 3.318 ± 0.664
3.981ValAsn: 3.981 ± 1.886
5.972ValPro: 5.972 ± 2.38
1.991ValGln: 1.991 ± 1.912
2.654ValArg: 2.654 ± 1.366
3.318ValSer: 3.318 ± 1.231
3.318ValThr: 3.318 ± 1.757
0.664ValVal: 0.664 ± 0.476
0.0ValTrp: 0.0 ± 0.0
1.327ValTyr: 1.327 ± 0.694
0.0ValXaa: 0.0 ± 0.0
Trp
0.664TrpAla: 0.664 ± 0.632
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.664TrpGlu: 0.664 ± 0.476
0.664TrpPhe: 0.664 ± 0.476
0.664TrpGly: 0.664 ± 0.476
0.664TrpHis: 0.664 ± 0.476
0.664TrpIle: 0.664 ± 0.632
0.0TrpLys: 0.0 ± 0.0
1.327TrpLeu: 1.327 ± 0.964
0.0TrpMet: 0.0 ± 0.0
0.664TrpAsn: 0.664 ± 0.709
1.327TrpPro: 1.327 ± 0.964
1.327TrpGln: 1.327 ± 0.611
1.991TrpArg: 1.991 ± 0.943
0.0TrpSer: 0.0 ± 0.0
1.327TrpThr: 1.327 ± 0.952
0.0TrpVal: 0.0 ± 0.0
0.664TrpTrp: 0.664 ± 0.476
1.327TrpTyr: 1.327 ± 0.611
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.654TyrAla: 2.654 ± 1.378
0.0TyrCys: 0.0 ± 0.0
3.318TyrAsp: 3.318 ± 0.993
1.327TyrGlu: 1.327 ± 0.694
2.654TyrPhe: 2.654 ± 1.222
2.654TyrGly: 2.654 ± 0.603
1.327TyrHis: 1.327 ± 1.264
1.991TyrIle: 1.991 ± 0.894
0.664TyrLys: 0.664 ± 0.476
3.318TyrLeu: 3.318 ± 0.799
2.654TyrMet: 2.654 ± 1.721
2.654TyrAsn: 2.654 ± 2.836
1.327TyrPro: 1.327 ± 0.611
0.664TyrGln: 0.664 ± 0.476
1.327TyrArg: 1.327 ± 0.611
3.318TyrSer: 3.318 ± 1.656
1.327TyrThr: 1.327 ± 0.611
1.327TyrVal: 1.327 ± 0.952
0.0TyrTrp: 0.0 ± 0.0
0.664TyrTyr: 0.664 ± 0.476
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1508 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
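The note above only states that the error comes from 100 bootstrap replicates at the protein level. A minimal sketch of one way to do this follows, assuming that whole proteins are resampled with replacement and that the reported error is the standard deviation across replicates. Both assumptions, as well as the function names and the fixed seed, are hypothetical and not taken from the database.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Per mille frequency of each dipeptide observed in the sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Standard deviation of a dipeptide frequency over bootstrap replicates.

    Each replicate draws len(sequences) whole proteins with replacement
    (protein-level resampling), then recomputes the frequency.
    """
    rng = random.Random(seed)  # fixed seed only for reproducibility
    values = []
    for _ in range(replicates):
        sample = [rng.choice(sequences) for _ in sequences]
        values.append(dipeptide_per_mille(sample).get(dipeptide, 0.0))
    return statistics.stdev(values)
```

With only 5 proteins, each replicate can differ substantially from the original set, which is consistent with the large errors relative to the values seen for many dipeptides in the table.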

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski