Amino acid dipeptide frequency for Tavallinen suomalainen mies virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
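
As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the normalisation by the total number of dipeptides are assumptions made for this example; the page does not state how Proteome-pI performs the calculation.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return per-mille frequencies of overlapping dipeptides.

    `sequences` is an iterable of protein strings in one-letter code.
    Pairs are counted within each protein only, never across two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of two residues along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real proteome data): "MKLLA" gives MK, KL, LL, LA; "LLK" gives LL, LK.
freqs = dipeptide_frequencies(["MKLLA", "LLK"])
print(round(freqs["LL"], 2))  # 333.33
```

The toy sequences are only there to make the counting visible; a real run would read the proteome from a FASTA file.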

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
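
The expected value and the over/under-representation call can be sketched as follows, working in per mille (the unit used in the table). The helper names are hypothetical, and the choice of 400 rather than 441 as the baseline, as well as ignoring the error estimate when classifying, are assumptions; the page does not specify the exact rule behind the colouring.

```python
def expected_uniform_permille(alphabet_size=20):
    """Expected per-mille frequency of one dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


print(expected_uniform_permille(20))  # 2.5
print(classify(11.168))               # GluLeu in the table: "over"
print(classify(0.638))                # AlaTrp in the table: "under"
```

With Xaa included, `expected_uniform_permille(21)` gives roughly 2.27‰ instead of 2.5‰.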

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each block below is one row (first residue); entries are dipeptide: frequency ± error, in ‰.

Ala
AlaAla: 3.51 ± 0.872
AlaCys: 1.276 ± 1.28
AlaAsp: 2.234 ± 0.772
AlaGlu: 1.914 ± 0.627
AlaPhe: 1.276 ± 0.669
AlaGly: 3.191 ± 0.423
AlaHis: 1.276 ± 0.347
AlaIle: 3.829 ± 1.082
AlaLys: 2.234 ± 0.248
AlaLeu: 5.424 ± 2.784
AlaMet: 1.276 ± 0.731
AlaAsn: 1.276 ± 1.429
AlaPro: 1.595 ± 0.685
AlaGln: 1.276 ± 0.861
AlaArg: 2.234 ± 0.798
AlaSer: 2.234 ± 0.691
AlaThr: 1.276 ± 0.389
AlaVal: 2.872 ± 1.016
AlaTrp: 0.638 ± 0.335
AlaTyr: 0.638 ± 0.335
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.957 ± 0.586
CysCys: 0.957 ± 0.389
CysAsp: 0.957 ± 0.389
CysGlu: 0.638 ± 0.618
CysPhe: 0.638 ± 0.424
CysGly: 0.638 ± 0.618
CysHis: 0.638 ± 0.424
CysIle: 1.276 ± 0.956
CysLys: 2.234 ± 0.691
CysLeu: 3.191 ± 0.758
CysMet: 0.319 ± 0.691
CysAsn: 0.638 ± 0.424
CysPro: 0.638 ± 0.976
CysGln: 0.319 ± 0.691
CysArg: 0.957 ± 0.927
CysSer: 0.957 ± 0.927
CysThr: 0.638 ± 0.366
CysVal: 2.234 ± 1.333
CysTrp: 0.0 ± 0.0
CysTyr: 1.276 ± 0.601
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.872 ± 0.796
AspCys: 0.638 ± 0.335
AspAsp: 2.553 ± 2.042
AspGlu: 4.786 ± 0.439
AspPhe: 3.191 ± 1.023
AspGly: 3.191 ± 1.28
AspHis: 1.914 ± 1.003
AspIle: 3.829 ± 1.081
AspLys: 2.872 ± 0.55
AspLeu: 5.424 ± 1.098
AspMet: 2.553 ± 1.114
AspAsn: 1.914 ± 1.145
AspPro: 2.553 ± 1.518
AspGln: 1.914 ± 1.004
AspArg: 1.595 ± 0.836
AspSer: 4.786 ± 1.012
AspThr: 2.553 ± 0.927
AspVal: 3.191 ± 1.043
AspTrp: 1.276 ± 0.669
AspTyr: 2.234 ± 0.772
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.148 ± 0.457
GluCys: 1.276 ± 1.096
GluAsp: 4.148 ± 1.733
GluGlu: 7.339 ± 1.025
GluPhe: 3.829 ± 1.569
GluGly: 2.872 ± 0.55
GluHis: 0.957 ± 0.927
GluIle: 5.743 ± 0.717
GluLys: 3.51 ± 1.352
GluLeu: 11.168 ± 2.41
GluMet: 3.51 ± 0.984
GluAsn: 2.234 ± 0.594
GluPro: 3.51 ± 1.565
GluGln: 1.276 ± 0.669
GluArg: 4.467 ± 1.365
GluSer: 5.105 ± 1.4
GluThr: 5.105 ± 1.356
GluVal: 4.148 ± 1.285
GluTrp: 0.319 ± 0.167
GluTyr: 1.595 ± 1.171
GluXaa: 0.0 ± 0.0

Phe
PheAla: 0.319 ± 0.167
PheCys: 0.638 ± 0.618
PheAsp: 2.553 ± 1.338
PheGlu: 2.872 ± 0.358
PhePhe: 2.872 ± 0.358
PheGly: 2.872 ± 1.593
PheHis: 1.276 ± 0.423
PheIle: 1.914 ± 0.751
PheLys: 5.105 ± 0.914
PheLeu: 5.105 ± 1.705
PheMet: 2.234 ± 0.578
PheAsn: 1.914 ± 0.627
PhePro: 1.914 ± 1.004
PheGln: 2.234 ± 0.71
PheArg: 2.553 ± 0.373
PheSer: 4.148 ± 0.503
PheThr: 2.234 ± 1.215
PheVal: 2.553 ± 1.649
PheTrp: 0.0 ± 0.0
PheTyr: 1.276 ± 0.948
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.276 ± 0.669
GlyCys: 1.595 ± 0.707
GlyAsp: 3.51 ± 1.84
GlyGlu: 6.382 ± 0.623
GlyPhe: 2.553 ± 1.696
GlyGly: 2.872 ± 0.55
GlyHis: 2.234 ± 0.578
GlyIle: 3.51 ± 0.733
GlyLys: 3.191 ± 1.209
GlyLeu: 6.701 ± 0.764
GlyMet: 2.553 ± 0.373
GlyAsn: 1.595 ± 0.512
GlyPro: 4.148 ± 1.032
GlyGln: 0.957 ± 0.502
GlyArg: 2.234 ± 0.773
GlySer: 4.148 ± 1.012
GlyThr: 3.51 ± 1.488
GlyVal: 0.638 ± 0.424
GlyTrp: 1.595 ± 0.809
GlyTyr: 1.595 ± 0.809
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.638 ± 0.366
HisCys: 0.319 ± 0.167
HisAsp: 0.319 ± 0.514
HisGlu: 3.191 ± 0.997
HisPhe: 0.319 ± 0.167
HisGly: 0.957 ± 0.502
HisHis: 1.276 ± 0.423
HisIle: 0.957 ± 0.502
HisLys: 0.957 ± 0.502
HisLeu: 1.595 ± 0.836
HisMet: 0.0 ± 0.0
HisAsn: 1.595 ± 1.347
HisPro: 0.638 ± 0.913
HisGln: 0.638 ± 0.424
HisArg: 1.276 ± 0.423
HisSer: 2.872 ± 1.166
HisThr: 0.638 ± 0.335
HisVal: 0.957 ± 0.927
HisTrp: 0.0 ± 0.0
HisTyr: 1.276 ± 0.669
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.553 ± 1.449
IleCys: 1.276 ± 0.948
IleAsp: 4.467 ± 0.907
IleGlu: 5.424 ± 1.752
IlePhe: 3.51 ± 1.679
IleGly: 3.191 ± 0.929
IleHis: 1.276 ± 0.669
IleIle: 5.743 ± 0.593
IleLys: 4.786 ± 1.261
IleLeu: 7.977 ± 1.243
IleMet: 1.595 ± 0.287
IleAsn: 3.191 ± 0.712
IlePro: 0.957 ± 0.502
IleGln: 1.914 ± 1.145
IleArg: 3.829 ± 0.679
IleSer: 6.701 ± 0.975
IleThr: 5.424 ± 1.539
IleVal: 5.424 ± 1.302
IleTrp: 0.0 ± 0.0
IleTyr: 1.914 ± 0.159
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.786 ± 1.965
LysCys: 0.319 ± 0.167
LysAsp: 3.51 ± 2.263
LysGlu: 3.51 ± 0.557
LysPhe: 4.467 ± 0.622
LysGly: 5.105 ± 0.932
LysHis: 1.276 ± 0.389
LysIle: 5.743 ± 2.038
LysLys: 3.51 ± 1.52
LysLeu: 7.02 ± 1.32
LysMet: 1.276 ± 0.669
LysAsn: 2.872 ± 1.165
LysPro: 3.51 ± 1.117
LysGln: 2.553 ± 0.779
LysArg: 4.467 ± 1.061
LysSer: 3.191 ± 0.989
LysThr: 4.786 ± 1.535
LysVal: 3.191 ± 0.706
LysTrp: 1.276 ± 0.669
LysTyr: 2.234 ± 1.171
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.02 ± 0.816
LeuCys: 1.914 ± 0.159
LeuAsp: 5.105 ± 1.558
LeuGlu: 7.977 ± 1.968
LeuPhe: 5.424 ± 0.982
LeuGly: 5.105 ± 0.647
LeuHis: 1.276 ± 0.423
LeuIle: 6.701 ± 2.05
LeuLys: 9.253 ± 1.782
LeuLeu: 11.168 ± 1.238
LeuMet: 3.191 ± 0.998
LeuAsn: 7.658 ± 0.976
LeuPro: 3.191 ± 1.449
LeuGln: 3.191 ± 1.023
LeuArg: 6.063 ± 1.859
LeuSer: 9.892 ± 1.736
LeuThr: 6.063 ± 1.267
LeuVal: 7.339 ± 1.701
LeuTrp: 1.276 ± 0.601
LeuTyr: 3.191 ± 1.043
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.957 ± 0.502
MetCys: 0.638 ± 1.028
MetAsp: 2.234 ± 1.044
MetGlu: 2.234 ± 0.865
MetPhe: 1.595 ± 0.812
MetGly: 1.914 ± 0.778
MetHis: 0.0 ± 0.0
MetIle: 1.914 ± 0.778
MetLys: 1.595 ± 0.796
MetLeu: 2.234 ± 1.008
MetMet: 0.957 ± 1.016
MetAsn: 1.595 ± 0.512
MetPro: 1.276 ± 0.389
MetGln: 0.957 ± 0.389
MetArg: 1.595 ± 0.597
MetSer: 2.872 ± 1.583
MetThr: 1.276 ± 0.389
MetVal: 3.191 ± 0.523
MetTrp: 0.0 ± 0.0
MetTyr: 0.319 ± 0.167
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 0.638 ± 0.618
AsnCys: 0.638 ± 0.335
AsnAsp: 2.872 ± 0.358
AsnGlu: 1.914 ± 0.677
AsnPhe: 1.914 ± 0.159
AsnGly: 3.51 ± 0.334
AsnHis: 0.319 ± 0.167
AsnIle: 3.829 ± 0.317
AsnLys: 4.467 ± 1.494
AsnLeu: 3.829 ± 1.569
AsnMet: 0.319 ± 0.514
AsnAsn: 2.234 ± 0.248
AsnPro: 2.234 ± 0.691
AsnGln: 1.595 ± 1.171
AsnArg: 2.872 ± 0.358
AsnSer: 6.063 ± 0.966
AsnThr: 2.234 ± 0.248
AsnVal: 1.914 ± 1.31
AsnTrp: 0.638 ± 0.335
AsnTyr: 1.595 ± 1.347
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.957 ± 0.339
ProCys: 1.914 ± 0.877
ProAsp: 2.553 ± 0.482
ProGlu: 3.191 ± 0.876
ProPhe: 2.234 ± 0.453
ProGly: 0.319 ± 0.514
ProHis: 0.319 ± 0.167
ProIle: 1.595 ± 0.796
ProLys: 4.148 ± 1.378
ProLeu: 2.872 ± 1.075
ProMet: 0.957 ± 1.167
ProAsn: 2.553 ± 1.722
ProPro: 0.957 ± 0.81
ProGln: 1.914 ± 0.159
ProArg: 3.191 ± 1.243
ProSer: 4.467 ± 0.938
ProThr: 2.553 ± 0.771
ProVal: 1.595 ± 0.978
ProTrp: 0.0 ± 0.0
ProTyr: 0.957 ± 0.339
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 0.319 ± 0.167
GlnCys: 0.319 ± 0.457
GlnAsp: 1.276 ± 0.389
GlnGlu: 1.595 ± 0.836
GlnPhe: 0.957 ± 0.502
GlnGly: 2.872 ± 0.45
GlnHis: 0.319 ± 0.457
GlnIle: 2.553 ± 1.465
GlnLys: 1.595 ± 0.495
GlnLeu: 4.148 ± 1.292
GlnMet: 0.319 ± 0.457
GlnAsn: 0.957 ± 0.502
GlnPro: 0.957 ± 0.502
GlnGln: 0.957 ± 0.502
GlnArg: 2.234 ± 0.453
GlnSer: 3.51 ± 0.525
GlnThr: 0.638 ± 0.335
GlnVal: 1.276 ± 0.669
GlnTrp: 0.638 ± 0.424
GlnTyr: 0.638 ± 0.335
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.276 ± 0.731
ArgCys: 0.957 ± 0.586
ArgAsp: 3.191 ± 0.523
ArgGlu: 3.829 ± 1.15
ArgPhe: 1.276 ± 0.389
ArgGly: 2.234 ± 0.846
ArgHis: 0.957 ± 0.502
ArgIle: 4.148 ± 1.398
ArgLys: 3.829 ± 1.254
ArgLeu: 9.253 ± 1.808
ArgMet: 2.234 ± 0.33
ArgAsn: 1.595 ± 0.512
ArgPro: 1.914 ± 0.679
ArgGln: 1.595 ± 0.512
ArgArg: 2.553 ± 0.508
ArgSer: 5.743 ± 1.407
ArgThr: 3.51 ± 1.281
ArgVal: 2.553 ± 0.508
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.595 ± 0.836
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.872 ± 0.358
SerCys: 1.914 ± 1.638
SerAsp: 6.382 ± 2.252
SerGlu: 7.977 ± 1.388
SerPhe: 3.829 ± 1.553
SerGly: 3.191 ± 1.347
SerHis: 2.234 ± 0.772
SerIle: 7.658 ± 0.777
SerLys: 6.063 ± 1.315
SerLeu: 8.615 ± 1.35
SerMet: 0.957 ± 0.502
SerAsn: 3.51 ± 1.139
SerPro: 3.191 ± 1.828
SerGln: 2.234 ± 0.453
SerArg: 3.829 ± 0.606
SerSer: 7.658 ± 0.514
SerThr: 5.424 ± 1.302
SerVal: 4.467 ± 1.547
SerTrp: 1.276 ± 0.347
SerTyr: 2.234 ± 0.772
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.872 ± 1.505
ThrCys: 0.957 ± 0.502
ThrAsp: 2.234 ± 1.215
ThrGlu: 2.234 ± 0.565
ThrPhe: 2.872 ± 0.734
ThrGly: 4.467 ± 0.441
ThrHis: 0.638 ± 1.028
ThrIle: 3.829 ± 0.716
ThrLys: 3.51 ± 0.875
ThrLeu: 5.424 ± 1.304
ThrMet: 2.553 ± 0.741
ThrAsn: 3.829 ± 0.317
ThrPro: 1.914 ± 0.627
ThrGln: 0.638 ± 0.424
ThrArg: 3.829 ± 2.007
ThrSer: 5.743 ± 0.268
ThrThr: 2.553 ± 0.917
ThrVal: 2.872 ± 1.348
ThrTrp: 0.957 ± 0.389
ThrTyr: 0.957 ± 0.502
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.234 ± 0.248
ValCys: 2.234 ± 1.629
ValAsp: 2.234 ± 0.772
ValGlu: 6.063 ± 2.052
ValPhe: 2.234 ± 1.591
ValGly: 4.148 ± 0.847
ValHis: 1.276 ± 0.948
ValIle: 3.51 ± 2.066
ValLys: 3.51 ± 1.696
ValLeu: 6.063 ± 0.941
ValMet: 0.638 ± 0.335
ValAsn: 3.191 ± 0.423
ValPro: 3.191 ± 0.423
ValGln: 1.595 ± 0.495
ValArg: 2.872 ± 1.166
ValSer: 2.872 ± 1.135
ValThr: 2.872 ± 0.924
ValVal: 4.148 ± 0.834
ValTrp: 0.0 ± 0.0
ValTyr: 1.276 ± 0.848
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.638 ± 0.335
TrpCys: 0.319 ± 0.691
TrpAsp: 0.319 ± 0.457
TrpGlu: 0.638 ± 0.335
TrpPhe: 0.957 ± 0.502
TrpGly: 1.276 ± 0.848
TrpHis: 0.319 ± 0.167
TrpIle: 1.595 ± 0.796
TrpLys: 0.319 ± 0.167
TrpLeu: 1.276 ± 0.669
TrpMet: 0.319 ± 0.167
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.638 ± 0.335
TrpSer: 0.0 ± 0.0
TrpThr: 0.319 ± 0.167
TrpVal: 0.957 ± 0.502
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.638 ± 0.366
TyrCys: 0.0 ± 0.0
TyrAsp: 3.191 ± 0.712
TyrGlu: 2.553 ± 0.995
TyrPhe: 0.638 ± 0.335
TyrGly: 2.872 ± 0.924
TyrHis: 0.638 ± 0.424
TyrIle: 1.276 ± 0.731
TyrLys: 1.914 ± 0.677
TyrLeu: 3.51 ± 1.392
TyrMet: 1.276 ± 0.347
TyrAsn: 1.276 ± 0.389
TyrPro: 1.276 ± 0.347
TyrGln: 0.319 ± 0.167
TyrArg: 0.957 ± 0.502
TyrSer: 2.553 ± 1.702
TyrThr: 1.276 ± 0.423
TyrVal: 0.638 ± 0.335
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.957 ± 0.502
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4 proteins (3135 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
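
A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The function names, the fixed random seed, and the use of the standard deviation as the error measure are assumptions for illustration; the note above only states that bootstrapping with 100 resamples at the protein level was used.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences):
    """Per-mille frequencies of overlapping dipeptides (assumed normalisation)."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the per-mille frequency of `pair` over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    # Assumption: report the standard deviation of the bootstrap estimates.
    return statistics.pstdev(estimates)


# Toy sequences, not taken from the proteome described on this page.
toy = ["MKLLA", "LLK", "AAAA", "LLLL"]
error_ll = bootstrap_error(toy, "LL")
```

With only four proteins, as in this proteome, each resample can differ substantially from the original set, which is consistent with the fairly large errors seen for many dipeptides in the table.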

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski