Amino acid dipepetide frequency for Aspergillus foetidus dsRNA mycovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.455AlaAla: 11.455 ± 2.428
1.238AlaCys: 1.238 ± 0.349
6.502AlaAsp: 6.502 ± 1.516
7.121AlaGlu: 7.121 ± 0.741
2.167AlaPhe: 2.167 ± 0.732
11.765AlaGly: 11.765 ± 1.349
1.238AlaHis: 1.238 ± 0.587
4.644AlaIle: 4.644 ± 0.322
2.477AlaLys: 2.477 ± 0.47
9.598AlaLeu: 9.598 ± 0.615
3.096AlaMet: 3.096 ± 0.863
1.858AlaAsn: 1.858 ± 0.441
4.644AlaPro: 4.644 ± 1.145
3.406AlaGln: 3.406 ± 0.716
9.288AlaArg: 9.288 ± 1.349
6.502AlaSer: 6.502 ± 1.687
3.406AlaThr: 3.406 ± 0.766
7.43AlaVal: 7.43 ± 1.707
3.406AlaTrp: 3.406 ± 0.599
4.644AlaTyr: 4.644 ± 0.368
0.0AlaXaa: 0.0 ± 0.0
Cys
1.858CysAla: 1.858 ± 1.064
0.0CysCys: 0.0 ± 0.0
0.929CysAsp: 0.929 ± 0.557
0.929CysGlu: 0.929 ± 0.361
0.929CysPhe: 0.929 ± 0.473
0.619CysGly: 0.619 ± 0.566
0.31CysHis: 0.31 ± 0.228
0.31CysIle: 0.31 ± 0.292
0.31CysLys: 0.31 ± 0.228
2.477CysLeu: 2.477 ± 0.542
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.929CysPro: 0.929 ± 0.285
0.619CysGln: 0.619 ± 0.456
0.31CysArg: 0.31 ± 0.292
1.858CysSer: 1.858 ± 0.57
0.31CysThr: 0.31 ± 0.292
1.858CysVal: 1.858 ± 0.772
0.619CysTrp: 0.619 ± 0.583
0.619CysTyr: 0.619 ± 0.38
0.0CysXaa: 0.0 ± 0.0
Asp
9.288AspAla: 9.288 ± 1.72
0.619AspCys: 0.619 ± 0.566
4.954AspAsp: 4.954 ± 0.76
2.477AspGlu: 2.477 ± 0.723
1.238AspPhe: 1.238 ± 0.862
4.025AspGly: 4.025 ± 2.261
1.238AspHis: 1.238 ± 0.349
2.167AspIle: 2.167 ± 0.625
0.619AspLys: 0.619 ± 0.456
6.192AspLeu: 6.192 ± 0.98
1.238AspMet: 1.238 ± 0.587
0.619AspAsn: 0.619 ± 0.342
5.263AspPro: 5.263 ± 1.394
1.548AspGln: 1.548 ± 0.349
4.025AspArg: 4.025 ± 1.282
3.096AspSer: 3.096 ± 0.567
3.096AspThr: 3.096 ± 0.426
6.192AspVal: 6.192 ± 1.134
2.786AspTrp: 2.786 ± 0.44
1.858AspTyr: 1.858 ± 0.441
0.0AspXaa: 0.0 ± 0.0
Glu
5.263GluAla: 5.263 ± 1.536
0.929GluCys: 0.929 ± 0.361
1.548GluAsp: 1.548 ± 0.123
2.477GluGlu: 2.477 ± 0.288
2.477GluPhe: 2.477 ± 0.638
5.882GluGly: 5.882 ± 0.95
0.929GluHis: 0.929 ± 0.531
1.548GluIle: 1.548 ± 1.141
1.238GluLys: 1.238 ± 0.656
4.644GluLeu: 4.644 ± 1.052
1.238GluMet: 1.238 ± 0.587
1.858GluAsn: 1.858 ± 0.715
2.786GluPro: 2.786 ± 0.786
0.929GluGln: 0.929 ± 0.459
5.573GluArg: 5.573 ± 1.176
3.715GluSer: 3.715 ± 0.122
1.858GluThr: 1.858 ± 0.749
5.263GluVal: 5.263 ± 0.726
0.929GluTrp: 0.929 ± 0.568
1.548GluTyr: 1.548 ± 0.406
0.0GluXaa: 0.0 ± 0.0
Phe
3.715PheAla: 3.715 ± 1.104
0.31PheCys: 0.31 ± 0.292
3.406PheAsp: 3.406 ± 0.425
1.548PheGlu: 1.548 ± 0.431
2.167PhePhe: 2.167 ± 1.003
2.167PheGly: 2.167 ± 0.956
0.619PheHis: 0.619 ± 0.241
1.238PheIle: 1.238 ± 0.524
0.929PheLys: 0.929 ± 0.685
3.096PheLeu: 3.096 ± 0.695
0.619PheMet: 0.619 ± 0.368
1.238PheAsn: 1.238 ± 0.679
2.786PhePro: 2.786 ± 0.653
0.929PheGln: 0.929 ± 0.285
4.954PheArg: 4.954 ± 1.495
4.334PheSer: 4.334 ± 1.893
1.548PheThr: 1.548 ± 0.431
2.167PheVal: 2.167 ± 1.121
1.548PheTrp: 1.548 ± 0.853
0.619PheTyr: 0.619 ± 0.31
0.0PheXaa: 0.0 ± 0.0
Gly
8.978GlyAla: 8.978 ± 1.385
1.858GlyCys: 1.858 ± 0.861
5.573GlyAsp: 5.573 ± 1.46
4.334GlyGlu: 4.334 ± 1.215
2.477GlyPhe: 2.477 ± 0.613
7.43GlyGly: 7.43 ± 0.649
2.477GlyHis: 2.477 ± 0.723
4.025GlyIle: 4.025 ± 1.135
3.096GlyLys: 3.096 ± 1.245
5.263GlyLeu: 5.263 ± 1.091
2.786GlyMet: 2.786 ± 0.39
1.548GlyAsn: 1.548 ± 0.499
3.715GlyPro: 3.715 ± 0.4
1.238GlyGln: 1.238 ± 0.444
7.74GlyArg: 7.74 ± 1.553
5.573GlySer: 5.573 ± 1.761
2.167GlyThr: 2.167 ± 0.853
8.359GlyVal: 8.359 ± 2.103
2.786GlyTrp: 2.786 ± 0.783
0.929GlyTyr: 0.929 ± 0.557
0.0GlyXaa: 0.0 ± 0.0
His
3.096HisAla: 3.096 ± 0.602
0.0HisCys: 0.0 ± 0.0
1.548HisAsp: 1.548 ± 0.644
1.548HisGlu: 1.548 ± 0.523
0.619HisPhe: 0.619 ± 0.568
0.929HisGly: 0.929 ± 0.375
0.31HisHis: 0.31 ± 0.284
0.929HisIle: 0.929 ± 0.443
0.619HisLys: 0.619 ± 0.456
3.406HisLeu: 3.406 ± 0.453
1.548HisMet: 1.548 ± 0.435
0.31HisAsn: 0.31 ± 0.283
1.858HisPro: 1.858 ± 0.514
0.31HisGln: 0.31 ± 0.284
0.929HisArg: 0.929 ± 0.216
0.619HisSer: 0.619 ± 0.241
0.619HisThr: 0.619 ± 0.568
1.238HisVal: 1.238 ± 0.482
0.31HisTrp: 0.31 ± 0.284
1.548HisTyr: 1.548 ± 1.141
0.0HisXaa: 0.0 ± 0.0
Ile
5.263IleAla: 5.263 ± 1.018
1.548IleCys: 1.548 ± 0.868
2.477IleAsp: 2.477 ± 1.21
2.477IleGlu: 2.477 ± 0.922
1.858IlePhe: 1.858 ± 0.264
3.096IleGly: 3.096 ± 0.307
0.619IleHis: 0.619 ± 0.38
0.929IleIle: 0.929 ± 0.685
0.929IleLys: 0.929 ± 0.375
3.406IleLeu: 3.406 ± 0.425
0.929IleMet: 0.929 ± 0.284
0.619IleAsn: 0.619 ± 0.294
2.477IlePro: 2.477 ± 0.98
0.0IleGln: 0.0 ± 0.0
2.167IleArg: 2.167 ± 0.663
2.477IleSer: 2.477 ± 0.366
1.548IleThr: 1.548 ± 0.732
2.167IleVal: 2.167 ± 0.811
0.619IleTrp: 0.619 ± 0.342
0.619IleTyr: 0.619 ± 0.456
0.0IleXaa: 0.0 ± 0.0
Lys
2.786LysAla: 2.786 ± 1.374
0.0LysCys: 0.0 ± 0.0
0.31LysAsp: 0.31 ± 0.228
1.238LysGlu: 1.238 ± 0.656
0.619LysPhe: 0.619 ± 0.456
1.858LysGly: 1.858 ± 0.774
0.619LysHis: 0.619 ± 0.294
0.929LysIle: 0.929 ± 0.568
0.31LysLys: 0.31 ± 0.228
2.477LysLeu: 2.477 ± 1.454
0.929LysMet: 0.929 ± 0.216
0.0LysAsn: 0.0 ± 0.0
0.0LysPro: 0.0 ± 0.0
2.167LysGln: 2.167 ± 0.349
1.238LysArg: 1.238 ± 0.528
0.929LysSer: 0.929 ± 0.375
1.238LysThr: 1.238 ± 0.913
2.167LysVal: 2.167 ± 0.656
0.929LysTrp: 0.929 ± 0.443
2.167LysTyr: 2.167 ± 1.309
0.0LysXaa: 0.0 ± 0.0
Leu
10.217LeuAla: 10.217 ± 1.41
3.406LeuCys: 3.406 ± 1.444
7.121LeuAsp: 7.121 ± 1.524
5.882LeuGlu: 5.882 ± 1.462
3.096LeuPhe: 3.096 ± 0.625
5.882LeuGly: 5.882 ± 1.092
1.858LeuHis: 1.858 ± 0.683
3.715LeuIle: 3.715 ± 1.251
1.858LeuLys: 1.858 ± 0.918
7.121LeuLeu: 7.121 ± 1.049
1.548LeuMet: 1.548 ± 0.67
2.477LeuAsn: 2.477 ± 0.882
4.954LeuPro: 4.954 ± 1.302
1.548LeuGln: 1.548 ± 0.359
8.978LeuArg: 8.978 ± 0.729
9.288LeuSer: 9.288 ± 1.592
3.715LeuThr: 3.715 ± 1.164
6.811LeuVal: 6.811 ± 1.296
3.715LeuTrp: 3.715 ± 0.363
2.477LeuTyr: 2.477 ± 0.698
0.0LeuXaa: 0.0 ± 0.0
Met
2.477MetAla: 2.477 ± 0.75
0.0MetCys: 0.0 ± 0.0
1.238MetAsp: 1.238 ± 0.409
0.619MetGlu: 0.619 ± 0.34
1.548MetPhe: 1.548 ± 0.587
0.929MetGly: 0.929 ± 0.284
0.619MetHis: 0.619 ± 0.241
0.31MetIle: 0.31 ± 0.228
0.619MetLys: 0.619 ± 0.294
4.025MetLeu: 4.025 ± 0.633
0.619MetMet: 0.619 ± 0.349
0.31MetAsn: 0.31 ± 0.292
0.31MetPro: 0.31 ± 0.284
1.238MetGln: 1.238 ± 0.679
3.715MetArg: 3.715 ± 0.909
1.238MetSer: 1.238 ± 0.444
0.929MetThr: 0.929 ± 0.685
0.929MetVal: 0.929 ± 0.285
0.619MetTrp: 0.619 ± 0.456
1.548MetTyr: 1.548 ± 0.499
0.0MetXaa: 0.0 ± 0.0
Asn
3.096AsnAla: 3.096 ± 0.625
0.0AsnCys: 0.0 ± 0.0
0.929AsnAsp: 0.929 ± 0.531
0.619AsnGlu: 0.619 ± 0.294
1.548AsnPhe: 1.548 ± 0.685
1.238AsnGly: 1.238 ± 0.587
0.31AsnHis: 0.31 ± 0.283
0.31AsnIle: 0.31 ± 0.284
0.619AsnLys: 0.619 ± 0.241
2.167AsnLeu: 2.167 ± 0.625
0.619AsnMet: 0.619 ± 0.34
0.31AsnAsn: 0.31 ± 0.283
1.858AsnPro: 1.858 ± 0.683
0.31AsnGln: 0.31 ± 0.228
1.238AsnArg: 1.238 ± 0.656
1.238AsnSer: 1.238 ± 0.409
0.929AsnThr: 0.929 ± 0.216
2.477AsnVal: 2.477 ± 0.75
0.31AsnTrp: 0.31 ± 0.284
0.929AsnTyr: 0.929 ± 0.559
0.0AsnXaa: 0.0 ± 0.0
Pro
4.954ProAla: 4.954 ± 1.314
0.619ProCys: 0.619 ± 0.38
2.167ProAsp: 2.167 ± 0.903
5.263ProGlu: 5.263 ± 1.337
3.406ProPhe: 3.406 ± 0.747
7.43ProGly: 7.43 ± 2.143
1.238ProHis: 1.238 ± 0.482
1.858ProIle: 1.858 ± 0.494
1.858ProLys: 1.858 ± 0.337
4.334ProLeu: 4.334 ± 1.62
0.619ProMet: 0.619 ± 0.38
1.238ProAsn: 1.238 ± 0.587
6.811ProPro: 6.811 ± 1.819
0.0ProGln: 0.0 ± 0.0
4.954ProArg: 4.954 ± 1.333
2.786ProSer: 2.786 ± 0.39
2.477ProThr: 2.477 ± 0.667
3.715ProVal: 3.715 ± 1.102
0.619ProTrp: 0.619 ± 0.241
0.31ProTyr: 0.31 ± 0.228
0.0ProXaa: 0.0 ± 0.0
Gln
2.477GlnAla: 2.477 ± 0.819
0.0GlnCys: 0.0 ± 0.0
1.858GlnAsp: 1.858 ± 0.42
1.238GlnGlu: 1.238 ± 0.572
1.238GlnPhe: 1.238 ± 0.528
1.548GlnGly: 1.548 ± 0.349
0.929GlnHis: 0.929 ± 0.285
0.619GlnIle: 0.619 ± 0.456
0.31GlnLys: 0.31 ± 0.284
2.167GlnLeu: 2.167 ± 0.656
0.619GlnMet: 0.619 ± 0.294
0.619GlnAsn: 0.619 ± 0.34
0.31GlnPro: 0.31 ± 0.283
0.929GlnGln: 0.929 ± 0.685
1.858GlnArg: 1.858 ± 0.528
1.858GlnSer: 1.858 ± 0.202
1.548GlnThr: 1.548 ± 0.522
2.477GlnVal: 2.477 ± 0.366
1.238GlnTrp: 1.238 ± 0.572
1.238GlnTyr: 1.238 ± 0.913
0.0GlnXaa: 0.0 ± 0.0
Arg
8.05ArgAla: 8.05 ± 2.075
1.238ArgCys: 1.238 ± 0.287
3.715ArgAsp: 3.715 ± 0.755
2.786ArgGlu: 2.786 ± 0.861
5.263ArgPhe: 5.263 ± 1.341
8.669ArgGly: 8.669 ± 1.101
2.786ArgHis: 2.786 ± 0.852
4.954ArgIle: 4.954 ± 0.554
1.238ArgLys: 1.238 ± 0.572
7.121ArgLeu: 7.121 ± 1.059
1.238ArgMet: 1.238 ± 0.482
2.477ArgAsn: 2.477 ± 0.542
5.882ArgPro: 5.882 ± 2.102
2.167ArgGln: 2.167 ± 1.0
7.121ArgArg: 7.121 ± 0.79
7.121ArgSer: 7.121 ± 1.335
4.334ArgThr: 4.334 ± 0.79
5.573ArgVal: 5.573 ± 1.329
0.929ArgTrp: 0.929 ± 0.557
2.477ArgTyr: 2.477 ± 0.167
0.0ArgXaa: 0.0 ± 0.0
Ser
7.121SerAla: 7.121 ± 1.327
0.929SerCys: 0.929 ± 0.603
5.263SerAsp: 5.263 ± 0.456
2.786SerGlu: 2.786 ± 0.452
2.786SerPhe: 2.786 ± 0.499
6.811SerGly: 6.811 ± 1.01
0.619SerHis: 0.619 ± 0.31
2.786SerIle: 2.786 ± 1.05
2.786SerLys: 2.786 ± 1.68
7.74SerLeu: 7.74 ± 1.213
1.548SerMet: 1.548 ± 0.318
0.619SerAsn: 0.619 ± 0.34
2.477SerPro: 2.477 ± 0.848
1.548SerGln: 1.548 ± 0.67
7.43SerArg: 7.43 ± 1.451
5.263SerSer: 5.263 ± 0.537
2.477SerThr: 2.477 ± 0.98
5.263SerVal: 5.263 ± 2.616
1.858SerTrp: 1.858 ± 0.774
3.096SerTyr: 3.096 ± 0.621
0.0SerXaa: 0.0 ± 0.0
Thr
2.786ThrAla: 2.786 ± 1.088
0.619ThrCys: 0.619 ± 0.456
2.786ThrAsp: 2.786 ± 0.721
1.548ThrGlu: 1.548 ± 0.716
1.548ThrPhe: 1.548 ± 0.523
1.858ThrGly: 1.858 ± 0.42
1.238ThrHis: 1.238 ± 0.444
1.238ThrIle: 1.238 ± 0.349
0.929ThrLys: 0.929 ± 0.531
3.715ThrLeu: 3.715 ± 1.234
1.238ThrMet: 1.238 ± 0.386
0.929ThrAsn: 0.929 ± 0.459
2.477ThrPro: 2.477 ± 0.539
2.786ThrGln: 2.786 ± 0.45
3.715ThrArg: 3.715 ± 0.527
3.406ThrSer: 3.406 ± 0.772
3.406ThrThr: 3.406 ± 0.743
4.644ThrVal: 4.644 ± 1.475
1.238ThrTrp: 1.238 ± 0.64
0.929ThrTyr: 0.929 ± 0.375
0.0ThrXaa: 0.0 ± 0.0
Val
5.882ValAla: 5.882 ± 1.376
1.238ValCys: 1.238 ± 0.683
6.192ValAsp: 6.192 ± 2.31
4.644ValGlu: 4.644 ± 1.764
3.406ValPhe: 3.406 ± 1.254
4.954ValGly: 4.954 ± 1.306
2.786ValHis: 2.786 ± 1.096
2.167ValIle: 2.167 ± 0.501
1.238ValLys: 1.238 ± 0.477
8.359ValLeu: 8.359 ± 0.892
2.477ValMet: 2.477 ± 0.47
3.096ValAsn: 3.096 ± 1.413
4.334ValPro: 4.334 ± 1.762
2.167ValGln: 2.167 ± 0.732
6.192ValArg: 6.192 ± 1.157
5.573ValSer: 5.573 ± 1.195
4.644ValThr: 4.644 ± 1.297
4.644ValVal: 4.644 ± 1.193
1.238ValTrp: 1.238 ± 0.64
3.096ValTyr: 3.096 ± 0.433
0.0ValXaa: 0.0 ± 0.0
Trp
3.096TrpAla: 3.096 ± 1.245
0.619TrpCys: 0.619 ± 0.38
2.477TrpAsp: 2.477 ± 0.758
1.858TrpGlu: 1.858 ± 0.494
0.31TrpPhe: 0.31 ± 0.292
2.786TrpGly: 2.786 ± 0.44
0.619TrpHis: 0.619 ± 0.456
0.31TrpIle: 0.31 ± 0.283
0.31TrpLys: 0.31 ± 0.284
4.954TrpLeu: 4.954 ± 0.931
0.31TrpMet: 0.31 ± 0.228
0.619TrpAsn: 0.619 ± 0.241
0.929TrpPro: 0.929 ± 0.531
0.31TrpGln: 0.31 ± 0.283
1.548TrpArg: 1.548 ± 0.431
2.477TrpSer: 2.477 ± 0.799
0.929TrpThr: 0.929 ± 0.685
1.548TrpVal: 1.548 ± 0.522
0.31TrpTrp: 0.31 ± 0.228
0.619TrpTyr: 0.619 ± 0.38
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.025TyrAla: 4.025 ± 1.762
0.619TyrCys: 0.619 ± 0.241
1.548TyrAsp: 1.548 ± 0.517
1.238TyrGlu: 1.238 ± 0.572
1.548TyrPhe: 1.548 ± 0.786
2.477TyrGly: 2.477 ± 0.613
0.929TyrHis: 0.929 ± 0.875
1.238TyrIle: 1.238 ± 0.64
0.619TyrLys: 0.619 ± 0.31
3.406TyrLeu: 3.406 ± 0.179
0.31TyrMet: 0.31 ± 0.284
0.31TyrAsn: 0.31 ± 0.283
1.858TyrPro: 1.858 ± 0.715
0.929TyrGln: 0.929 ± 0.375
2.167TyrArg: 2.167 ± 0.638
1.858TyrSer: 1.858 ± 0.827
1.858TyrThr: 1.858 ± 0.57
3.406TyrVal: 3.406 ± 1.146
0.929TyrTrp: 0.929 ± 0.375
1.238TyrTyr: 1.238 ± 0.409
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3231 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski