Amino acid dipepetide frequency for Aspergillus fumigatus chrysovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.206AlaAla: 4.206 ± 0.498
0.789AlaCys: 0.789 ± 0.458
2.629AlaAsp: 2.629 ± 0.955
3.155AlaGlu: 3.155 ± 0.52
2.103AlaPhe: 2.103 ± 0.381
4.732AlaGly: 4.732 ± 0.506
2.103AlaHis: 2.103 ± 0.33
4.206AlaIle: 4.206 ± 0.625
3.68AlaLys: 3.68 ± 0.743
7.886AlaLeu: 7.886 ± 0.949
1.84AlaMet: 1.84 ± 0.705
3.68AlaAsn: 3.68 ± 0.755
2.366AlaPro: 2.366 ± 0.692
3.417AlaGln: 3.417 ± 0.863
5.258AlaArg: 5.258 ± 0.618
4.995AlaSer: 4.995 ± 0.579
4.469AlaThr: 4.469 ± 0.801
5.521AlaVal: 5.521 ± 1.037
1.84AlaTrp: 1.84 ± 0.521
1.84AlaTyr: 1.84 ± 0.521
0.0AlaXaa: 0.0 ± 0.0
Cys
0.789CysAla: 0.789 ± 0.425
0.526CysCys: 0.526 ± 0.252
0.526CysAsp: 0.526 ± 0.269
0.263CysGlu: 0.263 ± 0.231
0.263CysPhe: 0.263 ± 0.23
1.052CysGly: 1.052 ± 0.388
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.526CysLys: 0.526 ± 0.252
1.577CysLeu: 1.577 ± 0.343
0.526CysMet: 0.526 ± 0.459
0.263CysAsn: 0.263 ± 0.231
0.263CysPro: 0.263 ± 0.23
1.052CysGln: 1.052 ± 0.307
2.103CysArg: 2.103 ± 0.112
1.577CysSer: 1.577 ± 0.511
0.526CysThr: 0.526 ± 0.269
0.789CysVal: 0.789 ± 0.215
0.263CysTrp: 0.263 ± 0.209
1.577CysTyr: 1.577 ± 0.473
0.0CysXaa: 0.0 ± 0.0
Asp
4.995AspAla: 4.995 ± 1.287
0.526AspCys: 0.526 ± 0.252
2.629AspAsp: 2.629 ± 0.527
3.68AspGlu: 3.68 ± 1.181
2.366AspPhe: 2.366 ± 1.215
3.68AspGly: 3.68 ± 0.625
1.314AspHis: 1.314 ± 0.787
3.68AspIle: 3.68 ± 1.323
2.629AspLys: 2.629 ± 0.419
6.572AspLeu: 6.572 ± 1.047
1.314AspMet: 1.314 ± 0.158
0.263AspAsn: 0.263 ± 0.209
2.366AspPro: 2.366 ± 0.432
1.84AspGln: 1.84 ± 0.435
4.469AspArg: 4.469 ± 0.885
2.366AspSer: 2.366 ± 0.742
2.103AspThr: 2.103 ± 0.366
6.572AspVal: 6.572 ± 1.579
2.366AspTrp: 2.366 ± 0.773
3.943AspTyr: 3.943 ± 0.498
0.0AspXaa: 0.0 ± 0.0
Glu
4.995GluAla: 4.995 ± 0.642
1.052GluCys: 1.052 ± 0.056
3.155GluAsp: 3.155 ± 0.322
8.149GluGlu: 8.149 ± 0.653
2.366GluPhe: 2.366 ± 0.614
5.521GluGly: 5.521 ± 0.923
1.577GluHis: 1.577 ± 0.525
3.155GluIle: 3.155 ± 0.92
3.943GluLys: 3.943 ± 0.845
6.046GluLeu: 6.046 ± 0.998
1.84GluMet: 1.84 ± 0.548
1.052GluAsn: 1.052 ± 0.368
1.577GluPro: 1.577 ± 0.684
3.68GluGln: 3.68 ± 0.14
7.098GluArg: 7.098 ± 0.573
3.155GluSer: 3.155 ± 0.819
2.629GluThr: 2.629 ± 1.295
4.995GluVal: 4.995 ± 1.147
1.84GluTrp: 1.84 ± 0.317
1.84GluTyr: 1.84 ± 0.838
0.0GluXaa: 0.0 ± 0.0
Phe
0.526PheAla: 0.526 ± 0.242
0.789PheCys: 0.789 ± 0.627
2.892PheAsp: 2.892 ± 1.092
2.103PheGlu: 2.103 ± 0.33
1.314PhePhe: 1.314 ± 0.787
2.892PheGly: 2.892 ± 0.648
1.052PheHis: 1.052 ± 0.307
1.577PheIle: 1.577 ± 0.535
0.789PheLys: 0.789 ± 0.435
2.892PheLeu: 2.892 ± 0.433
1.052PheMet: 1.052 ± 0.34
1.84PheAsn: 1.84 ± 0.872
0.789PhePro: 0.789 ± 0.405
0.789PheGln: 0.789 ± 0.402
2.629PheArg: 2.629 ± 0.829
2.892PheSer: 2.892 ± 0.806
2.629PheThr: 2.629 ± 0.584
2.892PheVal: 2.892 ± 0.932
0.263PheTrp: 0.263 ± 0.23
0.526PheTyr: 0.526 ± 0.418
0.0PheXaa: 0.0 ± 0.0
Gly
5.521GlyAla: 5.521 ± 1.292
1.314GlyCys: 1.314 ± 0.158
4.469GlyAsp: 4.469 ± 0.855
5.258GlyGlu: 5.258 ± 0.899
2.892GlyPhe: 2.892 ± 0.747
4.469GlyGly: 4.469 ± 0.52
1.577GlyHis: 1.577 ± 0.468
2.103GlyIle: 2.103 ± 0.748
3.155GlyLys: 3.155 ± 0.894
6.835GlyLeu: 6.835 ± 0.507
2.366GlyMet: 2.366 ± 0.988
2.103GlyAsn: 2.103 ± 0.677
2.629GlyPro: 2.629 ± 0.631
3.417GlyGln: 3.417 ± 0.551
5.258GlyArg: 5.258 ± 0.747
3.943GlySer: 3.943 ± 0.621
1.577GlyThr: 1.577 ± 0.204
6.046GlyVal: 6.046 ± 1.398
2.103GlyTrp: 2.103 ± 0.398
1.314GlyTyr: 1.314 ± 0.715
0.0GlyXaa: 0.0 ± 0.0
His
0.263HisAla: 0.263 ± 0.23
0.263HisCys: 0.263 ± 0.231
1.577HisAsp: 1.577 ± 0.546
1.84HisGlu: 1.84 ± 0.556
0.789HisPhe: 0.789 ± 0.215
2.103HisGly: 2.103 ± 0.366
0.263HisHis: 0.263 ± 0.205
0.526HisIle: 0.526 ± 0.269
1.577HisLys: 1.577 ± 0.593
1.577HisLeu: 1.577 ± 0.528
1.052HisMet: 1.052 ± 0.584
1.84HisAsn: 1.84 ± 0.371
0.0HisPro: 0.0 ± 0.0
0.789HisGln: 0.789 ± 0.614
0.789HisArg: 0.789 ± 0.208
1.84HisSer: 1.84 ± 0.711
1.84HisThr: 1.84 ± 0.617
1.84HisVal: 1.84 ± 0.924
1.052HisTrp: 1.052 ± 0.308
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.68IleAla: 3.68 ± 0.435
0.526IleCys: 0.526 ± 0.244
2.892IleAsp: 2.892 ± 1.086
3.155IleGlu: 3.155 ± 0.168
1.314IlePhe: 1.314 ± 0.797
3.417IleGly: 3.417 ± 0.988
2.103IleHis: 2.103 ± 0.421
2.103IleIle: 2.103 ± 0.421
1.314IleLys: 1.314 ± 0.553
3.155IleLeu: 3.155 ± 0.72
2.103IleMet: 2.103 ± 0.719
2.103IleAsn: 2.103 ± 0.112
2.629IlePro: 2.629 ± 0.977
1.577IleGln: 1.577 ± 0.204
1.314IleArg: 1.314 ± 0.6
3.68IleSer: 3.68 ± 1.062
2.366IleThr: 2.366 ± 1.096
3.68IleVal: 3.68 ± 1.115
1.052IleTrp: 1.052 ± 0.544
0.789IleTyr: 0.789 ± 0.208
0.0IleXaa: 0.0 ± 0.0
Lys
4.206LysAla: 4.206 ± 0.523
0.789LysCys: 0.789 ± 0.689
3.943LysAsp: 3.943 ± 0.226
4.206LysGlu: 4.206 ± 0.819
1.577LysPhe: 1.577 ± 0.199
2.366LysGly: 2.366 ± 0.341
1.314LysHis: 1.314 ± 0.158
1.577LysIle: 1.577 ± 0.586
2.366LysLys: 2.366 ± 1.189
5.258LysLeu: 5.258 ± 0.541
2.103LysMet: 2.103 ± 0.421
0.263LysAsn: 0.263 ± 0.231
1.84LysPro: 1.84 ± 0.523
1.577LysGln: 1.577 ± 0.468
2.892LysArg: 2.892 ± 1.087
2.366LysSer: 2.366 ± 0.584
1.577LysThr: 1.577 ± 0.697
4.206LysVal: 4.206 ± 0.171
0.526LysTrp: 0.526 ± 0.418
2.366LysTyr: 2.366 ± 1.064
0.0LysXaa: 0.0 ± 0.0
Leu
8.675LeuAla: 8.675 ± 0.27
1.314LeuCys: 1.314 ± 0.415
4.732LeuAsp: 4.732 ± 1.186
5.783LeuGlu: 5.783 ± 0.559
4.206LeuPhe: 4.206 ± 1.028
5.258LeuGly: 5.258 ± 1.46
1.314LeuHis: 1.314 ± 0.158
3.155LeuIle: 3.155 ± 1.069
3.417LeuLys: 3.417 ± 0.259
8.149LeuLeu: 8.149 ± 1.229
2.103LeuMet: 2.103 ± 0.32
4.732LeuAsn: 4.732 ± 0.586
2.892LeuPro: 2.892 ± 0.182
2.892LeuGln: 2.892 ± 0.747
8.675LeuArg: 8.675 ± 1.502
9.201LeuSer: 9.201 ± 2.822
4.206LeuThr: 4.206 ± 0.474
8.149LeuVal: 8.149 ± 1.536
0.526LeuTrp: 0.526 ± 0.242
4.469LeuTyr: 4.469 ± 0.397
0.0LeuXaa: 0.0 ± 0.0
Met
2.366MetAla: 2.366 ± 0.356
0.789MetCys: 0.789 ± 0.425
2.629MetAsp: 2.629 ± 0.497
1.577MetGlu: 1.577 ± 0.496
0.526MetPhe: 0.526 ± 0.418
1.84MetGly: 1.84 ± 0.583
0.789MetHis: 0.789 ± 0.693
0.789MetIle: 0.789 ± 0.208
1.314MetLys: 1.314 ± 0.365
2.366MetLeu: 2.366 ± 0.765
1.577MetMet: 1.577 ± 0.586
1.314MetAsn: 1.314 ± 0.158
1.577MetPro: 1.577 ± 0.367
0.526MetGln: 0.526 ± 0.409
2.629MetArg: 2.629 ± 0.659
3.417MetSer: 3.417 ± 1.539
1.052MetThr: 1.052 ± 0.056
2.366MetVal: 2.366 ± 0.828
0.789MetTrp: 0.789 ± 0.689
2.103MetTyr: 2.103 ± 0.549
0.0MetXaa: 0.0 ± 0.0
Asn
2.629AsnAla: 2.629 ± 0.33
0.0AsnCys: 0.0 ± 0.0
1.84AsnAsp: 1.84 ± 0.726
2.629AsnGlu: 2.629 ± 1.488
1.577AsnPhe: 1.577 ± 0.546
1.052AsnGly: 1.052 ± 0.341
0.789AsnHis: 0.789 ± 0.385
2.103AsnIle: 2.103 ± 0.661
2.366AsnLys: 2.366 ± 0.787
2.366AsnLeu: 2.366 ± 0.414
1.052AsnMet: 1.052 ± 0.504
2.629AsnAsn: 2.629 ± 0.446
1.577AsnPro: 1.577 ± 0.993
1.052AsnGln: 1.052 ± 0.597
3.155AsnArg: 3.155 ± 0.328
3.417AsnSer: 3.417 ± 1.343
2.366AsnThr: 2.366 ± 0.743
2.103AsnVal: 2.103 ± 0.417
0.789AsnTrp: 0.789 ± 0.474
0.526AsnTyr: 0.526 ± 0.459
0.0AsnXaa: 0.0 ± 0.0
Pro
2.892ProAla: 2.892 ± 1.239
0.263ProCys: 0.263 ± 0.209
3.943ProAsp: 3.943 ± 0.493
2.103ProGlu: 2.103 ± 0.381
0.526ProPhe: 0.526 ± 0.217
2.892ProGly: 2.892 ± 0.37
0.526ProHis: 0.526 ± 0.242
1.052ProIle: 1.052 ± 0.648
2.103ProLys: 2.103 ± 0.417
2.366ProLeu: 2.366 ± 0.973
0.789ProMet: 0.789 ± 0.418
1.314ProAsn: 1.314 ± 0.381
3.155ProPro: 3.155 ± 0.644
3.417ProGln: 3.417 ± 0.784
1.052ProArg: 1.052 ± 0.341
2.892ProSer: 2.892 ± 0.651
3.943ProThr: 3.943 ± 0.498
3.155ProVal: 3.155 ± 0.522
0.263ProTrp: 0.263 ± 0.209
0.526ProTyr: 0.526 ± 0.292
0.0ProXaa: 0.0 ± 0.0
Gln
3.155GlnAla: 3.155 ± 1.249
0.263GlnCys: 0.263 ± 0.23
0.789GlnAsp: 0.789 ± 0.262
1.84GlnGlu: 1.84 ± 0.371
2.366GlnPhe: 2.366 ± 0.993
3.417GlnGly: 3.417 ± 0.874
1.052GlnHis: 1.052 ± 0.648
3.68GlnIle: 3.68 ± 0.475
1.577GlnLys: 1.577 ± 0.565
3.417GlnLeu: 3.417 ± 0.379
2.366GlnMet: 2.366 ± 0.301
0.789GlnAsn: 0.789 ± 0.2
0.526GlnPro: 0.526 ± 0.269
1.052GlnGln: 1.052 ± 0.368
1.84GlnArg: 1.84 ± 0.435
3.417GlnSer: 3.417 ± 0.838
1.577GlnThr: 1.577 ± 0.529
3.943GlnVal: 3.943 ± 1.033
0.526GlnTrp: 0.526 ± 0.418
1.052GlnTyr: 1.052 ± 0.585
0.0GlnXaa: 0.0 ± 0.0
Arg
4.995ArgAla: 4.995 ± 0.788
1.314ArgCys: 1.314 ± 0.363
5.521ArgAsp: 5.521 ± 1.048
6.572ArgGlu: 6.572 ± 1.838
1.577ArgPhe: 1.577 ± 0.523
5.258ArgGly: 5.258 ± 1.163
1.577ArgHis: 1.577 ± 0.367
3.417ArgIle: 3.417 ± 0.445
4.469ArgLys: 4.469 ± 0.277
9.201ArgLeu: 9.201 ± 2.009
1.052ArgMet: 1.052 ± 0.639
1.84ArgAsn: 1.84 ± 0.614
2.892ArgPro: 2.892 ± 0.897
2.892ArgGln: 2.892 ± 0.821
6.572ArgArg: 6.572 ± 2.061
4.469ArgSer: 4.469 ± 0.325
2.103ArgThr: 2.103 ± 0.398
5.521ArgVal: 5.521 ± 0.415
1.052ArgTrp: 1.052 ± 0.43
3.155ArgTyr: 3.155 ± 0.595
0.0ArgXaa: 0.0 ± 0.0
Ser
2.892SerAla: 2.892 ± 0.747
2.103SerCys: 2.103 ± 0.96
4.469SerAsp: 4.469 ± 0.898
4.469SerGlu: 4.469 ± 0.718
1.84SerPhe: 1.84 ± 0.351
6.572SerGly: 6.572 ± 1.686
1.84SerHis: 1.84 ± 0.406
2.103SerIle: 2.103 ± 0.398
4.469SerLys: 4.469 ± 1.35
6.835SerLeu: 6.835 ± 1.677
3.68SerMet: 3.68 ± 1.208
2.366SerAsn: 2.366 ± 0.584
3.943SerPro: 3.943 ± 0.629
1.84SerGln: 1.84 ± 0.705
5.521SerArg: 5.521 ± 0.75
6.046SerSer: 6.046 ± 1.415
3.943SerThr: 3.943 ± 0.9
3.943SerVal: 3.943 ± 0.467
0.526SerTrp: 0.526 ± 0.269
3.68SerTyr: 3.68 ± 1.254
0.0SerXaa: 0.0 ± 0.0
Thr
3.68ThrAla: 3.68 ± 0.435
0.263ThrCys: 0.263 ± 0.231
2.366ThrAsp: 2.366 ± 0.508
3.943ThrGlu: 3.943 ± 0.622
1.84ThrPhe: 1.84 ± 0.575
2.892ThrGly: 2.892 ± 1.004
0.263ThrHis: 0.263 ± 0.205
2.892ThrIle: 2.892 ± 0.642
2.892ThrLys: 2.892 ± 0.553
4.995ThrLeu: 4.995 ± 0.611
2.103ThrMet: 2.103 ± 0.517
1.052ThrAsn: 1.052 ± 0.589
1.84ThrPro: 1.84 ± 0.764
2.103ThrGln: 2.103 ± 0.606
4.206ThrArg: 4.206 ± 0.98
2.366ThrSer: 2.366 ± 1.078
2.629ThrThr: 2.629 ± 0.318
4.469ThrVal: 4.469 ± 1.029
0.526ThrTrp: 0.526 ± 0.418
1.577ThrTyr: 1.577 ± 0.535
0.0ThrXaa: 0.0 ± 0.0
Val
6.046ValAla: 6.046 ± 0.949
1.052ValCys: 1.052 ± 0.357
5.258ValAsp: 5.258 ± 1.678
4.995ValGlu: 4.995 ± 0.716
2.629ValPhe: 2.629 ± 1.3
4.732ValGly: 4.732 ± 1.199
2.103ValHis: 2.103 ± 1.023
5.783ValIle: 5.783 ± 0.593
2.103ValLys: 2.103 ± 0.824
6.835ValLeu: 6.835 ± 1.916
0.526ValMet: 0.526 ± 0.292
3.155ValAsn: 3.155 ± 1.019
4.995ValPro: 4.995 ± 0.833
1.84ValGln: 1.84 ± 0.315
8.149ValArg: 8.149 ± 0.645
6.572ValSer: 6.572 ± 0.922
4.995ValThr: 4.995 ± 1.606
4.206ValVal: 4.206 ± 1.385
1.84ValTrp: 1.84 ± 0.238
1.577ValTyr: 1.577 ± 0.199
0.0ValXaa: 0.0 ± 0.0
Trp
1.052TrpAla: 1.052 ± 0.433
0.263TrpCys: 0.263 ± 0.209
1.052TrpAsp: 1.052 ± 0.056
1.314TrpGlu: 1.314 ± 0.635
0.526TrpPhe: 0.526 ± 0.418
0.789TrpGly: 0.789 ± 0.689
0.263TrpHis: 0.263 ± 0.209
0.789TrpIle: 0.789 ± 0.215
1.052TrpLys: 1.052 ± 0.544
2.629TrpLeu: 2.629 ± 0.568
0.789TrpMet: 0.789 ± 0.472
1.052TrpAsn: 1.052 ± 0.056
0.263TrpPro: 0.263 ± 0.231
0.789TrpGln: 0.789 ± 0.396
0.526TrpArg: 0.526 ± 0.418
2.103TrpSer: 2.103 ± 0.72
0.526TrpThr: 0.526 ± 0.292
1.052TrpVal: 1.052 ± 0.308
0.0TrpTrp: 0.0 ± 0.0
1.314TrpTyr: 1.314 ± 0.268
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.155TyrAla: 3.155 ± 0.279
0.263TyrCys: 0.263 ± 0.23
2.366TyrAsp: 2.366 ± 0.342
2.629TyrGlu: 2.629 ± 0.972
0.526TyrPhe: 0.526 ± 0.409
3.417TyrGly: 3.417 ± 0.713
0.0TyrHis: 0.0 ± 0.0
0.263TyrIle: 0.263 ± 0.231
1.577TyrLys: 1.577 ± 0.731
2.892TyrLeu: 2.892 ± 0.182
1.84TyrMet: 1.84 ± 0.45
2.366TyrAsn: 2.366 ± 0.9
1.052TyrPro: 1.052 ± 0.368
2.366TyrGln: 2.366 ± 0.342
1.314TyrArg: 1.314 ± 0.264
2.366TyrSer: 2.366 ± 0.742
1.84TyrThr: 1.84 ± 0.464
3.943TyrVal: 3.943 ± 1.449
0.0TyrTrp: 0.0 ± 0.0
0.789TyrTyr: 0.789 ± 0.402
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3805 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski