Amino acid dipeptide frequency for Mothra virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
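As an illustration of the idea, the following minimal Python sketch counts overlapping dipeptides in a set of protein sequences and reports them in per mille. It is not the code used by Proteome-pI. The function name `dipeptide_frequencies`, the one-letter sequence alphabet, the toy sequences, and the choice of normalizing by the total number of dipeptide windows are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies (in per mille) for a list of sequences.

    Assumption: frequencies are normalized by the total number of
    overlapping two-residue windows; the database may normalize differently.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy one-letter sequences (hypothetical, not from the Mothra virus proteome).
freqs = dipeptide_frequencies(["MKLLA", "MKA"])
```

Here the two toy sequences contain six dipeptide windows in total, two of which are `MK`, so `freqs["MK"]` is one third of 1000‰.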

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
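The arithmetic behind the expected value can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and the plain comparison against the uniform expectation is an assumption: the page does not state the exact threshold used for the red and blue marking. The three observed values are taken from the table.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a plain greater-than / less-than comparison; the
    coloring rule actually used for the table is not documented here.
    """
    if observed_per_mille > expected:
        return "over"    # would be marked in red
    if observed_per_mille < expected:
        return "under"   # would be marked in blue
    return "as expected"


expected_20 = expected_per_mille()     # 400 dipeptides: 2.5 per mille (0.25%)
expected_21 = expected_per_mille(21)   # 441 dipeptides when Xaa is included

# Observed values from the table (per mille).
labels = {
    "GluLeu": classify(10.158, expected_20),
    "AlaAla": classify(5.677, expected_20),
    "CysAla": classify(0.0, expected_20),
}
```

Note that a uniform expectation ignores the unequal abundance of single amino acids; it is only the simple baseline described in the text.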

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
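A short sketch of this unit conversion, with hypothetical helper names, may help when comparing table values with the 0.25% expectation:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# 2.5 per mille corresponds to a fraction of 0.0025, i.e. 0.25%.
fraction = per_mille_to_fraction(2.5)
percent = per_mille_to_percent(2.5)
```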

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.677AlaAla: 5.677 ± 3.262
2.39AlaCys: 2.39 ± 0.883
5.079AlaAsp: 5.079 ± 2.25
2.39AlaGlu: 2.39 ± 0.835
1.793AlaPhe: 1.793 ± 0.551
3.287AlaGly: 3.287 ± 1.957
1.195AlaHis: 1.195 ± 0.452
3.884AlaIle: 3.884 ± 1.242
4.78AlaLys: 4.78 ± 0.439
5.378AlaLeu: 5.378 ± 1.617
2.689AlaMet: 2.689 ± 1.237
3.884AlaAsn: 3.884 ± 1.479
1.494AlaPro: 1.494 ± 0.725
1.793AlaGln: 1.793 ± 0.978
3.585AlaArg: 3.585 ± 1.546
5.079AlaSer: 5.079 ± 1.56
3.884AlaThr: 3.884 ± 0.714
4.482AlaVal: 4.482 ± 0.655
0.896AlaTrp: 0.896 ± 0.489
2.689AlaTyr: 2.689 ± 0.793
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.598CysCys: 0.598 ± 0.313
0.896CysAsp: 0.896 ± 0.718
0.598CysGlu: 0.598 ± 0.827
0.0CysPhe: 0.0 ± 0.0
0.598CysGly: 0.598 ± 0.313
0.896CysHis: 0.896 ± 0.718
0.598CysIle: 0.598 ± 0.314
2.091CysLys: 2.091 ± 0.824
1.195CysLeu: 1.195 ± 0.854
0.299CysMet: 0.299 ± 0.414
0.598CysAsn: 0.598 ± 0.827
0.896CysPro: 0.896 ± 0.47
0.896CysGln: 0.896 ± 0.754
0.598CysArg: 0.598 ± 0.314
2.689CysSer: 2.689 ± 1.456
1.195CysThr: 1.195 ± 1.129
0.0CysVal: 0.0 ± 0.0
0.299CysTrp: 0.299 ± 0.414
0.598CysTyr: 0.598 ± 0.313
0.0CysXaa: 0.0 ± 0.0
Asp
4.183AspAla: 4.183 ± 1.794
0.299AspCys: 0.299 ± 0.414
3.585AspAsp: 3.585 ± 0.663
4.183AspGlu: 4.183 ± 1.173
2.39AspPhe: 2.39 ± 0.718
3.287AspGly: 3.287 ± 1.288
0.598AspHis: 0.598 ± 0.313
3.287AspIle: 3.287 ± 0.974
4.183AspLys: 4.183 ± 1.149
4.78AspLeu: 4.78 ± 1.367
1.793AspMet: 1.793 ± 0.94
0.0AspAsn: 0.0 ± 0.0
3.585AspPro: 3.585 ± 1.441
2.39AspGln: 2.39 ± 0.838
2.39AspArg: 2.39 ± 0.838
2.988AspSer: 2.988 ± 0.41
3.585AspThr: 3.585 ± 0.612
2.988AspVal: 2.988 ± 0.844
0.896AspTrp: 0.896 ± 0.718
2.988AspTyr: 2.988 ± 1.14
0.0AspXaa: 0.0 ± 0.0
Glu
3.884GluAla: 3.884 ± 1.287
0.598GluCys: 0.598 ± 0.892
4.482GluAsp: 4.482 ± 1.938
5.677GluGlu: 5.677 ± 2.381
1.195GluPhe: 1.195 ± 0.319
4.482GluGly: 4.482 ± 0.94
2.39GluHis: 2.39 ± 0.761
1.793GluIle: 1.793 ± 0.549
2.39GluLys: 2.39 ± 0.835
10.158GluLeu: 10.158 ± 2.603
3.884GluMet: 3.884 ± 1.03
1.793GluAsn: 1.793 ± 0.549
0.299GluPro: 0.299 ± 0.157
2.988GluGln: 2.988 ± 1.405
2.689GluArg: 2.689 ± 0.373
5.677GluSer: 5.677 ± 1.454
2.39GluThr: 2.39 ± 0.467
6.573GluVal: 6.573 ± 1.462
0.0GluTrp: 0.0 ± 0.0
2.091GluTyr: 2.091 ± 1.097
0.0GluXaa: 0.0 ± 0.0
Phe
1.494PheAla: 1.494 ± 0.989
0.896PheCys: 0.896 ± 0.718
2.091PheAsp: 2.091 ± 0.8
3.287PheGlu: 3.287 ± 0.686
2.39PhePhe: 2.39 ± 0.718
2.091PheGly: 2.091 ± 1.515
0.896PheHis: 0.896 ± 0.47
1.195PheIle: 1.195 ± 0.854
2.689PheLys: 2.689 ± 0.782
3.287PheLeu: 3.287 ± 0.497
1.793PheMet: 1.793 ± 0.551
2.988PheAsn: 2.988 ± 1.307
2.091PhePro: 2.091 ± 0.941
1.793PheGln: 1.793 ± 0.94
2.988PheArg: 2.988 ± 0.844
3.884PheSer: 3.884 ± 0.743
2.091PheThr: 2.091 ± 0.477
1.793PheVal: 1.793 ± 0.588
0.598PheTrp: 0.598 ± 0.313
1.195PheTyr: 1.195 ± 0.864
0.0PheXaa: 0.0 ± 0.0
Gly
3.585GlyAla: 3.585 ± 2.422
0.896GlyCys: 0.896 ± 1.253
2.091GlyAsp: 2.091 ± 0.575
2.39GlyGlu: 2.39 ± 1.445
3.287GlyPhe: 3.287 ± 1.035
4.183GlyGly: 4.183 ± 1.794
0.299GlyHis: 0.299 ± 0.157
4.482GlyIle: 4.482 ± 2.147
3.287GlyLys: 3.287 ± 2.189
4.482GlyLeu: 4.482 ± 1.529
1.494GlyMet: 1.494 ± 0.584
2.689GlyAsn: 2.689 ± 1.124
2.988GlyPro: 2.988 ± 0.726
1.195GlyGln: 1.195 ± 0.515
4.183GlyArg: 4.183 ± 1.371
3.287GlySer: 3.287 ± 0.8
3.884GlyThr: 3.884 ± 1.226
4.482GlyVal: 4.482 ± 0.655
0.896GlyTrp: 0.896 ± 0.47
1.793GlyTyr: 1.793 ± 0.551
0.0GlyXaa: 0.0 ± 0.0
His
2.091HisAla: 2.091 ± 0.685
0.0HisCys: 0.0 ± 0.0
0.896HisAsp: 0.896 ± 0.275
2.39HisGlu: 2.39 ± 0.835
1.494HisPhe: 1.494 ± 0.784
0.598HisGly: 0.598 ± 0.313
0.598HisHis: 0.598 ± 0.313
1.793HisIle: 1.793 ± 0.943
1.494HisLys: 1.494 ± 0.421
2.091HisLeu: 2.091 ± 0.689
0.0HisMet: 0.0 ± 0.0
1.195HisAsn: 1.195 ± 0.319
1.494HisPro: 1.494 ± 0.367
0.896HisGln: 0.896 ± 0.718
0.896HisArg: 0.896 ± 0.47
2.091HisSer: 2.091 ± 0.993
1.494HisThr: 1.494 ± 1.001
1.195HisVal: 1.195 ± 0.627
0.299HisTrp: 0.299 ± 0.157
2.39HisTyr: 2.39 ± 0.835
0.0HisXaa: 0.0 ± 0.0
Ile
5.378IleAla: 5.378 ± 1.009
0.598IleCys: 0.598 ± 0.314
3.287IleAsp: 3.287 ± 1.003
5.079IleGlu: 5.079 ± 1.375
2.988IlePhe: 2.988 ± 0.812
1.195IleGly: 1.195 ± 0.627
1.793IleHis: 1.793 ± 0.336
3.287IleIle: 3.287 ± 1.467
3.287IleLys: 3.287 ± 0.446
4.183IleLeu: 4.183 ± 0.814
2.39IleMet: 2.39 ± 0.588
3.287IleAsn: 3.287 ± 0.888
1.195IlePro: 1.195 ± 0.319
1.494IleGln: 1.494 ± 0.421
2.988IleArg: 2.988 ± 0.545
4.78IleSer: 4.78 ± 0.857
1.195IleThr: 1.195 ± 0.515
4.482IleVal: 4.482 ± 0.833
0.299IleTrp: 0.299 ± 0.414
1.494IleTyr: 1.494 ± 1.028
0.0IleXaa: 0.0 ± 0.0
Lys
5.677LysAla: 5.677 ± 1.441
1.195LysCys: 1.195 ± 1.129
2.988LysAsp: 2.988 ± 0.842
3.287LysGlu: 3.287 ± 1.29
1.793LysPhe: 1.793 ± 0.588
2.39LysGly: 2.39 ± 1.445
1.793LysHis: 1.793 ± 0.94
2.988LysIle: 2.988 ± 0.733
5.079LysLys: 5.079 ± 1.289
3.884LysLeu: 3.884 ± 0.387
2.39LysMet: 2.39 ± 0.827
2.988LysAsn: 2.988 ± 1.567
1.793LysPro: 1.793 ± 2.098
1.793LysGln: 1.793 ± 0.94
2.988LysArg: 2.988 ± 2.26
4.183LysSer: 4.183 ± 1.006
5.677LysThr: 5.677 ± 1.65
3.287LysVal: 3.287 ± 0.758
0.896LysTrp: 0.896 ± 0.47
1.494LysTyr: 1.494 ± 0.57
0.0LysXaa: 0.0 ± 0.0
Leu
5.677LeuAla: 5.677 ± 0.94
2.091LeuCys: 2.091 ± 0.578
3.585LeuAsp: 3.585 ± 0.387
6.573LeuGlu: 6.573 ± 2.328
3.287LeuPhe: 3.287 ± 1.281
6.573LeuGly: 6.573 ± 1.753
4.183LeuHis: 4.183 ± 0.667
5.677LeuIle: 5.677 ± 1.251
2.988LeuLys: 2.988 ± 0.94
8.067LeuLeu: 8.067 ± 1.359
3.585LeuMet: 3.585 ± 0.749
4.482LeuAsn: 4.482 ± 0.58
5.079LeuPro: 5.079 ± 0.981
4.482LeuGln: 4.482 ± 2.158
6.573LeuArg: 6.573 ± 1.717
5.976LeuSer: 5.976 ± 0.687
8.067LeuThr: 8.067 ± 1.299
4.183LeuVal: 4.183 ± 0.446
0.598LeuTrp: 0.598 ± 0.313
1.793LeuTyr: 1.793 ± 0.94
0.0LeuXaa: 0.0 ± 0.0
Met
3.287MetAla: 3.287 ± 0.859
0.299MetCys: 0.299 ± 0.157
1.494MetAsp: 1.494 ± 0.584
1.494MetGlu: 1.494 ± 0.784
1.195MetPhe: 1.195 ± 0.515
0.896MetGly: 0.896 ± 0.569
0.896MetHis: 0.896 ± 0.275
2.39MetIle: 2.39 ± 0.761
2.689MetLys: 2.689 ± 0.544
2.091MetLeu: 2.091 ± 0.667
1.195MetMet: 1.195 ± 0.627
1.195MetAsn: 1.195 ± 0.319
1.494MetPro: 1.494 ± 1.536
1.195MetGln: 1.195 ± 0.319
2.091MetArg: 2.091 ± 0.376
5.378MetSer: 5.378 ± 0.95
2.39MetThr: 2.39 ± 0.827
1.494MetVal: 1.494 ± 0.584
0.896MetTrp: 0.896 ± 0.47
0.299MetTyr: 0.299 ± 0.157
0.0MetXaa: 0.0 ± 0.0
Asn
1.494AsnAla: 1.494 ± 0.584
0.299AsnCys: 0.299 ± 0.414
2.39AsnAsp: 2.39 ± 0.827
3.585AsnGlu: 3.585 ± 0.562
3.585AsnPhe: 3.585 ± 1.009
0.896AsnGly: 0.896 ± 0.569
0.598AsnHis: 0.598 ± 0.314
0.598AsnIle: 0.598 ± 0.313
2.091AsnLys: 2.091 ± 0.685
3.585AsnLeu: 3.585 ± 1.992
1.494AsnMet: 1.494 ± 0.584
1.793AsnAsn: 1.793 ± 1.535
1.793AsnPro: 1.793 ± 0.336
2.689AsnGln: 2.689 ± 0.587
1.793AsnArg: 1.793 ± 0.549
3.884AsnSer: 3.884 ± 0.589
4.183AsnThr: 4.183 ± 1.764
3.585AsnVal: 3.585 ± 0.612
1.195AsnTrp: 1.195 ± 0.452
0.299AsnTyr: 0.299 ± 0.578
0.0AsnXaa: 0.0 ± 0.0
Pro
1.793ProAla: 1.793 ± 0.336
0.598ProCys: 0.598 ± 0.702
4.183ProAsp: 4.183 ± 1.149
5.378ProGlu: 5.378 ± 1.41
2.091ProPhe: 2.091 ± 0.824
2.39ProGly: 2.39 ± 0.398
0.598ProHis: 0.598 ± 0.313
2.689ProIle: 2.689 ± 0.826
2.091ProLys: 2.091 ± 1.668
4.78ProLeu: 4.78 ± 0.396
0.598ProMet: 0.598 ± 1.703
2.988ProAsn: 2.988 ± 1.307
2.988ProPro: 2.988 ± 4.94
1.494ProGln: 1.494 ± 0.367
1.793ProArg: 1.793 ± 0.336
4.183ProSer: 4.183 ± 1.635
3.884ProThr: 3.884 ± 0.589
2.39ProVal: 2.39 ± 0.8
0.598ProTrp: 0.598 ± 0.313
0.598ProTyr: 0.598 ± 0.313
0.0ProXaa: 0.0 ± 0.0
Gln
0.598GlnAla: 0.598 ± 0.313
0.598GlnCys: 0.598 ± 0.827
1.494GlnAsp: 1.494 ± 0.725
0.896GlnGlu: 0.896 ± 0.275
2.39GlnPhe: 2.39 ± 0.398
4.183GlnGly: 4.183 ± 1.258
1.494GlnHis: 1.494 ± 0.421
3.884GlnIle: 3.884 ± 0.904
1.195GlnLys: 1.195 ± 0.751
2.091GlnLeu: 2.091 ± 0.847
1.793GlnMet: 1.793 ± 0.549
1.195GlnAsn: 1.195 ± 0.854
2.689GlnPro: 2.689 ± 0.782
0.299GlnGln: 0.299 ± 0.414
2.39GlnArg: 2.39 ± 0.467
1.793GlnSer: 1.793 ± 0.549
2.689GlnThr: 2.689 ± 1.494
1.195GlnVal: 1.195 ± 0.515
0.299GlnTrp: 0.299 ± 0.157
1.494GlnTyr: 1.494 ± 0.716
0.0GlnXaa: 0.0 ± 0.0
Arg
5.079ArgAla: 5.079 ± 0.871
0.0ArgCys: 0.0 ± 0.0
1.195ArgAsp: 1.195 ± 0.627
2.689ArgGlu: 2.689 ± 0.587
2.988ArgPhe: 2.988 ± 0.812
4.183ArgGly: 4.183 ± 0.446
0.0ArgHis: 0.0 ± 0.0
2.689ArgIle: 2.689 ± 0.793
1.494ArgLys: 1.494 ± 0.584
6.274ArgLeu: 6.274 ± 1.764
1.494ArgMet: 1.494 ± 1.214
2.39ArgAsn: 2.39 ± 1.474
2.689ArgPro: 2.689 ± 0.727
1.793ArgGln: 1.793 ± 0.771
2.689ArgArg: 2.689 ± 0.856
5.677ArgSer: 5.677 ± 2.024
2.988ArgThr: 2.988 ± 1.136
4.482ArgVal: 4.482 ± 0.546
0.896ArgTrp: 0.896 ± 0.47
2.689ArgTyr: 2.689 ± 0.666
0.0ArgXaa: 0.0 ± 0.0
Ser
5.677SerAla: 5.677 ± 0.94
2.39SerCys: 2.39 ± 1.746
5.378SerAsp: 5.378 ± 1.473
4.78SerGlu: 4.78 ± 1.176
4.183SerPhe: 4.183 ± 1.882
4.78SerGly: 4.78 ± 0.396
1.494SerHis: 1.494 ± 0.784
2.988SerIle: 2.988 ± 1.307
5.976SerLys: 5.976 ± 1.466
7.768SerLeu: 7.768 ± 1.706
1.793SerMet: 1.793 ± 0.688
2.988SerAsn: 2.988 ± 1.14
5.378SerPro: 5.378 ± 1.597
2.091SerGln: 2.091 ± 0.948
5.079SerArg: 5.079 ± 1.049
10.158SerSer: 10.158 ± 3.492
4.78SerThr: 4.78 ± 1.155
5.976SerVal: 5.976 ± 1.236
1.195SerTrp: 1.195 ± 0.319
0.896SerTyr: 0.896 ± 0.718
0.0SerXaa: 0.0 ± 0.0
Thr
3.287ThrAla: 3.287 ± 1.281
0.299ThrCys: 0.299 ± 0.414
3.585ThrAsp: 3.585 ± 1.177
4.482ThrGlu: 4.482 ± 1.243
2.689ThrPhe: 2.689 ± 0.373
5.079ThrGly: 5.079 ± 1.615
2.689ThrHis: 2.689 ± 0.984
2.988ThrIle: 2.988 ± 1.136
2.091ThrLys: 2.091 ± 0.879
7.469ThrLeu: 7.469 ± 1.425
1.793ThrMet: 1.793 ± 0.683
2.091ThrAsn: 2.091 ± 0.477
3.884ThrPro: 3.884 ± 0.853
1.793ThrGln: 1.793 ± 0.94
3.287ThrArg: 3.287 ± 0.888
5.976ThrSer: 5.976 ± 1.789
2.988ThrThr: 2.988 ± 0.41
3.287ThrVal: 3.287 ± 1.261
0.299ThrTrp: 0.299 ± 0.157
2.091ThrTyr: 2.091 ± 1.097
0.0ThrXaa: 0.0 ± 0.0
Val
2.988ValAla: 2.988 ± 1.168
0.896ValCys: 0.896 ± 0.47
3.287ValAsp: 3.287 ± 0.863
4.183ValGlu: 4.183 ± 1.529
1.494ValPhe: 1.494 ± 0.716
3.287ValGly: 3.287 ± 1.593
1.793ValHis: 1.793 ± 0.683
4.78ValIle: 4.78 ± 0.679
4.183ValLys: 4.183 ± 1.378
7.768ValLeu: 7.768 ± 1.227
2.091ValMet: 2.091 ± 1.097
2.091ValAsn: 2.091 ± 0.8
4.482ValPro: 4.482 ± 1.703
2.091ValGln: 2.091 ± 0.477
4.183ValArg: 4.183 ± 1.173
4.482ValSer: 4.482 ± 1.71
2.091ValThr: 2.091 ± 1.097
7.171ValVal: 7.171 ± 1.041
0.598ValTrp: 0.598 ± 0.313
1.195ValTyr: 1.195 ± 0.627
0.0ValXaa: 0.0 ± 0.0
Trp
2.091TrpAla: 2.091 ± 0.376
0.0TrpCys: 0.0 ± 0.0
0.896TrpAsp: 0.896 ± 0.275
0.896TrpGlu: 0.896 ± 0.275
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.598TrpHis: 0.598 ± 0.313
0.598TrpIle: 0.598 ± 0.314
0.896TrpLys: 0.896 ± 0.275
0.896TrpLeu: 0.896 ± 0.47
0.299TrpMet: 0.299 ± 0.157
0.0TrpAsn: 0.0 ± 0.0
0.299TrpPro: 0.299 ± 0.414
0.896TrpGln: 0.896 ± 0.47
0.598TrpArg: 0.598 ± 0.313
1.195TrpSer: 1.195 ± 0.515
0.896TrpThr: 0.896 ± 0.47
0.896TrpVal: 0.896 ± 0.47
0.0TrpTrp: 0.0 ± 0.0
0.299TrpTyr: 0.299 ± 0.414
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.39TyrAla: 2.39 ± 1.474
0.598TyrCys: 0.598 ± 0.313
1.494TyrAsp: 1.494 ± 1.028
1.494TyrGlu: 1.494 ± 0.367
0.299TyrPhe: 0.299 ± 0.414
1.494TyrGly: 1.494 ± 1.013
0.598TyrHis: 0.598 ± 0.314
2.091TyrIle: 2.091 ± 1.097
3.585TyrLys: 3.585 ± 1.447
3.287TyrLeu: 3.287 ± 1.035
1.195TyrMet: 1.195 ± 0.627
1.195TyrAsn: 1.195 ± 0.627
1.494TyrPro: 1.494 ± 0.421
0.598TyrGln: 0.598 ± 0.314
0.598TyrArg: 0.598 ± 0.314
2.39TyrSer: 2.39 ± 0.835
1.793TyrThr: 1.793 ± 0.336
1.195TyrVal: 1.195 ± 0.319
0.598TyrTrp: 0.598 ± 0.314
1.793TyrTyr: 1.793 ± 0.943
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3348 amino acids)
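The table cells above follow a regular pattern in this text rendering: the value, the two three-letter residue codes, then the same value again with its error. A minimal sketch for reading such lines back into structured data is given below. The regular expression and the function name `parse_cell` are assumptions based only on the layout shown here, not on the format of the downloadable CSV file.

```python
import re

# Pattern for a cell such as "10.158GluLeu: 10.158 ± 2.603".
CELL = re.compile(
    r"^(?P<value>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<mean>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, mean, error) or None for non-cell lines."""
    match = CELL.match(line.strip())
    if match is None:
        return None  # e.g. a row label such as "Ala"
    return (
        match.group("first"),
        match.group("second"),
        float(match.group("mean")),
        float(match.group("error")),
    )


cell = parse_cell("10.158GluLeu: 10.158 ± 2.603")
```

Row labels such as `Ala` do not match the pattern and yield `None`, so they can be skipped when iterating over the lines.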

Note: The error has been estimated by bootstrapping (×100) at the protein level.
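A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled estimates serves as the error. This is a sketch under stated assumptions and not the Proteome-pI implementation. The function names, the toy sequences, the fixed seed, and the use of the population standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over all overlapping windows."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: the error is the population standard deviation of the
    resampled estimates; the page does not say which statistic is used.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy one-letter sequences (hypothetical, not from the Mothra virus proteome).
toy_proteins = ["MKLLA", "MKA", "AAKL", "LLMK"]
error = bootstrap_error(toy_proteins, "MK")
```

With only a handful of proteins, as in this proteome, each resample can differ strongly from the original set, which is one reason the reported errors are large relative to the values.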

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski