Amino acid dipeptide frequency for Anopheles A virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
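As an illustration of the calculation described above, here is a minimal Python sketch that counts dipeptides and reports their frequencies in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the use of one-letter codes, the counting of overlapping adjacent pairs within each protein, and the mapping of any non-standard letter to X (Xaa) are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (not real viral proteins), just to exercise the function.
freqs = dipeptide_per_mille(["MKLLSA", "LLKM"])

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille = 0.25%.
expected_uniform = 1000.0 / 400
```

The toy input contains 8 dipeptides in total, two of which are LL, so `freqs["LL"]` is 250‰. Real proteomes are much larger, which is why the values in the table are only a few per mille.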

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
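The unit conversion and the comparison with the uniform expectation of 0.25% can be sketched as follows. The helper name `describe` and the simple above/below labelling are assumptions for illustration. The exact rule used to colour the table is not stated on this page.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def describe(value_per_mille):
    """Convert a per-mille value to a fraction and compare it with 2.5 per mille."""
    fraction = value_per_mille * 1e-3              # per mille -> fraction
    ratio = value_per_mille / EXPECTED_PER_MILLE   # fold over the uniform expectation
    if ratio > 1:
        label = "over"
    elif ratio < 1:
        label = "under"
    else:
        label = "as expected"
    return fraction, ratio, label


# Two values taken from the table: LeuSer (9.252) and CysCys (0.0).
leu_ser = describe(9.252)
cys_cys = describe(0.0)
```

For LeuSer, 9.252‰ corresponds to a fraction of 0.009252 (about 0.93%), which is roughly 3.7 times the uniform expectation.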

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.028AlaAla: 1.028 ± 0.253
0.514AlaCys: 0.514 ± 0.126
1.285AlaAsp: 1.285 ± 0.551
3.084AlaGlu: 3.084 ± 0.941
1.285AlaPhe: 1.285 ± 0.551
3.341AlaGly: 3.341 ± 1.671
1.028AlaHis: 1.028 ± 0.277
4.626AlaIle: 4.626 ± 0.727
2.056AlaLys: 2.056 ± 0.613
2.827AlaLeu: 2.827 ± 0.828
2.313AlaMet: 2.313 ± 1.009
3.341AlaAsn: 3.341 ± 0.266
1.028AlaPro: 1.028 ± 0.566
1.028AlaGln: 1.028 ± 0.568
1.799AlaArg: 1.799 ± 1.22
3.598AlaSer: 3.598 ± 1.512
2.056AlaThr: 2.056 ± 1.278
2.827AlaVal: 2.827 ± 1.366
0.257AlaTrp: 0.257 ± 0.15
1.799AlaTyr: 1.799 ± 0.436
0.0AlaXaa: 0.0 ± 0.0
Cys
1.542CysAla: 1.542 ± 0.577
0.0CysCys: 0.0 ± 0.0
1.285CysAsp: 1.285 ± 0.238
1.285CysGlu: 1.285 ± 0.798
2.056CysPhe: 2.056 ± 1.489
1.542CysGly: 1.542 ± 1.386
0.257CysHis: 0.257 ± 0.231
2.313CysIle: 2.313 ± 0.704
2.57CysLys: 2.57 ± 1.95
2.57CysLeu: 2.57 ± 0.918
0.771CysMet: 0.771 ± 0.341
1.285CysAsn: 1.285 ± 0.459
1.028CysPro: 1.028 ± 0.568
1.542CysGln: 1.542 ± 0.379
1.028CysArg: 1.028 ± 0.253
1.799CysSer: 1.799 ± 0.908
1.542CysThr: 1.542 ± 0.681
0.514CysVal: 0.514 ± 0.126
0.257CysTrp: 0.257 ± 0.231
2.056CysTyr: 2.056 ± 0.798
0.0CysXaa: 0.0 ± 0.0
Asp
2.313AspAla: 2.313 ± 0.704
2.313AspCys: 2.313 ± 0.467
5.14AspAsp: 5.14 ± 0.315
3.855AspGlu: 3.855 ± 0.714
4.369AspPhe: 4.369 ± 0.954
1.285AspGly: 1.285 ± 0.418
0.514AspHis: 0.514 ± 0.301
6.425AspIle: 6.425 ± 1.21
3.084AspLys: 3.084 ± 0.678
4.369AspLeu: 4.369 ± 1.552
0.514AspMet: 0.514 ± 0.126
3.341AspAsn: 3.341 ± 0.612
3.598AspPro: 3.598 ± 0.745
1.542AspGln: 1.542 ± 1.236
0.771AspArg: 0.771 ± 0.154
2.056AspSer: 2.056 ± 0.86
1.542AspThr: 1.542 ± 0.902
2.827AspVal: 2.827 ± 0.981
0.514AspTrp: 0.514 ± 0.699
3.855AspTyr: 3.855 ± 0.228
0.0AspXaa: 0.0 ± 0.0
Glu
2.056GluAla: 2.056 ± 0.392
1.285GluCys: 1.285 ± 0.459
3.084GluAsp: 3.084 ± 0.758
3.341GluGlu: 3.341 ± 1.646
5.14GluPhe: 5.14 ± 1.425
1.028GluGly: 1.028 ± 0.568
1.285GluHis: 1.285 ± 0.238
7.967GluIle: 7.967 ± 0.074
4.883GluLys: 4.883 ± 0.148
4.883GluLeu: 4.883 ± 1.249
1.542GluMet: 1.542 ± 0.846
3.084GluAsn: 3.084 ± 0.49
1.799GluPro: 1.799 ± 0.711
2.313GluGln: 2.313 ± 0.513
1.799GluArg: 1.799 ± 0.711
4.883GluSer: 4.883 ± 0.932
3.598GluThr: 3.598 ± 1.11
2.827GluVal: 2.827 ± 0.53
0.771GluTrp: 0.771 ± 0.451
2.827GluTyr: 2.827 ± 0.239
0.0GluXaa: 0.0 ± 0.0
Phe
1.799PheAla: 1.799 ± 0.756
1.285PheCys: 1.285 ± 0.551
2.313PheAsp: 2.313 ± 0.513
3.598PheGlu: 3.598 ± 1.11
3.341PhePhe: 3.341 ± 0.894
1.799PheGly: 1.799 ± 0.436
0.257PheHis: 0.257 ± 0.15
5.14PheIle: 5.14 ± 0.953
5.654PheLys: 5.654 ± 1.373
4.883PheLeu: 4.883 ± 2.206
1.028PheMet: 1.028 ± 0.253
2.827PheAsn: 2.827 ± 0.697
1.028PhePro: 1.028 ± 0.647
1.285PheGln: 1.285 ± 0.459
2.313PheArg: 2.313 ± 1.009
4.626PheSer: 4.626 ± 1.137
3.855PheThr: 3.855 ± 0.714
2.827PheVal: 2.827 ± 0.388
0.257PheTrp: 0.257 ± 0.15
1.542PheTyr: 1.542 ± 0.47
0.0PheXaa: 0.0 ± 0.0
Gly
2.056GlyAla: 2.056 ± 1.634
2.313GlyCys: 2.313 ± 1.366
2.313GlyAsp: 2.313 ± 0.462
1.542GlyGlu: 1.542 ± 0.564
1.542GlyPhe: 1.542 ± 0.902
1.285GlyGly: 1.285 ± 0.75
1.285GlyHis: 1.285 ± 0.75
3.084GlyIle: 3.084 ± 0.941
2.827GlyLys: 2.827 ± 0.388
3.855GlyLeu: 3.855 ± 1.09
1.542GlyMet: 1.542 ± 0.564
2.313GlyAsn: 2.313 ± 0.693
2.056GlyPro: 2.056 ± 0.381
2.056GlyGln: 2.056 ± 0.554
1.542GlyArg: 1.542 ± 0.564
3.084GlySer: 3.084 ± 0.678
3.341GlyThr: 3.341 ± 1.621
0.771GlyVal: 0.771 ± 0.671
0.514GlyTrp: 0.514 ± 0.126
1.285GlyTyr: 1.285 ± 0.418
0.0GlyXaa: 0.0 ± 0.0
His
0.771HisAla: 0.771 ± 0.154
0.257HisCys: 0.257 ± 0.15
0.514HisAsp: 0.514 ± 0.301
0.257HisGlu: 0.257 ± 0.15
1.028HisPhe: 1.028 ± 0.277
1.799HisGly: 1.799 ± 0.422
1.542HisHis: 1.542 ± 0.379
2.056HisIle: 2.056 ± 1.278
1.799HisLys: 1.799 ± 0.436
2.313HisLeu: 2.313 ± 0.467
0.514HisMet: 0.514 ± 0.126
1.285HisAsn: 1.285 ± 0.763
0.514HisPro: 0.514 ± 0.126
1.285HisGln: 1.285 ± 0.459
0.514HisArg: 0.514 ± 0.699
1.542HisSer: 1.542 ± 0.308
1.028HisThr: 1.028 ± 0.253
2.313HisVal: 2.313 ± 0.474
0.257HisTrp: 0.257 ± 0.15
1.028HisTyr: 1.028 ± 0.253
0.0HisXaa: 0.0 ± 0.0
Ile
4.112IleAla: 4.112 ± 0.45
3.084IleCys: 3.084 ± 1.704
7.196IleAsp: 7.196 ± 2.22
7.967IleGlu: 7.967 ± 1.642
3.084IlePhe: 3.084 ± 0.338
3.341IleGly: 3.341 ± 0.197
2.827IleHis: 2.827 ± 0.239
7.967IleIle: 7.967 ± 2.433
6.168IleLys: 6.168 ± 1.515
7.453IleLeu: 7.453 ± 1.445
2.57IleMet: 2.57 ± 0.327
6.168IleAsn: 6.168 ± 0.42
4.369IlePro: 4.369 ± 1.715
4.112IleGln: 4.112 ± 1.15
3.855IleArg: 3.855 ± 0.151
7.453IleSer: 7.453 ± 1.523
4.369IleThr: 4.369 ± 0.993
4.883IleVal: 4.883 ± 1.087
0.514IleTrp: 0.514 ± 0.301
4.883IleTyr: 4.883 ± 1.937
0.0IleXaa: 0.0 ± 0.0
Lys
2.57LysAla: 2.57 ± 0.948
3.341LysCys: 3.341 ± 1.589
3.341LysAsp: 3.341 ± 0.266
4.626LysGlu: 4.626 ± 0.946
4.369LysPhe: 4.369 ± 2.336
2.827LysGly: 2.827 ± 0.388
2.827LysHis: 2.827 ± 1.476
7.967LysIle: 7.967 ± 1.143
4.883LysLys: 4.883 ± 0.637
7.967LysLeu: 7.967 ± 0.308
3.341LysMet: 3.341 ± 0.479
6.168LysAsn: 6.168 ± 1.142
1.799LysPro: 1.799 ± 0.349
2.827LysGln: 2.827 ± 0.589
2.57LysArg: 2.57 ± 0.948
5.397LysSer: 5.397 ± 0.524
7.71LysThr: 7.71 ± 0.672
3.341LysVal: 3.341 ± 0.197
0.771LysTrp: 0.771 ± 0.154
2.57LysTyr: 2.57 ± 0.634
0.0LysXaa: 0.0 ± 0.0
Leu
4.369LeuAla: 4.369 ± 1.578
1.542LeuCys: 1.542 ± 0.681
4.369LeuAsp: 4.369 ± 1.905
3.084LeuGlu: 3.084 ± 0.338
4.369LeuPhe: 4.369 ± 1.545
3.084LeuGly: 3.084 ± 0.986
1.285LeuHis: 1.285 ± 0.551
7.453LeuIle: 7.453 ± 1.523
6.682LeuLys: 6.682 ± 1.229
8.224LeuLeu: 8.224 ± 1.533
3.855LeuMet: 3.855 ± 0.88
4.369LeuAsn: 4.369 ± 1.046
3.855LeuPro: 3.855 ± 0.77
3.084LeuGln: 3.084 ± 0.843
3.598LeuArg: 3.598 ± 1.656
9.252LeuSer: 9.252 ± 1.138
5.911LeuThr: 5.911 ± 1.529
3.084LeuVal: 3.084 ± 0.204
0.514LeuTrp: 0.514 ± 0.672
3.598LeuTyr: 3.598 ± 1.11
0.0LeuXaa: 0.0 ± 0.0
Met
1.542MetAla: 1.542 ± 1.236
1.028MetCys: 1.028 ± 0.568
2.056MetAsp: 2.056 ± 0.451
1.542MetGlu: 1.542 ± 0.308
1.285MetPhe: 1.285 ± 0.763
1.028MetGly: 1.028 ± 0.277
0.257MetHis: 0.257 ± 0.15
2.313MetIle: 2.313 ± 0.467
2.056MetLys: 2.056 ± 0.725
3.855MetLeu: 3.855 ± 0.151
1.028MetMet: 1.028 ± 0.277
1.542MetAsn: 1.542 ± 0.308
2.056MetPro: 2.056 ± 0.725
1.285MetGln: 1.285 ± 0.238
1.799MetArg: 1.799 ± 0.638
2.313MetSer: 2.313 ± 1.242
2.313MetThr: 2.313 ± 0.693
0.257MetVal: 0.257 ± 0.15
0.257MetTrp: 0.257 ± 0.15
2.313MetTyr: 2.313 ± 0.474
0.0MetXaa: 0.0 ± 0.0
Asn
3.084AsnAla: 3.084 ± 1.77
1.542AsnCys: 1.542 ± 1.386
4.883AsnAsp: 4.883 ± 1.249
3.598AsnGlu: 3.598 ± 0.681
3.598AsnPhe: 3.598 ± 0.844
0.771AsnGly: 0.771 ± 0.154
1.542AsnHis: 1.542 ± 0.564
3.598AsnIle: 3.598 ± 0.381
4.112AsnLys: 4.112 ± 1.107
4.883AsnLeu: 4.883 ± 0.932
2.57AsnMet: 2.57 ± 0.836
2.57AsnAsn: 2.57 ± 0.476
2.827AsnPro: 2.827 ± 0.999
2.827AsnGln: 2.827 ± 0.981
1.799AsnArg: 1.799 ± 0.638
5.654AsnSer: 5.654 ± 1.178
3.855AsnThr: 3.855 ± 0.963
3.084AsnVal: 3.084 ± 1.362
0.514AsnTrp: 0.514 ± 0.301
3.855AsnTyr: 3.855 ± 0.883
0.0AsnXaa: 0.0 ± 0.0
Pro
1.028ProAla: 1.028 ± 1.321
0.257ProCys: 0.257 ± 0.15
2.056ProAsp: 2.056 ± 0.451
3.341ProGlu: 3.341 ± 0.889
1.285ProPhe: 1.285 ± 0.238
2.313ProGly: 2.313 ± 1.108
1.285ProHis: 1.285 ± 0.75
4.883ProIle: 4.883 ± 0.932
3.341ProLys: 3.341 ± 0.889
2.827ProLeu: 2.827 ± 1.893
1.285ProMet: 1.285 ± 0.418
2.056ProAsn: 2.056 ± 0.505
0.514ProPro: 0.514 ± 0.672
0.514ProGln: 0.514 ± 0.301
1.542ProArg: 1.542 ± 0.681
1.799ProSer: 1.799 ± 0.711
1.542ProThr: 1.542 ± 0.379
1.285ProVal: 1.285 ± 0.418
0.514ProTrp: 0.514 ± 0.126
0.771ProTyr: 0.771 ± 0.451
0.0ProXaa: 0.0 ± 0.0
Gln
1.285GlnAla: 1.285 ± 0.418
1.028GlnCys: 1.028 ± 0.568
1.799GlnAsp: 1.799 ± 0.638
1.028GlnGlu: 1.028 ± 0.253
2.313GlnPhe: 2.313 ± 0.474
2.57GlnGly: 2.57 ± 0.836
0.257GlnHis: 0.257 ± 0.15
4.883GlnIle: 4.883 ± 0.73
3.855GlnLys: 3.855 ± 1.251
2.313GlnLeu: 2.313 ± 1.891
0.771GlnMet: 0.771 ± 0.693
2.57GlnAsn: 2.57 ± 0.836
0.257GlnPro: 0.257 ± 0.15
0.257GlnGln: 0.257 ± 0.15
2.056GlnArg: 2.056 ± 0.725
2.313GlnSer: 2.313 ± 0.462
1.799GlnThr: 1.799 ± 0.422
3.598GlnVal: 3.598 ± 0.378
0.0GlnTrp: 0.0 ± 0.0
1.028GlnTyr: 1.028 ± 0.277
0.0GlnXaa: 0.0 ± 0.0
Arg
1.799ArgAla: 1.799 ± 0.436
1.028ArgCys: 1.028 ± 0.253
2.056ArgAsp: 2.056 ± 0.381
3.855ArgGlu: 3.855 ± 1.62
2.056ArgPhe: 2.056 ± 0.381
0.257ArgGly: 0.257 ± 0.231
1.028ArgHis: 1.028 ± 0.601
3.341ArgIle: 3.341 ± 1.341
3.598ArgLys: 3.598 ± 0.741
3.341ArgLeu: 3.341 ± 0.612
1.028ArgMet: 1.028 ± 0.556
2.313ArgAsn: 2.313 ± 0.693
1.285ArgPro: 1.285 ± 0.238
0.771ArgGln: 0.771 ± 0.671
0.514ArgArg: 0.514 ± 0.301
3.341ArgSer: 3.341 ± 0.612
2.056ArgThr: 2.056 ± 1.295
1.799ArgVal: 1.799 ± 1.245
0.257ArgTrp: 0.257 ± 0.15
2.056ArgTyr: 2.056 ± 0.451
0.0ArgXaa: 0.0 ± 0.0
Ser
2.827SerAla: 2.827 ± 1.142
2.313SerCys: 2.313 ± 1.366
4.369SerAsp: 4.369 ± 0.678
3.855SerGlu: 3.855 ± 1.571
3.341SerPhe: 3.341 ± 0.713
3.084SerGly: 3.084 ± 1.362
1.799SerHis: 1.799 ± 0.349
7.453SerIle: 7.453 ± 1.715
8.738SerLys: 8.738 ± 1.325
7.453SerLeu: 7.453 ± 0.688
2.827SerMet: 2.827 ± 1.141
4.626SerAsn: 4.626 ± 0.226
2.313SerPro: 2.313 ± 1.108
2.827SerGln: 2.827 ± 1.308
4.112SerArg: 4.112 ± 1.024
5.654SerSer: 5.654 ± 1.373
4.369SerThr: 4.369 ± 0.833
4.369SerVal: 4.369 ± 0.678
0.257SerTrp: 0.257 ± 0.231
0.771SerTyr: 0.771 ± 0.451
0.0SerXaa: 0.0 ± 0.0
Thr
3.341ThrAla: 3.341 ± 0.197
1.028ThrCys: 1.028 ± 0.253
2.827ThrAsp: 2.827 ± 0.828
3.598ThrGlu: 3.598 ± 0.697
2.056ThrPhe: 2.056 ± 0.86
3.084ThrGly: 3.084 ± 0.758
1.542ThrHis: 1.542 ± 0.47
6.168ThrIle: 6.168 ± 1.231
6.425ThrLys: 6.425 ± 1.54
4.112ThrLeu: 4.112 ± 2.608
1.285ThrMet: 1.285 ± 0.238
4.369ThrAsn: 4.369 ± 1.046
2.57ThrPro: 2.57 ± 0.631
0.771ThrGln: 0.771 ± 0.154
2.056ThrArg: 2.056 ± 0.381
4.883ThrSer: 4.883 ± 0.909
4.112ThrThr: 4.112 ± 1.284
3.341ThrVal: 3.341 ± 0.938
1.028ThrTrp: 1.028 ± 0.647
3.084ThrTyr: 3.084 ± 0.83
0.0ThrXaa: 0.0 ± 0.0
Val
2.056ValAla: 2.056 ± 0.613
2.313ValCys: 2.313 ± 1.366
1.799ValAsp: 1.799 ± 1.176
3.855ValGlu: 3.855 ± 2.53
2.57ValPhe: 2.57 ± 0.631
2.57ValGly: 2.57 ± 0.918
0.771ValHis: 0.771 ± 0.451
4.112ValIle: 4.112 ± 0.761
4.626ValLys: 4.626 ± 1.24
3.084ValLeu: 3.084 ± 0.83
1.285ValMet: 1.285 ± 0.751
2.827ValAsn: 2.827 ± 0.697
0.771ValPro: 0.771 ± 0.341
2.056ValGln: 2.056 ± 0.392
1.799ValArg: 1.799 ± 0.756
4.369ValSer: 4.369 ± 0.433
3.084ValThr: 3.084 ± 0.49
1.542ValVal: 1.542 ± 0.308
0.257ValTrp: 0.257 ± 0.231
2.056ValTyr: 2.056 ± 0.554
0.0ValXaa: 0.0 ± 0.0
Trp
0.514TrpAla: 0.514 ± 0.126
0.0TrpCys: 0.0 ± 0.0
0.514TrpAsp: 0.514 ± 0.672
0.771TrpGlu: 0.771 ± 0.671
0.771TrpPhe: 0.771 ± 0.154
0.514TrpGly: 0.514 ± 0.462
0.257TrpHis: 0.257 ± 0.231
0.514TrpIle: 0.514 ± 0.126
0.514TrpLys: 0.514 ± 0.126
0.771TrpLeu: 0.771 ± 0.451
0.514TrpMet: 0.514 ± 0.672
0.514TrpAsn: 0.514 ± 0.301
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.257TrpArg: 0.257 ± 0.231
1.285TrpSer: 1.285 ± 0.418
0.257TrpThr: 0.257 ± 0.15
0.257TrpVal: 0.257 ± 0.15
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.028TyrAla: 1.028 ± 0.277
0.771TyrCys: 0.771 ± 0.341
1.028TyrAsp: 1.028 ± 0.253
2.313TyrGlu: 2.313 ± 0.693
1.799TyrPhe: 1.799 ± 0.711
3.084TyrGly: 3.084 ± 1.402
0.771TyrHis: 0.771 ± 0.341
4.112TyrIle: 4.112 ± 1.188
3.341TyrLys: 3.341 ± 1.257
3.598TyrLeu: 3.598 ± 1.028
1.285TyrMet: 1.285 ± 1.335
3.598TyrAsn: 3.598 ± 0.844
0.771TyrPro: 0.771 ± 0.618
3.598TyrGln: 3.598 ± 0.745
2.056TyrArg: 2.056 ± 0.505
2.313TyrSer: 2.313 ± 0.693
3.341TyrThr: 3.341 ± 1.275
2.056TyrVal: 2.056 ± 0.381
0.514TyrTrp: 0.514 ± 0.301
1.028TyrTyr: 1.028 ± 0.253
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3892 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
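One way to read the note above is sketched below: proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported as the error. This is only a sketch under assumptions. The page does not say which spread statistic is used, so the standard deviation here is a guess, and the function names and the fixed random seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per-mille frequency over resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input: three identical proteins, so every resample gives the same frequency.
mean_aa, err_aa = bootstrap_error(["AAKK", "AAKK", "AAKK"], "AA")
```

With only 3 proteins, as in this proteome, each resample is drawn from a very small pool, so the estimated errors can be large relative to the values themselves.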

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski