Amino acid dipeptide frequency for Simian immunodeficiency virus - agm

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
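As an illustration of what a dipeptide frequency is, here is a minimal sketch in Python. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein and never across protein boundaries; the exact counting convention used by the database is not stated on this page. The function name and the toy sequences (one-letter codes) are hypothetical and are not taken from this proteome.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and no pair is formed across the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale relative frequencies to per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, for illustration only.
toy_proteins = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy_proteins)
```

In this toy input there are six dipeptides in total, two of which are "KK", so "KK" accounts for one third of all pairs.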

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
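The comparison against the uniform expectation can be sketched as follows. This is only an illustration: it assumes a dipeptide counts as overrepresented when its value exceeds 1/400 (2.5‰) and underrepresented when it falls below, which may differ from the exact rule used for the red and blue marking on the page. The function names are hypothetical; the three example values are taken from the table.

```python
def expected_uniform_per_mille(n_residue_types=20):
    """Expected frequency (per mille) if all ordered pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected = expected_uniform_per_mille()   # 2.5 per mille, i.e. 0.25%
expected_fraction = expected * 1e-3       # per mille -> fraction

# Values in per mille, copied from the table.
observed = {"GluGlu": 9.114, "CysCys": 0.314, "AlaCys": 2.514}
labels = {pair: classify(value, expected) for pair, value in observed.items()}
```

With Xaa counted as a 21st residue type, `expected_uniform_per_mille(21)` gives roughly 2.27‰ instead of 2.5‰.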

For more information, see the sequence space article on Wikipedia.

Dipeptides are listed by first residue; second residues follow the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa. Each entry gives frequency ± error (‰).

Ala
AlaAla: 5.657 ± 1.874
AlaCys: 2.514 ± 0.736
AlaAsp: 1.886 ± 0.592
AlaGlu: 3.457 ± 1.051
AlaPhe: 0.943 ± 0.461
AlaGly: 5.657 ± 1.17
AlaHis: 1.257 ± 0.398
AlaIle: 3.143 ± 0.786
AlaLys: 3.143 ± 1.166
AlaLeu: 5.028 ± 0.878
AlaMet: 0.943 ± 0.572
AlaAsn: 1.257 ± 0.913
AlaPro: 3.143 ± 1.012
AlaGln: 4.714 ± 1.151
AlaArg: 3.457 ± 1.143
AlaSer: 1.886 ± 0.536
AlaThr: 2.2 ± 0.45
AlaVal: 4.085 ± 0.785
AlaTrp: 2.828 ± 1.122
AlaTyr: 2.2 ± 0.403
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.943 ± 0.496
CysCys: 0.314 ± 0.302
CysAsp: 0.629 ± 0.25
CysGlu: 0.629 ± 0.383
CysPhe: 2.2 ± 1.163
CysGly: 1.571 ± 0.823
CysHis: 1.571 ± 0.582
CysIle: 1.571 ± 0.582
CysLys: 3.143 ± 1.118
CysLeu: 0.943 ± 0.365
CysMet: 0.314 ± 0.246
CysAsn: 1.886 ± 0.666
CysPro: 0.629 ± 0.44
CysGln: 2.2 ± 0.582
CysArg: 1.257 ± 0.423
CysSer: 0.314 ± 0.246
CysThr: 1.571 ± 0.582
CysVal: 1.886 ± 0.653
CysTrp: 0.314 ± 0.22
CysTyr: 0.629 ± 0.354
CysXaa: 0.0 ± 0.0

Asp
AspAla: 0.943 ± 0.402
AspCys: 0.943 ± 0.394
AspAsp: 2.514 ± 0.844
AspGlu: 2.514 ± 0.448
AspPhe: 1.257 ± 0.493
AspGly: 2.828 ± 0.722
AspHis: 1.257 ± 0.362
AspIle: 2.2 ± 0.902
AspLys: 1.571 ± 0.504
AspLeu: 2.2 ± 0.773
AspMet: 1.257 ± 1.11
AspAsn: 1.571 ± 0.683
AspPro: 4.085 ± 1.245
AspGln: 2.2 ± 1.174
AspArg: 1.571 ± 0.732
AspSer: 2.828 ± 0.837
AspThr: 1.886 ± 0.46
AspVal: 0.629 ± 0.284
AspTrp: 0.943 ± 0.528
AspTyr: 1.886 ± 0.424
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.343 ± 0.596
GluCys: 1.257 ± 0.507
GluAsp: 3.457 ± 0.689
GluGlu: 9.114 ± 2.04
GluPhe: 1.571 ± 0.337
GluGly: 7.542 ± 1.507
GluHis: 1.571 ± 0.731
GluIle: 4.085 ± 0.561
GluLys: 7.542 ± 1.693
GluLeu: 5.028 ± 1.252
GluMet: 1.886 ± 0.857
GluAsn: 1.571 ± 0.571
GluPro: 3.457 ± 1.199
GluGln: 4.714 ± 0.852
GluArg: 7.228 ± 1.878
GluSer: 1.886 ± 0.46
GluThr: 2.514 ± 0.942
GluVal: 3.771 ± 0.484
GluTrp: 2.828 ± 0.888
GluTyr: 0.629 ± 0.355
GluXaa: 0.0 ± 0.0

Phe
PheAla: 0.943 ± 0.739
PheCys: 1.571 ± 0.482
PheAsp: 0.943 ± 0.528
PheGlu: 2.2 ± 0.403
PhePhe: 0.943 ± 0.402
PheGly: 2.514 ± 1.238
PheHis: 0.314 ± 0.246
PheIle: 0.314 ± 0.22
PheLys: 2.2 ± 0.808
PheLeu: 4.4 ± 0.505
PheMet: 0.0 ± 0.0
PheAsn: 2.2 ± 0.673
PhePro: 0.629 ± 0.456
PheGln: 1.571 ± 0.447
PheArg: 1.886 ± 0.864
PheSer: 0.943 ± 0.751
PheThr: 0.629 ± 0.355
PheVal: 1.571 ± 0.469
PheTrp: 0.943 ± 0.68
PheTyr: 1.257 ± 0.707
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.028 ± 0.942
GlyCys: 1.257 ± 0.777
GlyAsp: 2.2 ± 0.447
GlyGlu: 5.343 ± 1.241
GlyPhe: 3.771 ± 1.05
GlyGly: 6.914 ± 1.539
GlyHis: 1.886 ± 0.578
GlyIle: 7.228 ± 1.878
GlyLys: 5.657 ± 2.048
GlyLeu: 7.228 ± 0.659
GlyMet: 1.257 ± 0.271
GlyAsn: 4.4 ± 0.951
GlyPro: 4.085 ± 1.269
GlyGln: 4.085 ± 1.462
GlyArg: 4.4 ± 1.889
GlySer: 4.714 ± 0.546
GlyThr: 3.143 ± 0.746
GlyVal: 2.2 ± 0.606
GlyTrp: 1.257 ± 0.662
GlyTyr: 2.828 ± 0.791
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.314 ± 0.37
HisCys: 1.571 ± 0.522
HisAsp: 0.943 ± 0.446
HisGlu: 1.257 ± 0.301
HisPhe: 1.571 ± 1.106
HisGly: 0.629 ± 0.411
HisHis: 0.0 ± 0.0
HisIle: 2.514 ± 0.523
HisLys: 1.571 ± 0.842
HisLeu: 2.828 ± 1.229
HisMet: 0.314 ± 0.32
HisAsn: 0.629 ± 0.44
HisPro: 1.886 ± 0.441
HisGln: 0.629 ± 0.337
HisArg: 1.257 ± 0.552
HisSer: 0.629 ± 0.25
HisThr: 1.257 ± 0.597
HisVal: 0.943 ± 0.394
HisTrp: 0.314 ± 0.346
HisTyr: 0.629 ± 0.691
HisXaa: 0.0 ± 0.0

Ile
IleAla: 0.943 ± 0.212
IleCys: 1.257 ± 0.88
IleAsp: 1.257 ± 0.72
IleGlu: 4.085 ± 1.6
IlePhe: 0.314 ± 0.22
IleGly: 5.971 ± 1.434
IleHis: 3.457 ± 0.623
IleIle: 5.343 ± 0.964
IleLys: 5.028 ± 1.939
IleLeu: 5.343 ± 1.091
IleMet: 0.314 ± 0.246
IleAsn: 3.457 ± 0.65
IlePro: 3.457 ± 1.806
IleGln: 2.828 ± 1.166
IleArg: 3.457 ± 0.667
IleSer: 0.943 ± 0.482
IleThr: 1.886 ± 0.618
IleVal: 3.771 ± 0.495
IleTrp: 3.143 ± 0.892
IleTyr: 1.571 ± 0.745
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.4 ± 1.102
LysCys: 2.2 ± 0.801
LysAsp: 2.514 ± 0.76
LysGlu: 7.228 ± 1.798
LysPhe: 2.2 ± 0.483
LysGly: 6.6 ± 1.577
LysHis: 0.943 ± 0.212
LysIle: 5.028 ± 1.925
LysLys: 5.028 ± 1.127
LysLeu: 7.857 ± 1.315
LysMet: 1.571 ± 0.749
LysAsn: 3.143 ± 1.861
LysPro: 1.571 ± 0.64
LysGln: 5.657 ± 0.812
LysArg: 4.085 ± 1.092
LysSer: 1.257 ± 0.674
LysThr: 4.714 ± 1.221
LysVal: 3.771 ± 1.205
LysTrp: 1.257 ± 0.501
LysTyr: 1.886 ± 0.523
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.028 ± 1.598
LeuCys: 1.257 ± 0.747
LeuAsp: 2.828 ± 0.589
LeuGlu: 7.857 ± 1.005
LeuPhe: 2.514 ± 0.825
LeuGly: 5.343 ± 0.984
LeuHis: 1.571 ± 0.905
LeuIle: 3.143 ± 0.708
LeuLys: 5.343 ± 0.962
LeuLeu: 8.171 ± 1.148
LeuMet: 1.257 ± 0.271
LeuAsn: 5.657 ± 1.29
LeuPro: 3.457 ± 0.715
LeuGln: 7.228 ± 0.949
LeuArg: 5.028 ± 0.759
LeuSer: 5.657 ± 1.194
LeuThr: 4.4 ± 1.149
LeuVal: 5.343 ± 1.483
LeuTrp: 2.514 ± 0.971
LeuTyr: 1.886 ± 0.532
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.2 ± 0.403
MetCys: 0.314 ± 0.346
MetAsp: 1.571 ± 0.457
MetGlu: 1.257 ± 0.875
MetPhe: 0.629 ± 0.504
MetGly: 2.2 ± 0.483
MetHis: 0.943 ± 0.606
MetIle: 0.943 ± 0.479
MetLys: 0.314 ± 0.32
MetLeu: 1.886 ± 0.688
MetMet: 0.314 ± 0.32
MetAsn: 0.943 ± 0.446
MetPro: 0.314 ± 0.413
MetGln: 0.629 ± 0.641
MetArg: 0.629 ± 0.377
MetSer: 0.629 ± 0.419
MetThr: 2.2 ± 0.708
MetVal: 0.629 ± 0.641
MetTrp: 0.0 ± 0.0
MetTyr: 0.314 ± 0.32
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.771 ± 1.245
AsnCys: 2.828 ± 0.751
AsnAsp: 1.886 ± 1.155
AsnGlu: 1.571 ± 0.636
AsnPhe: 2.2 ± 0.658
AsnGly: 2.514 ± 0.68
AsnHis: 0.314 ± 0.37
AsnIle: 2.2 ± 1.117
AsnLys: 2.514 ± 1.097
AsnLeu: 3.457 ± 1.324
AsnMet: 1.571 ± 0.676
AsnAsn: 3.143 ± 1.31
AsnPro: 2.2 ± 1.423
AsnGln: 2.514 ± 1.101
AsnArg: 1.571 ± 0.46
AsnSer: 2.2 ± 0.762
AsnThr: 2.514 ± 0.881
AsnVal: 1.886 ± 0.653
AsnTrp: 1.571 ± 0.336
AsnTyr: 3.143 ± 0.977
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.143 ± 0.828
ProCys: 1.886 ± 0.687
ProAsp: 1.257 ± 0.68
ProGlu: 3.143 ± 0.687
ProPhe: 1.257 ± 0.602
ProGly: 4.4 ± 0.959
ProHis: 0.314 ± 0.22
ProIle: 1.886 ± 1.019
ProLys: 1.886 ± 0.648
ProLeu: 5.657 ± 0.466
ProMet: 1.257 ± 0.628
ProAsn: 1.257 ± 0.581
ProPro: 3.771 ± 2.121
ProGln: 2.828 ± 0.47
ProArg: 5.971 ± 2.887
ProSer: 2.828 ± 0.481
ProThr: 3.143 ± 0.641
ProVal: 4.085 ± 1.151
ProTrp: 0.943 ± 0.549
ProTyr: 1.571 ± 0.645
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.771 ± 0.782
GlnCys: 0.629 ± 0.25
GlnAsp: 1.886 ± 0.488
GlnGlu: 5.028 ± 0.988
GlnPhe: 0.629 ± 0.409
GlnGly: 5.971 ± 1.576
GlnHis: 1.571 ± 0.81
GlnIle: 4.4 ± 1.188
GlnLys: 7.542 ± 1.219
GlnLeu: 4.714 ± 1.52
GlnMet: 2.2 ± 0.995
GlnAsn: 2.2 ± 0.746
GlnPro: 1.571 ± 0.482
GlnGln: 3.771 ± 1.365
GlnArg: 2.2 ± 0.38
GlnSer: 2.828 ± 0.899
GlnThr: 2.828 ± 1.05
GlnVal: 3.771 ± 1.371
GlnTrp: 2.828 ± 1.15
GlnTyr: 3.457 ± 0.592
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.143 ± 1.063
ArgCys: 1.257 ± 0.768
ArgAsp: 2.514 ± 0.747
ArgGlu: 7.542 ± 1.155
ArgPhe: 1.257 ± 0.719
ArgGly: 3.771 ± 1.656
ArgHis: 1.257 ± 0.787
ArgIle: 3.143 ± 0.529
ArgLys: 2.828 ± 1.092
ArgLeu: 4.4 ± 0.967
ArgMet: 0.629 ± 0.411
ArgAsn: 2.514 ± 1.296
ArgPro: 3.771 ± 0.842
ArgGln: 4.4 ± 1.14
ArgArg: 6.914 ± 3.883
ArgSer: 2.2 ± 0.713
ArgThr: 1.886 ± 0.917
ArgVal: 4.4 ± 0.949
ArgTrp: 1.571 ± 0.788
ArgTyr: 3.457 ± 0.942
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 1.886 ± 0.716
SerCys: 0.314 ± 0.246
SerAsp: 2.2 ± 0.587
SerGlu: 3.457 ± 1.131
SerPhe: 0.314 ± 0.246
SerGly: 3.457 ± 0.691
SerHis: 0.314 ± 0.37
SerIle: 2.2 ± 1.253
SerLys: 3.771 ± 1.136
SerLeu: 2.828 ± 1.202
SerMet: 0.943 ± 0.665
SerAsn: 1.571 ± 0.582
SerPro: 2.514 ± 0.98
SerGln: 2.514 ± 0.436
SerArg: 3.143 ± 1.146
SerSer: 0.629 ± 0.383
SerThr: 3.771 ± 1.736
SerVal: 3.143 ± 1.124
SerTrp: 2.2 ± 0.922
SerTyr: 0.629 ± 0.377
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.714 ± 0.58
ThrCys: 0.314 ± 0.246
ThrAsp: 1.571 ± 0.768
ThrGlu: 3.143 ± 1.108
ThrPhe: 1.257 ± 0.446
ThrGly: 4.4 ± 1.159
ThrHis: 0.314 ± 0.22
ThrIle: 2.514 ± 1.03
ThrLys: 1.886 ± 0.532
ThrLeu: 4.4 ± 0.609
ThrMet: 0.314 ± 0.346
ThrAsn: 1.886 ± 0.891
ThrPro: 4.714 ± 0.657
ThrGln: 3.457 ± 1.034
ThrArg: 1.257 ± 0.868
ThrSer: 2.828 ± 1.136
ThrThr: 5.028 ± 1.588
ThrVal: 4.714 ± 2.044
ThrTrp: 1.257 ± 0.568
ThrTyr: 1.571 ± 0.733
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.828 ± 0.821
ValCys: 0.629 ± 0.337
ValAsp: 2.514 ± 0.904
ValGlu: 4.4 ± 0.738
ValPhe: 1.257 ± 0.446
ValGly: 2.2 ± 0.721
ValHis: 1.886 ± 0.464
ValIle: 3.457 ± 0.727
ValLys: 5.971 ± 1.164
ValLeu: 4.714 ± 0.887
ValMet: 0.629 ± 0.355
ValAsn: 1.886 ± 0.438
ValPro: 4.714 ± 0.87
ValGln: 4.4 ± 0.604
ValArg: 3.143 ± 0.82
ValSer: 2.514 ± 0.66
ValThr: 3.143 ± 1.46
ValVal: 1.886 ± 0.523
ValTrp: 2.514 ± 0.719
ValTyr: 1.257 ± 0.597
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.886 ± 0.577
TrpCys: 0.629 ± 0.493
TrpAsp: 1.886 ± 0.982
TrpGlu: 2.2 ± 0.634
TrpPhe: 1.257 ± 0.986
TrpGly: 2.514 ± 0.971
TrpHis: 0.314 ± 0.346
TrpIle: 0.943 ± 0.564
TrpLys: 3.457 ± 1.058
TrpLeu: 2.514 ± 1.176
TrpMet: 1.257 ± 0.678
TrpAsn: 1.257 ± 0.761
TrpPro: 1.257 ± 0.88
TrpGln: 1.886 ± 0.562
TrpArg: 2.2 ± 0.693
TrpSer: 1.257 ± 0.467
TrpThr: 0.943 ± 0.546
TrpVal: 1.571 ± 0.649
TrpTrp: 0.629 ± 0.44
TrpTyr: 0.943 ± 0.319
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.514 ± 0.692
TyrCys: 1.571 ± 0.748
TyrAsp: 0.943 ± 0.564
TyrGlu: 1.571 ± 0.571
TyrPhe: 0.314 ± 0.346
TyrGly: 1.886 ± 0.303
TyrHis: 0.629 ± 0.373
TyrIle: 1.571 ± 0.646
TyrLys: 2.514 ± 0.781
TyrLeu: 1.886 ± 0.666
TyrMet: 0.629 ± 0.25
TyrAsn: 3.143 ± 0.864
TyrPro: 1.257 ± 0.536
TyrGln: 1.571 ± 0.457
TyrArg: 2.2 ± 1.066
TyrSer: 2.828 ± 0.636
TyrThr: 1.886 ± 0.577
TyrVal: 1.886 ± 0.322
TyrTrp: 0.943 ± 0.319
TyrTyr: 1.886 ± 0.927
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 8 proteins (3183 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
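A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration, not the database's actual procedure: it resamples whole proteins with replacement 100 times, recomputes the frequency of one dipeptide each time, and reports the sample standard deviation of those estimates. Whether the published error is a standard deviation or another measure of spread is not stated here. All function names and the toy sequences (one-letter codes) are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (per mille) over overlapping within-protein pairs."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteins, for illustration only.
toy_proteins = ["MKKLA", "AKK", "GGKKA"]
kk_error = bootstrap_error(toy_proteins, "KK")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects how much the estimate depends on which proteins happen to be in the set.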

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski