
FA*IR Method for Neural Team Formation #80

Closed
Hamedloghmani opened this issue Aug 14, 2023 · 7 comments
Assignees
Labels
enhancement New feature or request experiment

Comments

@Hamedloghmani
Member

Since the paper summarized in #66 seemed sufficient for our problem in theory, I decided to prototype an implementation to see the results on a partial or even toy dataset.
The following diagram shows my proposed pipeline for Adila, based on the FA*IR algorithm:
[pipeline diagram]

Please note that this is an ongoing process and I will update each step in this thread.

@Hamedloghmani Hamedloghmani added the enhancement New feature or request label Aug 14, 2023
@Hamedloghmani Hamedloghmani self-assigned this Aug 14, 2023
Hamedloghmani added a commit that referenced this issue Aug 14, 2023
@hosseinfani
Member

@Hamedloghmani
I'll be in the lab around 3pm today to discuss the flow and the code.

@Hamedloghmani
Member Author

@hosseinfani
Thanks a lot, see you soon.

@Hamedloghmani
Member Author

@hosseinfani
The following table contains my early experiment results on the imdb dataset. I used unigram_b outputs for the bnn and bnn_emb baselines. The experiment was done with k (top-k) = 100 and a significance level of 0.08. The significance level is the threshold at which evidence is considered strong enough to reject the null hypothesis; choosing it involves a trade-off between being cautious about false claims (Type I errors) and being open to detecting true effects (avoiding Type II errors).
To the best of my knowledge, these early results demonstrate negligible changes in utility (in terms of MAP10 and NDCG10) while boosting fairness, which in our problem means lowering the NDKL metric. If you kindly approve and confirm the validity of my pipeline and methodology, I can proceed with the experiment on dblp as well.
One key part of this new pipeline is that we only rerank when a team is determined to be unfair (which is decided by the is_fair() function from the fa*ir library).
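To make the gating step concrete, here is a minimal, self-contained sketch of an FA*IR-style ranked-group-fairness test: every prefix of the ranking must contain at least the minimum number of protected candidates that survives a binomial test at significance level alpha. This is a simplified stand-in for the fa*ir library's is_fair() (the real library additionally applies a multiple-testing correction to alpha, which this sketch omits):

```python
import math

def binom_cdf(x: int, n: int, p: float) -> float:
    """P[X <= x] for X ~ Binomial(n, p)."""
    return sum(math.comb(n, i) * p**i * (1 - p)**(n - i) for i in range(x + 1))

def min_protected(prefix: int, p: float, alpha: float) -> int:
    """Smallest count m of protected items in the first `prefix`
    positions that is NOT rejected at level alpha, i.e. the smallest
    m with Binom-CDF(m; prefix, p) > alpha."""
    m = 0
    while binom_cdf(m, prefix, p) <= alpha:
        m += 1
    return m

def is_fair(is_protected: list[bool], p: float, alpha: float) -> bool:
    """FA*IR-style check: every prefix of the ranking must contain at
    least the minimum required number of protected candidates."""
    count = 0
    for i, prot in enumerate(is_protected, start=1):
        count += prot
        if count < min_protected(i, p, alpha):
            return False
    return True
```

In the pipeline, a team's ranked member list would only be handed to the reranker when this check returns False; otherwise the color-blind ranking is kept as-is.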

  • In the literature, color-blind ranking means sorting by score or probability alone, ignoring fairness considerations.
  • I picked 0.08 as the significance level since it is a common choice in the literature.
| Dataset | Fairness Notion | Baseline | k | Significance Level | Reranking Algorithm | NDKL (Color Blind) | NDKL (After) | MAP10 (Color Blind) | MAP10 (After) | NDCG10 (Color Blind) | NDCG10 (After) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| imdb | Demographic Parity | random | 100 | 0.08 | fa*ir | 0.007230588657 | 0.06924057782 | 0.001588093065 | 0.0012440639 | 0.003662102291 | 0.003086464484 |
| imdb | Demographic Parity | bnn | 100 | 0.08 | fa*ir | 0.2316633978 | 0.1792892199 | 0.00466983802 | 0.004678485827 | 0.01057994689 | 0.01059885315 |
| imdb | Demographic Parity | bnn_emb | 100 | 0.08 | fa*ir | 0.2779183553 | 0.182014503 | 0.005727984715 | 0.005727984715 | 0.0126618403 | 0.0126618403 |
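Since NDKL is the fairness metric reported above, here is a short sketch of one common formulation (normalized discounted KL divergence over ranking prefixes, as in Geyik et al.); the exact normalization in Adila's implementation may differ:

```python
import math

def kl_divergence(p: list[float], q: list[float]) -> float:
    """KL(p || q) in bits, with the convention 0 * log(0/q) = 0."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def ndkl(groups: list[int], desired: list[float]) -> float:
    """Normalized Discounted KL divergence of a ranking.

    groups[i] -- group id (0-based) of the candidate at rank i
    desired   -- desired distribution over group ids
    Lower is fairer; 0 means every prefix matches `desired` exactly.
    """
    counts = [0] * len(desired)
    z = total = 0.0
    for i, g in enumerate(groups, start=1):
        counts[g] += 1
        top_i = [c / i for c in counts]          # group distribution in top i
        w = 1.0 / math.log2(i + 1)               # logarithmic position discount
        total += w * kl_divergence(top_i, desired)
        z += w
    return total / z
```

For example, with a desired 50/50 split, an alternating ranking [0, 1, 0, 1] scores a lower (fairer) NDKL than the blocked ranking [0, 0, 1, 1].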

@hosseinfani
Member

@Hamedloghmani
Thank you. As we discussed during office hour, please

  • choose 0.05 as the significance level, and after that repeat with 0.01
  • run experiments on equality of odds (the other notion of fairness)
  • consider a second fairness metric

Please continue to log your progress here as you perform the experiments.

@Hamedloghmani
Member Author

Hi, just wanted to share an update on the recent commits, although I have noted them in #47 as well.

  • The initial integration of fa*ir into the pipeline is finished.
  • The skew implementation will be pushed tonight or tomorrow (5 days ahead of our initial schedule). Afterwards, I'll dedicate a day to finding potential bugs and cleaning up the code written over the past few days.

I will start running the experiments after we finalize the above bullet points.
Thank you.

@hosseinfani
Member

@Hamedloghmani
let me know when we can review the code together. thanks.

@Hamedloghmani
Member Author

The last commit contains the initial implementation of Skew in the pipeline. I am writing this comment to explain how to interpret Skew, for ease of reference.
Skew is the logarithm of the ratio between the percentage of candidates with a particular attribute value among the top k ranked results and the desired percentage for that attribute value. A negative Skew for an attribute value a indicates under-representation of candidates with the sensitive attribute a in the top k results, while a positive Skew implies a preference for such candidates. The logarithm makes Skew values symmetric around zero with respect to ratios in favor of or against a specific attribute value a: for instance, ratios of 2 and 0.5 yield Skew values of the same magnitude but opposite signs.
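The definition above translates directly into a few lines of code. This is an illustrative sketch (the function name and signature are mine, not Adila's actual API):

```python
import math

def skew(groups: list[str], attr: str, desired: dict[str, float], k: int) -> float:
    """Skew of attribute value `attr` in the top-k of a ranking:
    log of (observed share of `attr` in the top k) over its desired
    share. Negative -> under-represented, positive -> over-represented."""
    top_k = groups[:k]
    observed = top_k.count(attr) / k
    return math.log(observed / desired[attr])
```

For example, if "f" makes up 25% of the top 4 against a desired 50% (ratio 0.5), and "m" makes up 75% against a desired 37.5% (ratio 2), the two Skew values are -log(2) and +log(2): equal magnitude, opposite signs.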

Hamedloghmani added a commit that referenced this issue Oct 5, 2023